...
Scrubber can use different classifier implementations without recompiling the software.
By default, Scrubber dynamically loads the widely used WEKA C4.5 decision tree classifier, which supports multi-class classification.
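To illustrate the plugin idea only, here is a minimal Python sketch of loading a classifier by name at runtime. It is not Scrubber's actual loading code and does not use WEKA: the `load_classifier` helper, the `module:ClassName` spec format, the `fit`/`predict` contract, the `demo_plugins` module name, and the toy `MajorityClassifier` are all assumptions made for this example.

```python
import importlib
import sys
import types


def load_classifier(spec: str, **kwargs):
    """Instantiate a classifier from a 'package.module:ClassName' string.

    The spec would typically come from a configuration file, so swapping
    classifiers requires no change to the calling code.
    """
    module_name, _, class_name = spec.partition(":")
    if not module_name or not class_name:
        raise ValueError(f"expected 'module:ClassName', got {spec!r}")
    module = importlib.import_module(module_name)
    cls = getattr(module, class_name)
    instance = cls(**kwargs)
    # Duck-typed plugin contract (an assumption of this sketch).
    for method in ("fit", "predict"):
        if not callable(getattr(instance, method, None)):
            raise TypeError(f"{spec} does not implement {method}()")
    return instance


class MajorityClassifier:
    """Toy stand-in for a real classifier: predicts the most common label."""

    def fit(self, rows, labels):
        self.label_ = max(set(labels), key=labels.count)
        return self

    def predict(self, rows):
        return [self.label_ for _ in rows]


# Register the toy class under a hypothetical module name so the demo is
# self-contained; a real plugin would live in its own importable module.
_demo = types.ModuleType("demo_plugins")
_demo.MajorityClassifier = MajorityClassifier
sys.modules["demo_plugins"] = _demo

clf = load_classifier("demo_plugins:MajorityClassifier")
clf.fit([[0], [1], [2]], ["NOT_PHI", "NOT_PHI", "PHI"])
print(clf.predict([[5], [6]]))
```

Because the class is resolved from a string, a different classifier can be selected by changing only the spec value, which is the property the paragraph above describes.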
...
Functionality
Annotation
- Annotate word tokens and redact protected health information (PHI) from physician notes
- cTAKES lexical parsing and medical dictionary annotation
- WEKA multi-class decision tree classifier (plugin default)
- Protege UI support for human expert curators (Protege reads Scrubber output)
Models
- Prebuilt training and test models can be imported into WEKA (default), MATLAB, or R
- (default) Test your local physician notes without retraining
- (optional) Retrain the model using local physician note samples, publications, and medical dictionaries
Classification
- Generate feature set of lexical properties, medical concept codes, and human defined rules
- Compare lexical properties of public and private text sources
- Distinguish private patient data from coded medical concepts and commonly used words
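The classification steps above can be sketched roughly as follows. This is an illustrative Python example, not Scrubber's implementation: the regex tokenizer stands in for cTAKES parsing, the two sample sentences and the two-word dictionary are invented for the demo, and every feature name (`public_freq`, `rule_looks_like_year`, and so on) is hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into word tokens (a simple regex, not cTAKES parsing)."""
    return re.findall(r"[A-Za-z0-9]+", text)


def lexical_features(token, public_counts, private_counts, dictionary):
    """Build one feature row for a token; all feature names are hypothetical."""
    lower = token.lower()
    return {
        "is_capitalized": token[0].isupper(),
        "has_digit": any(ch.isdigit() for ch in token),
        "length": len(token),
        # How often the token appears in public vs. private text.
        "public_freq": public_counts[lower],
        "private_freq": private_counts[lower],
        "in_medical_dictionary": lower in dictionary,
        # Example of a human-defined rule: 4-digit years look like dates.
        "rule_looks_like_year": bool(re.fullmatch(r"(19|20)\d{2}", token)),
    }


# Invented sample text for the demo (not real patient data).
public_text = "Patients with diabetes often receive metformin. Diabetes is common."
private_text = "Mr Smith was seen in 2011 for diabetes and started metformin."
dictionary = {"diabetes", "metformin"}  # stand-in for medical concept codes

public_counts = Counter(t.lower() for t in tokenize(public_text))
private_counts = Counter(t.lower() for t in tokenize(private_text))

rows = {tok: lexical_features(tok, public_counts, private_counts, dictionary)
        for tok in tokenize(private_text)}
print(rows["Smith"])
print(rows["diabetes"])
```

In this toy data, a token such as "Smith" never appears in the public text and is absent from the dictionary, while "diabetes" is frequent in public text and is a known medical term. Feature rows like these are the kind of input a decision tree classifier could use to separate private patient data from medical concepts and common words.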
How-To Guide
Install
Train
Test
...