Overview
Intended usages
Info |
---|
We recommend starting with the default properties and the prebuilt train/test models. scrubber.properties lists all supported setup options in one place. The Apache UIMA, Apache cTAKES, and WEKA packages are loaded dynamically. |
...
- Annotate word tokens and redact PHI from physician notes
- cTAKES lexical parsing and medical dictionary annotation
- WEKA multi-class decision tree classifier (plugin default)
- Protege UI support for human expert curators (reads output)
- Generate a feature set of lexical properties, medical concept codes, and human-defined rules
- Compare lexical properties of public and private text sources
- Distinguish private patient data from coded medical concepts and commonly used words
- Prebuilt train and test models can be imported into WEKA (default), Matlab, or R
- (default) Test your local physician notes without retraining
- (optional) Retrain the model using local physician note samples, publications, and medical dictionaries
How To
Install, Train, Test, Scrub, Configure
Office Word document: scrubber-3.x-runtime-guide.doc
scrubber.properties
- ScrubberProperties.java statically binds scrubber.properties at startup
- ScrubberPropertiesTest.java demonstrates binding scrubber.properties to shell commands
- setClassPath.sh sets the Java classpath and exports the shell variables
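
The pattern described in the list above can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual ScrubberProperties.java: the class name StaticPropertiesSketch, the methods get, parse, and toShellExports, and the property keys in the usage note are all hypothetical. Rendering properties as export lines is only one plausible reading of "binding to shell commands"; the real setClassPath.sh may work differently.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.StringReader;
import java.util.Locale;
import java.util.Properties;
import java.util.TreeSet;

// Hypothetical sketch: NOT the project's ScrubberProperties.java.
final class StaticPropertiesSketch {
    // File name taken from the text; assumed to sit on the classpath.
    private static final String RESOURCE = "scrubber.properties";

    // Static binding: loaded exactly once, when the class is initialized.
    private static final Properties BOUND = loadFromClasspath();

    private StaticPropertiesSketch() {}

    private static Properties loadFromClasspath() {
        Properties p = new Properties();
        try (InputStream in = StaticPropertiesSketch.class
                .getClassLoader().getResourceAsStream(RESOURCE)) {
            if (in != null) {
                p.load(in); // a missing file leaves the table empty
            }
        } catch (IOException e) {
            throw new ExceptionInInitializerError(e);
        }
        return p;
    }

    // Look up a statically bound value, with a caller-supplied fallback.
    static String get(String key, String fallback) {
        return BOUND.getProperty(key, fallback);
    }

    // Parse properties from text; handy for tests and demonstrations.
    static Properties parse(String text) {
        Properties p = new Properties();
        try {
            p.load(new StringReader(text));
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
        return p;
    }

    // Render each property as a single-quoted shell "export" line,
    // sorted by key. Key characters outside A-Z, 0-9, _ become "_".
    static String toShellExports(Properties p) {
        StringBuilder sb = new StringBuilder();
        for (String key : new TreeSet<>(p.stringPropertyNames())) {
            String name = key.toUpperCase(Locale.ROOT)
                    .replaceAll("[^A-Z0-9_]", "_");
            // Escape embedded single quotes for POSIX shells.
            String value = p.getProperty(key).replace("'", "'\\''");
            sb.append("export ").append(name)
              .append("='").append(value).append("'\n");
        }
        return sb.toString();
    }
}
```

For example, parsing the two hypothetical lines `dir.input=/tmp/notes` and `model.name=demo` and passing the result to toShellExports yields `export DIR_INPUT='/tmp/notes'` followed by `export MODEL_NAME='demo'`. Loading in a static field means every caller sees the same values for the lifetime of the JVM, which matches the "at startup" wording but also means edits to the file require a restart.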