...
Info |
---|
We recommend starting with the default properties and prebuilt train/test models. |
Default configuration
- Developers can use Scrubber 3.X in "default mode" with the same settings as the provided train and test model files. Input and output settings are managed in scrubber.properties (file paths, database settings, method implementations).
Customize NLP pipeline
- Scrubber uses Apache UIMA and Apache cTAKES packages, which together provide the NLP pipeline for lexical parsing and medical concept annotation. Generated feature sets are exported to the SQL database or model file (CSV, ARFF). The UIMA and cTAKES services used by Scrubber are defined and configured using scrubber.properties.
Customize Classifier
- Scrubber can use different classifier implementations without recompiling the software.
- By default scrubber dynamically loads the popular WEKA C4.5 decision tree classifier with multi-class support.
Software Features
Annotate text automatically
...