Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Scrubber can use different classifier implementations without recompiling the software.
  • By default scrubber dynamically loads the popular WEKA C4.5 decision tree classifier with multi-class support.


Software Features

...


Annotation

  • Annotate word tokens and redact PHI from physician notes
  • cTAKES lexical parsing and medical dictionary annotation
  • WEKA multi-class decision tree classifier (plugin default)
  • Protege UI support for human expert curators (reads output) 

...

Models

  • Prebuilt train and test models can be imported to Weka (default), Matlab, or R
  • (default) Test your local physician notes without retraining
  • (optional) Retrain model using local physician note samples, publications, and medical dictionaries.  

Compare and classify medical text

Classification

  • Generate feature set of lexical properties, medical concept codes, and human defined rules
  • Compare lexical properties of public and private text sources
  • Distinguish (classify) private patient data from coded medical concepts and commonly used words

    Compare Medical Text Sources

  • Compare lexical properties and distributions of public and private text sources

    How-To Guide


    Installation

    Train and test models

    Scrub physician notes


    Office Word
    namescrubber-3.x-runtime-guide.doc
    Scrubber Property KEY = VALUE

Anchor
properties
properties


...