Overview

Intended usages

We recommend starting with the default properties and the prebuilt train/test models.
scrubber.properties lists all supported setup options in one place.
Apache UIMA, Apache cTAKES, and WEKA packages are loaded dynamically.

...

  • Annotate word tokens and redact PHI from physician notes
  • cTAKES lexical parsing and medical dictionary annotation
  • WEKA multi-class decision tree classifier (plugin default)
  • Protege UI support for human expert curators (reads output)
  • Generate a feature set of lexical properties, medical concept codes, and human-defined rules
  • Compare lexical properties of public and private text sources
  • Distinguish private patient data from coded medical concepts and commonly used words
  • Prebuilt train and test models can be imported to WEKA (default), MATLAB, or R
  • (default) Test your local physician notes without retraining
  • (optional) Retrain the model using local physician note samples, publications, and medical dictionaries

How To

Install, Train, Test, Scrub, Configure

Office Word
namescrubber-3.x-runtime-guide.doc

scrubber.properties

  • ScrubberProperties.java statically binds scrubber.properties at startup
  • ScrubberPropertiesTest.java demonstrates binding scrubber.properties to shell commands
  • setClassPath.sh sets the Java classpath and exports the shell variables
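
As a rough illustration of what "statically binds scrubber.properties at startup" can look like in Java, the sketch below loads the file once in a static initializer and exposes a lookup with a fallback value. It is not the actual ScrubberProperties.java: the class name ScrubberPropertiesSketch, the helper methods, and the key dir.input are hypothetical and chosen only for this example. It also assumes the file sits on the classpath, which may not match your installation.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.StringReader;
import java.util.Properties;

// Illustrative sketch only; NOT the project's ScrubberProperties.java.
public final class ScrubberPropertiesSketch {

    // File name taken from this page; its location on the classpath is an assumption.
    static final String FILE_NAME = "scrubber.properties";

    // Loaded once, when the class is first used ("static binding").
    private static final Properties PROPS = new Properties();

    static {
        try (InputStream in = ScrubberPropertiesSketch.class
                .getClassLoader().getResourceAsStream(FILE_NAME)) {
            if (in != null) {
                PROPS.load(in);
            }
            // If the file is missing, PROPS stays empty and lookups use the fallback.
        } catch (IOException e) {
            throw new ExceptionInInitializerError(e);
        }
    }

    private ScrubberPropertiesSketch() { }

    // Returns the configured value, or the fallback when the key is absent.
    static String get(String key, String fallback) {
        return PROPS.getProperty(key, fallback);
    }

    // Parses properties from text; handy for trying out settings without a file.
    static Properties parse(String text) throws IOException {
        Properties p = new Properties();
        p.load(new StringReader(text));
        return p;
    }

    public static void main(String[] args) {
        // "dir.input" is a hypothetical key used only for illustration.
        System.out.println("dir.input = " + get("dir.input", "(not set)"));
    }
}
```

Loading in a static initializer means a malformed or unreadable file fails fast at class-load time, instead of surfacing later during a scrub run.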