Patient Text De-identification, Natural Language Processing

Scrubber helps investigators protect patient privacy when using physician notes for clinical research. It removes HIPAA-defined Protected Health Identifiers by applying text-processing rules defined both by human experts and by machine. The software works with note text that is unformatted or contained in XML files or SQL databases. Launched in 2006, Scrubber has been approved for use by numerous hospital IRBs, and its quality has been validated by physician review.
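To illustrate the general idea of rule-based identifier removal described above, here is a minimal sketch in Python. It is not Scrubber's implementation: the rule names, the regular expressions, and the `scrub` function are all hypothetical, chosen only to show how expert-defined patterns can be matched against note text and replaced with placeholder tags.

```python
import re

# Hypothetical expert-defined rules, for illustration only.
# These are NOT Scrubber's actual rules and cover only a few identifier types.
RULES = [
    ("DATE", re.compile(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b")),
    ("PHONE", re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b")),
    ("SSN", re.compile(r"\b\d{3}-\d{2}-\d{4}\b")),
    ("MRN", re.compile(r"\bMRN[:#]?\s*\d+\b", re.IGNORECASE)),
]


def scrub(text):
    """Replace every match of each rule with a bracketed placeholder tag."""
    for label, pattern in RULES:
        text = pattern.sub(f"[{label}]", text)
    return text


note = "Seen on 03/14/2012, MRN: 123456. Call 617-555-0199."
print(scrub(note))
```

A real de-identification system needs far broader coverage (names, addresses, and other identifier types) and validation against expert review; this sketch only shows the pattern-matching step.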

Update OCT 16, 2013

New Scrubber paper with Apache cTAKES: McMurry* AJ, Fitch* B, Savova G, Kohane IS, Reis BY. “Improved de-identification of physician notes through integrative modeling of both public and private medical text”, BMC Medical Informatics and Decision Making.

Update JUNE 18, 2013

The Scrubber pipeline is moving to Apache cTAKES!
For recent progress, see this presentation to the i2b2 National Center for Biomedical Computing:
