Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3



Information for Research Investigators & Staff Scientists

>> >> Pathology Specimen Locator Core Facility at Dana-Farber Harvard Cancer Center <<



Development Status

History
The PSL has been operational since 2006. Many developers in the NCI funded SPIN program contributed to this work.


Current (2011)

Renewed interested in searching De-Identified Medical Text and Cancer Biospecimens motivated us to create this community portal for PSL.
There are 3 paid programmers working on various aspects of PSL.

Info

Much of the code is using older versions of the SPIN and Scrubber Programs.
You help make this project better by contributing documentation, code, and new ideas.

Current Committers

This project is actively sponsored by Cancer.gov to aid sharing human biospecimens with select diagnostic and treatment criteria.

Status:

2005: Approved for use at four Harvard affiliated teaching hospitals
2006: Initial open source release for Pathology Diagnoses (Linux.com article)
2007: Completely rewritten API to improve performance, reproducibility, and hospital-specific customizations.
2008: Extended to support scrubbing other kinds of notes such as patient discharge summaries.
2009: Approved for use at two large HMO sites.
2010: Machine Learning work begins using millions of peer-reviewed publications to train "ham" (medical concepts) from "spam" (patient identifiers).

2011 Roadmap

  • Currently statistical evaluation of the scrubber performance is underway for upcoming publications.
  • Active development on De-ID improvements using corpus data.
  • Active development on new Concept Extraction module for Scrubber.