Information for Research Investigators & Staff Scientists
>> >> Pathology Specimen Locator Core Facility at Dana-Farber Harvard Cancer Center <<
Development Status
History
The PSL has been operational since 2006. Many developers in the NCI funded SPIN program contributed to this work.
Current (2011)
Renewed interested in searching De-Identified Medical Text and Cancer Biospecimens motivated us to create this community portal for PSL.
There are 3 paid programmers working on various aspects of PSL.
Info |
---|
Much of the code is using older versions of the SPIN and Scrubber Programs. |
Current Committers
- Biospecimen search at Harvard : Larry Chung
- Open Source development: Andrew McMurry
- De-Identification : Britt Fitch
This project is actively sponsored by Cancer.gov to aid sharing human biospecimens with select diagnostic and treatment criteria.
Status:
2005: Approved for use at four Harvard affiliated teaching hospitals
2006: Initial open source release for Pathology Diagnoses (Linux.com article)
2007: Completely rewritten API to improve performance, reproducibility, and hospital-specific customizations.
2008: Extended to support scrubbing other kinds of notes such as patient discharge summaries.
2009: Approved for use at two large HMO sites.
2010: Machine Learning work begins using millions of peer-reviewed publications to train "ham" (medical concepts) from "spam" (patient identifiers).
2011 Roadmap
- Currently statistical evaluation of the scrubber performance is underway for upcoming publications.
- Active development on De-ID improvements using corpus data.
- Active development on new Concept Extraction module for Scrubber.