Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

Contents

  1. Overview
  2. Required Software
  3. Definitions
  4. Installation
  5. Running the ETL to extract from your database or file
  6. Scrubbing Protected Health Identifiers from Free Text

Anchor
overview
overview


Overview

The ETL program Extracts patient information from existing systems, Transforms the results into standard HIPAA safe medical vocabularies, and Loads the results into a locally controlled Peer node.

...

Anchor
definitions
definitions


Definitions

ETL:  (1) Extract (2) Transform (3) Load

...

Anchor
required-software
required-software


Required Software:

Java 1.5 or later

MySQL 4.1 or later

Anchor
installation
installation


Installation

(1)  Unzip the ETL zip archive to a directory of your choice

...

# Extractor for the VSL pipeline #

Wiki Markup
                               \[DB\]               Database Connection

Wiki Markup
                               \[XML\]            XML Formatted File

                               [DB]               Database Connection

                               [XML]            XML Formatted File

                               [CUSTOM]    Wiki Markup                               \[CUSTOM\]    Custom

Extractor for the VSL pipeline=DB

The wizard will then ask which type of DB extractor to use:

# Database Extractors #

Wiki Markup
                    \[STANDARD\]    Standard Pathology LIMS

Wiki Markup
                    \[MGH\]                MGH Frozen Tissue Repository

Wiki Markup
                    \[BWH\]                BWH Frozen Tissue Repository

                    [STANDARD]    Standard Pathology LIMS

                    [MGH]                MGH Frozen Tissue Repository

                    [BWH]                BWH Frozen Tissue Repository

                    [CRIMSON]       Crimson Biomaterials Collection Wiki Markup                    \[CRIMSON\]       Crimson Biomaterials Collection Core

Database Extractors=STANDARD

...

Done.

Anchor
run-etl
run-etl


Running the ETL Program

Running ETL on Demand

Syntax:

...

ETLRunner myConfig.xml hours 1 MM-DD-YYYY HH:mm:ss

 

Anchor
scrubber
scrubber

SCRUBBER


The SCRUBBER can be setup to de-identify text "on the fly" before it is loaded into the peer database.
We recommend using the 2.8 scrubber because it has numerous upgrades. For more information, contact us.