OVERVIEW

This database was created to store hundreds of thousands of human and machine annotations.
In this context, "human" annotations are those created by an expert reviewer using a program such as Protégé, whereas "machine" annotations are labeled automatically. The database also stores parsed medical dictionaries and journal publications.

LIST OF TABLES

Table Name                   Used for Scrubbing   Used for Publication Analysis
Feature_matrix_test          YES
Feature_matrix_train         YES
Human_annotations_test       YES
Human_annotations_train      YES
Machine_annotations_test     YES
Machine_annotations_train    YES
Lookup_dictionary            YES
Lookup_term_frequency        YES
Lookup_umls                  YES
Pubs_authors                                      YES
Pubs_keywords                                     YES
Pubs                                              YES
Pubs_refs                                         YES

TABLE DESCRIPTIONS

Feature_matrix_*

Stores the feature matrix built from the Machine_annotations_* and Human_annotations_* tables. This is the rolled-up feature set used for classification.
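
As a rough illustration of what "rolled up" could mean here, the sketch below pivots per-token machine annotations into one feature row per token and attaches the human label where one exists. The row shapes, the feature names, and the function build_feature_matrix are all hypothetical; this page does not specify the actual columns of the annotation tables or how the real matrix is built.

```python
from collections import defaultdict


def build_feature_matrix(machine_rows, human_rows):
    """Roll up annotations into one feature row per token.

    Assumed (hypothetical) input shapes:
      machine_rows: iterable of (token_id, feature_name, value)
      human_rows:   iterable of (token_id, label)
    """
    # Group machine annotations by token: {token_id: {feature_name: value}}
    features = defaultdict(dict)
    for token_id, feature_name, value in machine_rows:
        features[token_id][feature_name] = value

    # Human annotations supply the label for each token.
    labels = dict(human_rows)

    # Use a fixed, sorted column order so every row has the same layout.
    columns = sorted({name for row in features.values() for name in row})

    matrix = []
    for token_id in sorted(features):
        row = features[token_id]
        matrix.append(
            [token_id]
            + [row.get(column, 0) for column in columns]  # missing feature -> 0
            + [labels.get(token_id)]  # None if no human label exists
        )

    header = ["token_id"] + columns + ["label"]
    return header, matrix


# Hypothetical sample data, for illustration only.
machine = [
    (1, "is_capitalized", 1),
    (1, "in_dictionary", 0),
    (2, "in_dictionary", 1),
]
human = [(1, "label_a"), (2, "label_b")]

header, matrix = build_feature_matrix(machine, human)
print(header)
for row in matrix:
    print(row)
```

Filling a missing feature with 0 and a missing label with None is a choice made for this sketch; the real tables may handle gaps differently.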

...