Vocabulary

Resource = a record of a "thing" in eagle-i. A resource is an instance of a class defined in the eagle-i ontology (e.g. "My nifty DNA Microarray" is an instance of the class "DNA MicroArray", which is a subclass of "Instrument")

A note on formats

See the MIME Type section of the eagle-i repo API guide for the formats supported by the SPARQL endpoint. All our queries are SELECT queries, so the best choice is probably SPARQL Result Set, an XML representation of a table.
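
As a rough illustration of consuming that format, the sketch below turns a SPARQL Result Set document into a list of rows using only the Python standard library. The namespace is the one defined by the W3C SPARQL Query Results XML Format; the function name and the sample document are made up for the example and do not come from eagle-i.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SPARQL Query Results XML Format.
SPARQL_NS = {"sr": "http://www.w3.org/2005/sparql-results#"}


def parse_result_set(xml_text):
    """Turn a SPARQL XML result set into a list of {variable: value} dicts."""
    root = ET.fromstring(xml_text)
    rows = []
    for result in root.findall("sr:results/sr:result", SPARQL_NS):
        row = {}
        for binding in result.findall("sr:binding", SPARQL_NS):
            # Each binding wraps one <uri>, <literal> or <bnode> element.
            value = binding[0]
            row[binding.get("name")] = value.text or ""
        rows.append(row)
    return rows


# Hypothetical sample document, for illustration only.
SAMPLE = """<sparql xmlns="http://www.w3.org/2005/sparql-results#">
  <head><variable name="resourceUri"/><variable name="resourceLabel"/></head>
  <results>
    <result>
      <binding name="resourceUri"><uri>http://example.org/i/1</uri></binding>
      <binding name="resourceLabel"><literal>Example tool</literal></binding>
    </result>
  </results>
</sparql>"""

rows = parse_result_set(SAMPLE)
```

Variables that are unbound in a given row (for example values matched inside OPTIONAL) are simply absent from that row's dict.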

Configuration

Organization Identifier

The organization to which biostats tools are attached in the eagle-i repository is:

http://harvard.eagle-i.net/i/0000012e-5946-2efe-55da-381e80000000

The same URI can be used in the dev environment by changing the hostname in the organization config property below (e.g. to harvard.dev.eagle-i.net).

Config File
# General
# What general type of resources we are dealing with

baseType=http://purl.obolibrary.org/obo/ERO_0000071

# What organization are the resources "attached" to (all resources in eagle-i need to belong to an organization)

organization=http://harvard.eagle-i.net/i/0000012e-5946-2efe-55da-381e80000000

# What properties to show in the overview record
# Literal properties (i.e. properties whose value is a simple string)
# Website
lp1=http://purl.obolibrary.org/obo/ERO_0000480
# Object properties (i.e. properties whose value is a link to another resource or ontology term)
# Not used for Biostats
# op1=
# op2=

################
# Filters
################
# Filter Group titles - for UI (do not grab these labels from ontology, they're usually not what you need)
fg1="Software Purpose"
fg2="Study Design"
fg3="Algorithm"
fg4="Data Type"
fg5="Measurement Scale"

# Properties for creating filters
# Each property corresponds to a filter group
# Direct filter properties
#has software purpose
fp1=http://purl.obolibrary.org/obo/ERO_0000078
#has related Study Design
fp2=http://eagle-i.org/ont/app/1.0/has_related_study_design
#Algorithm used
fp3=http://www.ebi.ac.uk/efo/swo/SWO_0000740

#Indirect filter properties (the property hangs off an embedded class) - need embedded property and indirect property
#has data input
fp4=http://purl.obolibrary.org/obo/ERO_0000076
#data type
ip4=http://www.w3.org/1999/02/22-rdf-syntax-ns#type
#has data input
fp5=http://purl.obolibrary.org/obo/ERO_0000076
#has measurement scale
ip5=http://eagle-i.org/ont/app/1.0/has_measurement_scale
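
A minimal sketch of reading this file, assuming it stays in the simple key=value form shown above. The function names (load_config, filter_groups) are hypothetical; the key naming (fgN, fpN, ipN) follows the file above.

```python
def load_config(text):
    """Parse key=value lines, skipping blank lines and # comments."""
    config = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a key=value line
        # Filter group titles are quoted in the file; drop the quotes.
        config[key.strip()] = value.strip().strip('"')
    return config


def filter_groups(config):
    """Pair each filter group title with its property and, if set, its indirect property."""
    groups = []
    n = 1
    while f"fp{n}" in config:
        groups.append({
            "title": config.get(f"fg{n}", ""),
            "property": config[f"fp{n}"],
            "indirect": config.get(f"ip{n}"),  # None for direct filters
        })
        n += 1
    return groups
```

A filter group whose "indirect" entry is set is one where fpN points at the embedded resource and ipN is the property read from it.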

A more advanced version could use open-ended lists, potentially in YAML (the exact syntax is still to be settled), for example:

# What properties to show in the overview record
# Literal properties (i.e. properties whose value is a simple string)
literalProperties:
  - propertyUri: http://purl.obolibrary.org/obo/ERO_0000480

# Object properties (i.e. properties whose value is a link to another resource or ontology term)
objectProperties:
  - propertyUri: http://purl.obolibrary.org/obo/ERO_0001719
  - propertyUri: http://purl.obolibrary.org/obo/ERO_0000078

Base queries

Get base, unfiltered dataset

Find all instances of baseType affiliated with the organization. The query returns a table with one column per query variable defined in the property configuration above.

This needs to be split into two queries (one for ontology labels, one for instances). Note that if the desired properties don't have a preferred label, Query 1 will need to be altered.

Query 1: to be executed only once (labels don't change)

PREFIX eiapp: <http://eagle-i.org/ont/app/1.0/>
SELECT

?lp1Label
?op1Label
?op2Label

 WHERE {
     # One OPTIONAL per property, so a missing label does not drop the others
     OPTIONAL { <${lp1}> eiapp:preferredLabel ?lp1Label . }
     OPTIONAL { <${op1}> eiapp:preferredLabel ?op1Label . }
     OPTIONAL { <${op2}> eiapp:preferredLabel ?op2Label . }
}

Query 2
Note that if a property is not defined in the config file (e.g. op1), both its variables in the SELECT clause and its entire triple pattern or pattern group in the WHERE clause (i.e. the corresponding line) need to be omitted. The query generation therefore needs some conditional logic.

PREFIX ei: <http://purl.obolibrary.org/obo/>
PREFIX eiapp: <http://eagle-i.org/ont/app/1.0/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT
?resourceUri
?resourceLabel
?resourceDescription
?lp1Value
?op1ValueLabel
?op1ValueUri
?op2ValueLabel
?op2ValueUri

 WHERE {
     ?resourceUri a <${baseType}> .
     ?resourceUri ei:ERO_0000070 <${organization}> .
     ?resourceUri rdfs:label ?resourceLabel .
     # One OPTIONAL per property, so a missing value does not drop the others
     OPTIONAL { ?resourceUri eiapp:resource_description ?resourceDescription . }
     OPTIONAL { ?resourceUri <${lp1}> ?lp1Value . }
     OPTIONAL { ?resourceUri <${op1}> ?op1ValueUri . ?op1ValueUri rdfs:label ?op1ValueLabel . }
     OPTIONAL { ?resourceUri <${op2}> ?op2ValueUri . ?op2ValueUri rdfs:label ?op2ValueLabel . }
}
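
The conditional logic for Query 2 could look roughly like the sketch below. It assumes the config has already been read into a dict keyed as in the config file (baseType, organization, lp1, op1, ...); the function name is hypothetical. As a design choice of this sketch, each optional property is wrapped in its own OPTIONAL group so that a resource missing one value still returns the others.

```python
import re

PREFIXES = """PREFIX ei: <http://purl.obolibrary.org/obo/>
PREFIX eiapp: <http://eagle-i.org/ont/app/1.0/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
"""


def build_base_query(config):
    """Assemble Query 2, leaving out any lpN/opN property missing from config."""
    select = ["?resourceUri", "?resourceLabel", "?resourceDescription"]
    where = [
        f"?resourceUri a <{config['baseType']}> .",
        f"?resourceUri ei:ERO_0000070 <{config['organization']}> .",
        "?resourceUri rdfs:label ?resourceLabel .",
        "OPTIONAL { ?resourceUri eiapp:resource_description ?resourceDescription . }",
    ]
    for key in sorted(config):
        if not config[key]:
            continue  # property declared but left empty: treat as unset
        if re.fullmatch(r"lp\d+", key):
            select.append(f"?{key}Value")
            where.append(f"OPTIONAL {{ ?resourceUri <{config[key]}> ?{key}Value . }}")
        elif re.fullmatch(r"op\d+", key):
            select += [f"?{key}ValueLabel", f"?{key}ValueUri"]
            where.append(
                f"OPTIONAL {{ ?resourceUri <{config[key]}> ?{key}ValueUri . "
                f"?{key}ValueUri rdfs:label ?{key}ValueLabel . }}"
            )
    return (
        PREFIXES
        + "SELECT " + " ".join(select)
        + "\nWHERE {\n  " + "\n  ".join(where) + "\n}"
    )
```

With only lp1 configured, as for Biostats, the generated query has no op1 or op2 variables or patterns at all.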

Example with substituted values

PREFIX ei: <http://purl.obolibrary.org/obo/>
PREFIX eiapp: <http://eagle-i.org/ont/app/1.0/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT
?resourceUri
?resourceLabel
?resourceDescription
?lp1Value

 WHERE {
     ?resourceUri a <http://purl.obolibrary.org/obo/ERO_0000071> .
     ?resourceUri ei:ERO_0000070 <http://harvard.dev.eagle-i.net/i/0000012e-5946-2efe-55da-381e80000000> .
     ?resourceUri rdfs:label ?resourceLabel .
     OPTIONAL { ?resourceUri eiapp:resource_description ?resourceDescription . }
     OPTIONAL { ?resourceUri <http://purl.obolibrary.org/obo/ERO_0000480> ?lp1Value . }
}

Get detailed view of a resource

For the main resource, simply do a GET of its URI. The results can be returned in a variety of formats; see the eagle-i repo API guide.

An additional parameter, forceXML, returns what may be the most useful format: the XML from which the HTML version is generated. Pure RDF/XML is much harder to work with.

Example:

GET http://harvard.qa.eagle-i.net/i/0000012e-5480-32e6-55da-381e80000000?forceXML

After getting the main resource, the embedded resources need to be processed. Assume the main resource has the URI http://harvard.eagle-i.net/123

This query will return embedded resources:

PREFIX eiapp: <http://eagle-i.org/ont/app/1.0/>
SELECT
?embeddedResourceUri
WHERE {
<http://harvard.eagle-i.net/123> ?anyProperty ?embeddedResourceUri .
?embeddedResourceUri a ?type .
?type eiapp:inClassGroup eiapp:ClassGroup_EmbeddedResourceType .
}

To obtain the embedded resource's details, iterate over this list and do a GET as with the main resource.
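
The fetch-and-iterate step might be sketched as follows, using only the Python standard library. The function names are hypothetical, and the sketch assumes forceXML is passed as a bare query parameter, as in the example GET above. The HTTP call is kept behind a replaceable function so the loop can be exercised without network access.

```python
import urllib.request


def detail_request(resource_uri, force_xml=True):
    """Build the GET request for a resource, optionally adding ?forceXML."""
    url = resource_uri + ("?forceXML" if force_xml else "")
    return urllib.request.Request(url, method="GET")


def http_get(request):
    """Perform the request and return the response body as text."""
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


def fetch_details(main_uri, embedded_uris, get=http_get):
    """Return {uri: body} for the main resource and each embedded resource."""
    return {uri: get(detail_request(uri)) for uri in [main_uri, *embedded_uris]}
```

Here embedded_uris would be the ?embeddedResourceUri column returned by the query above.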

Queries for filters

Get actual annotation values that have been used in the dataset

Do this for each direct filter property, e.g. fp1

PREFIX ei: <http://purl.obolibrary.org/obo/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT DISTINCT
?filterValueUri
?filterValueLabel

WHERE {
     ?resourceUri a <${baseType}> .
     ?resourceUri ei:ERO_0000070 <${organization}> .
     ?resourceUri <${fp1}> ?filterValueUri .
     ?filterValueUri rdfs:label ?filterValueLabel .
}

Do this for each indirect filter property, e.g. fp4 (the property pointing to the embedded resource) and ip4

PREFIX ei: <http://purl.obolibrary.org/obo/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT DISTINCT
?filterValueUri
?filterValueLabel

WHERE {
?resourceUri a <${baseType}> .
?resourceUri ei:ERO_0000070 <${organization}> .
?resourceUri <${fp4}> ?embeddedResource .
?embeddedResource <${ip4}> ?filterValueUri .
?filterValueUri rdfs:label ?filterValueLabel .
}

Get the filtered dataset

The first part is identical to the unfiltered query (remember to remove unset config params, e.g. op1); the filter conditions are then added as triple patterns.
Assume filterValueUriN is the filter value selected by the user for filter property N, e.g. filterValueUri1 for fp1.
NOTE: add a triple pattern (or triple pattern group) only for the filters the user has actually selected; if only one filter is selected, add only that one.

PREFIX ei: <http://purl.obolibrary.org/obo/>
PREFIX eiapp: <http://eagle-i.org/ont/app/1.0/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

SELECT
?resourceUri
?resourceLabel
?resourceDescription
?lp1Value
?op1ValueLabel
?op1ValueUri
?op2ValueLabel
?op2ValueUri

WHERE {
?resourceUri a <${baseType}> .
?resourceUri ei:ERO_0000070 <${organization}> .
?resourceUri rdfs:label ?resourceLabel .
OPTIONAL { ?resourceUri eiapp:resource_description ?resourceDescription . }
OPTIONAL { ?resourceUri <${lp1}> ?lp1Value . }
OPTIONAL { ?resourceUri <${op1}> ?op1ValueUri . ?op1ValueUri rdfs:label ?op1ValueLabel . }
OPTIONAL { ?resourceUri <${op2}> ?op2ValueUri . ?op2ValueUri rdfs:label ?op2ValueLabel . }

?resourceUri <${fp1}> <${filterValueUri1}> .
?resourceUri <${fp2}> <${filterValueUri2}> .
?resourceUri <${fp3}> <${filterValueUri3}> .

?resourceUri <${fp4}> ?embeddedResource1 . ?embeddedResource1 <${ip4}> <${filterValueUri4}> .
?resourceUri <${fp5}> ?embeddedResource2 . ?embeddedResource2 <${ip5}> <${filterValueUri5}> .
}
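
One possible way to generate only the filter patterns the user has selected is sketched below. It assumes the config is a dict keyed as in the config file, where fpN holds the filter property (for indirect filters, the property pointing to the embedded resource) and ipN, when present, holds the indirect property. The function name and the shape of the selected argument are hypothetical.

```python
def filter_patterns(config, selected):
    """Build triple patterns for the selected filters.

    `selected` maps a filter number (1, 2, ...) to the value URI chosen by
    the user; filters the user did not select are simply absent.
    """
    patterns = []
    for n, value_uri in sorted(selected.items()):
        prop = config[f"fp{n}"]
        indirect = config.get(f"ip{n}")
        if indirect:
            # Indirect filter: go through the embedded resource first.
            patterns.append(
                f"?resourceUri <{prop}> ?embeddedResource{n} . "
                f"?embeddedResource{n} <{indirect}> <{value_uri}> ."
            )
        else:
            patterns.append(f"?resourceUri <{prop}> <{value_uri}> .")
    return patterns
```

The returned lines would be appended inside the WHERE clause of the unfiltered query, after its last OPTIONAL group.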