10
A Semantic Web Platform for Genomics Experiments Emily Merrill 1 , Stéphane Corlosquet 1 , Paolo Ciccarese 1,2 , Tim Clark 1,2,3 & Sudeshna Das 1,2 1 Massachusetts General Hospital 2 Harvard Medical School 3 University of Manchester †Authors contributed equally

exFrame: a Semantic Web Platform for Genomics Experiments

  • Upload
    twclark

  • View
    744

  • Download
    3

Embed Size (px)

DESCRIPTION

slides from talk given at Bio-ontologies 2013, Berlin DE, 20 July 2013 Emily Merrill*, Stephane Corlosquet*, Paolo Ciccarese†*, Tim Clark*†‡, Sudeshna Das†* * Massachusetts General Hospital † Harvard Medical School ‡ School of Computer Science, University of Manchester

Citation preview

Page 1: exFrame: a Semantic Web Platform for Genomics Experiments

A Semantic Web Platform for Genomics

ExperimentsEmily Merrill1, Stéphane Corlosquet1, Paolo Ciccarese1,2,

Tim Clark1,2,3 & Sudeshna Das1,2

1Massachusetts General Hospital 2Harvard Medical School

3University of Manchester

†Authors contributed equally

Page 2: exFrame: a Semantic Web Platform for Genomics Experiments

What is eXframe?• reusable framework for creating online data

repositories• upgraded version based on Drupal 7 (drupal.org)• structured annotation of experiments, bio-

materials and assays• publishes Semantic Web data automatically (RDF

& SPARQL endpoint)• first instance of upgraded version - Stem Cell

Commons (http://stemcellcommons.org)

Page 3: exFrame: a Semantic Web Platform for Genomics Experiments
Page 4: exFrame: a Semantic Web Platform for Genomics Experiments

Architecture

• re-factored second generation of eXframe

• updated experimental model mapped to ontologies

• Drupal RDF modules used to generate RDF

• RDF store (SPARQL endpoint) powered by ARC2 PHP library

Page 5: exFrame: a Semantic Web Platform for Genomics Experiments

Data Model• experiment

(obi:investigation); meta-data mapped to Dublin Core (dc)

• researchers & citations mapped to foaf & bibo respectively

• experiments are comprised of bioassays (obo:assay)

• bioassays have replicates (efo:replicate)

• replicates are associated with biomaterials (obo:specimen)

Page 6: exFrame: a Semantic Web Platform for Genomics Experiments

Biomaterials• biomaterials are deeply annotated, key examples:- organism (NCBI Taxonomy)- tissue (FMA: Foundation Model of Anatomy)- cell type (CL: Cell Type ontology)- disease state (DO: Disease ontology)- treatment compound (CheBI: Chemical Entities of

Biological Interest)• )

Page 7: exFrame: a Semantic Web Platform for Genomics Experiments

Sample RDF from Stem Cell Commons

Page 8: exFrame: a Semantic Web Platform for Genomics Experiments

SPARQL queries• flexible query system with

SPARQL

• allows graph queries

• integration with other endpoints

• sample query on right: “find experiments done on mouse, hematopoietic stem cells”.

Page 9: exFrame: a Semantic Web Platform for Genomics Experiments

Security• Stem Cell Commons: selected experiments

are accessible only to researchers from Harvard Stem Cell Institute

• created two stores: - public with limited data- private with all data

Page 10: exFrame: a Semantic Web Platform for Genomics Experiments

Conclusion• groups & institutions can create databases

simply by configuring eXframe• structured repository serves as institutional

memory and facilitates publication• automatic RDF generation & SPARQL

endpoint lowers barrier to Semantic Web adoption