The MGED Ontology: A framework for describing functional genomics experiments SOFG Nov. 19, 2002 Chris Stoeckert, Ph.D. Dept. of Genetics & Center for Bioinformatics University of Pennsylvania


Page 1

The MGED Ontology: A framework for describing functional genomics experiments

SOFG Nov. 19, 2002

Chris Stoeckert, Ph.D.

Dept. of Genetics & Center for Bioinformatics

University of Pennsylvania

Page 2

Nature, October 3, 2002

http://plasmodb.org/

David Roos, Jessie Kissinger, Bindu Gajria, Martin Fraunholz, Jules Milgram, Phil Labo, Amit Bahl, Dave Pearson, Dinesh Gupta, Hagai Ginsburg

Jonathan Crabtree, Jonathan Schug, Brian Brunk, Greg Grant, Trish Whetzel, Matt Mailman, Li Li

Page 3
Page 4

Desirable Microarray Queries

• Return all experiments using developmental stage X.
  – Sort by platform type
  – Which are untreated? Treated?
    • Treated by what?

• How comparable are these?

• What can these experiments tell me?
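The queries listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Experiment` record, its field names, and the sample data are hypothetical and do not come from RAD or any MGED schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Experiment:
    # Hypothetical record; field names are assumptions for illustration.
    name: str
    developmental_stage: str
    platform_type: str
    treatment: Optional[str] = None  # None means untreated


def experiments_at_stage(experiments, stage):
    """Return experiments at the given stage, sorted by platform type."""
    hits = [e for e in experiments if e.developmental_stage == stage]
    return sorted(hits, key=lambda e: (e.platform_type, e.name))


def split_by_treatment(experiments):
    """Separate untreated experiments from treated ones, grouped by treatment."""
    untreated = [e for e in experiments if e.treatment is None]
    treated = {}
    for e in experiments:
        if e.treatment is not None:
            treated.setdefault(e.treatment, []).append(e)
    return untreated, treated


# Made-up sample data.
EXPERIMENTS = [
    Experiment("exp1", "stage_X", "spotted cDNA"),
    Experiment("exp2", "stage_X", "oligonucleotide", treatment="compound_A"),
    Experiment("exp3", "stage_Y", "spotted cDNA"),
    Experiment("exp4", "stage_X", "oligonucleotide"),
]

hits = experiments_at_stage(EXPERIMENTS, "stage_X")
untreated, treated = split_by_treatment(hits)
print([e.name for e in hits])
print([e.name for e in untreated])
print({t: [e.name for e in es] for t, es in treated.items()})
```

Such queries only work when every experiment records stage, platform, and treatment with the same terms, which is the motivation for a shared ontology.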

Page 5

Microarray Information to be Shared

Figure from: David J. Duggan et al. (1999) Expression Profiling using cDNA microarrays. Nature Genetics 21: 10-14

Page 6

The Computational View of Microarray Information

Need an ontology to unambiguously represent this information.

Page 7

An Experimental Ontology

• An ontology for microarray experiments
  – Not an ontology of life but of experiments
  – Parts are applicable to describing experiments in general

• Our approach to interfacing with other ontologies is “experimental”
  – Not mapping terms from related ontologies
  – Provide a framework to hang other ontologies off of
    • Know where to find different types of annotation
    • How to interpret that annotation
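One way to picture "a framework to hang other ontologies off of" is an annotation slot that names a framework concept and points at a term in an external resource, without remapping that term. The Python sketch below is an assumption-laden illustration: the `OntologyEntry` fields, the `SOURCES_BY_CATEGORY` registry, and the vocabulary names are hypothetical and are not taken from the MGED Ontology itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OntologyEntry:
    category: str  # framework concept, e.g. "DiseaseState"
    value: str     # term used as-is from the external resource
    source: str    # name of the external ontology or vocabulary


# Hypothetical registry: which external resources may fill which concept.
SOURCES_BY_CATEGORY = {
    "DiseaseState": {"example_disease_vocabulary"},
    "DevelopmentalStage": {"example_stage_vocabulary"},
}


def annotate(category, value, source):
    """Build an annotation, checking that the source fits the category."""
    allowed = SOURCES_BY_CATEGORY.get(category)
    if allowed is None:
        raise KeyError(f"unknown category: {category}")
    if source not in allowed:
        raise ValueError(f"{source} is not registered for {category}")
    return OntologyEntry(category, value, source)


entry = annotate("DiseaseState", "example_term", "example_disease_vocabulary")
print(entry)
```

The category tells a reader where to find a type of annotation, and the source tells them how to interpret the value.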

Page 8

http://www.mged.org

Page 9

Relationship of MGED Efforts

[Diagram labels: MAGE; MIAME; DB; External Ontologies/CVs; MGED Ontology; Software and database developers; Investigators annotating experiments]

Page 10

The MGED Ontology Home Page

http://mged.sourceforge.net/ontologies/

Page 11

The MGED Ontology Provides a Listing of Resources for Many Species

Page 12

The MGED Ontology Organizes the Resources According to Concepts

Page 13

The MGED Ontology is Structured in DAML+OIL using OILed 3.4

Page 14

MGED Ontology: BiomaterialDescription: BiosourceProperty: Age

Page 15

MGED Ontology: BiosourceOntologyEntry: DiseaseState

Page 16

ArrayExpress

MIAMExpress

RAD

MAGE-ML data exchange

Ontology instances propagated to submission/annotation web forms

Curation of user defined terms, before inclusion in the ontology

User defined terms collected via forms

MGED Ontology

BiomaterialDescription

Sex

Gender

documentation: Subclass of sex applicable to heterogametic species (i.e., those in which the sexes produce gametes of markedly different size). Males produce small numerous gametes. Females produce small numbers of large gametes. Hermaphrodites are individuals with both male and female characteristics. Mixed refers to a population of individuals with more than one type of gender.

used in individuals: female, hermaphrodite, male, mixed_sex, unknown_sex
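The cycle described on this slide (ontology instances offered on forms, user-defined terms collected, then curated before inclusion) can be sketched as follows. This is a minimal illustration, not the MIAMExpress or RAD implementation; the `TermCollector` class and its method names are hypothetical, and only the five instance names come from the slide.

```python
# Instances listed on the slide.
LISTED_INSTANCES = {"female", "hermaphrodite", "male", "mixed_sex", "unknown_sex"}


class TermCollector:
    """Hypothetical helper: accept known terms, queue unknown ones for curation."""

    def __init__(self, known_terms):
        self.known_terms = set(known_terms)
        self.pending = []  # user-defined terms awaiting curation

    def submit(self, term):
        """Return the normalized term if known; otherwise queue it and return None."""
        normalized = term.strip().lower()
        if normalized in self.known_terms:
            return normalized
        if normalized not in self.pending:
            self.pending.append(normalized)
        return None

    def approve(self, term):
        """Curator accepts a pending term into the controlled vocabulary."""
        self.pending.remove(term)
        self.known_terms.add(term)


collector = TermCollector(LISTED_INSTANCES)
print(collector.submit(" Female "))         # known term, normalized
print(collector.submit("example_new_term"))  # unknown, queued for curation
collector.approve("example_new_term")
print(collector.submit("example_new_term"))  # now accepted
```

Keeping unknown terms in a separate queue means the vocabulary only grows through an explicit curation step.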

Page 17

The MGED Ontology in Action: MIAMExpress

Page 18

RAD schema uses MAGE/MIAME

[Schema diagram. Tables shown, linked by 1 to 0..* and 0..1 to 0..* associations: ElementAnnotation, Analysis, AnalysisImplementationParam, AnalysisInput, AnalysisImplementation, AnalysisInvocationParam, AnalysisInvocation, AnalysisOutput, CompositeElementAnnotation, ArrayAnnotation, CompositeElementImp, ElementResultImp, CompositeElementResultImp, QuantificationParam, RelatedQuantification, Study, StudyDesignDescription, StudyAssay, StudyDesignAssay, StudyFactorValue, AssayLabeledExtract, BioMaterialImp, LabelMethod, ProtocolParam, MAGEDocumentation, MAGE_ML, AcquisitionParam, Assay, Channel, Quantification, Acquisition, RelatedAcquisition, ProcessImplementationParam, ProcessIO, ProcessInvocation, ProcessInvocationParam, Array, BioMaterialMeasurement, Protocol, Treatment, StudyDesign, BioMaterialCharacteristic, ProcessImplementation, ElementImp, Control, ProcessResult, StudyFactor, OntologyEntry]

MAGE: Experiment, Array, BioMaterial, BioAssay, BioAssayData, Protocol, Descr., HigherLevelAnalysis

MIAME: Experimental Design, Array design, Samples, Hybridization, Measure, Normalization

Page 19

The MGED Ontology in Action: RAD

• Add screen shot of study factor

Page 20

RAD Generic Form for BioMaterial Characteristics

Page 21

RAD Project-Specific Form - PlasmoDB

Page 22

Acquiring New Terms

Add term from SRes

OR

Add term from an External Database

Page 23

Generating Forms from the MGED Ontology

[Diagram labels: OntologyEntry; ExternalDatabases; PHP/SQL; WWW; RAD Forms; MGED Ontology; Anatomy; DevelopmentalStage; Disease; Lineage; PATOAttribute; Phenotype; Taxon; SRES; RAD3]
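As a rough idea of what generating a form from ontology terms can look like, the sketch below renders an HTML drop-down from a list of terms. The slide names PHP/SQL as the actual technology; Python is used here only for illustration, and `render_select` and the field name are hypothetical.

```python
from html import escape


def render_select(field_name, terms):
    """Render an HTML <select> with one option per ontology term."""
    options = "\n".join(
        f'  <option value="{escape(t, quote=True)}">{escape(t)}</option>'
        for t in sorted(terms)
    )
    return f'<select name="{escape(field_name, quote=True)}">\n{options}\n</select>'


# In a real system the terms would be read from the database, not hard-coded.
form_html = render_select("sex", ["male", "female", "unknown_sex"])
print(form_html)
```

Because the options come from the stored terms, adding a curated term makes it appear on the form without editing the form itself.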

Page 24

RAD is now part of GUS-3.0

GUS has 5 namespaces compartmentalizing different types of information.

Namespace   Domain                    Features
Core        Data provenance           Workflows
Sres        Shared resources          Ontologies
DoTS        Sequence and annotation   Central dogma
RAD         Gene expression           MIAME/MAGE
TESS        Gene regulation           Grammars

Page 25

GUS Supports Multiple Projects

Projects: AllGenes, PlasmoDB, EPConDB

Namespaces: Core, SRES, TESS, RAD, DoTS

Oracle RDBMS; Object Layer for Data Loading; Java Servlets

Other sites, other projects, e.g. GeneDB

Available at http://www.gusdb.org

Page 26

http://www.gusdb.org

Page 27

Acknowledgements

• MGED Ontology
  – Helen Parkinson (EBI)
  – Trish Whetzel
  – The MGED Ontology Working Group
  – MAGE working group
  – Angel Pizarro
  – Nelson Axelrod

• RAD/GUS
  – Brian Brunk
  – Jonathan Crabtree
  – Steve Fischer
  – Yongchang Gan
  – Greg Grant
  – Hongxian He
  – Li Li
  – Junmin Liu
  – Matt Mailman
  – Elizabetta Manduchi
  – Joan Mazzarelli
  – Shannon McWeeney (OHSU)
  – Debbie Pinney
  – Angel Pizarro
  – Jonathan Schug
  – Trish Whetzel

www.mged.org www.cbil.upenn.edu