The Mouse Gene Expression Database (GXD) Martin Ringwald The Jackson Laboratory

Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Page 1: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

The Mouse Gene Expression Database (GXD)

Martin Ringwald

The Jackson Laboratory

Page 2: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Mouse developmental gene expression data provide insights into

• organismal function of genes

• molecular mechanism of differentiation

• molecular basis of disease

Genotype Phenotype Expression

Mouse Strains and Mutants

Of mice and men …

Page 3: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

The Gene Expression Database (GXD)

• integrates different types of expression data
  - RNA in situ hybridization
  - Immunohistochemistry
  - Knock-in reporter studies
  - Northern blot
  - Western blot
  - RT-PCR

• focus on endogenous gene expression during mouse development

• all developmental stages

• expression data from wild-type and mutant mice

[Diagram labels: Gene; RNA; Protein; 1…n; 1…p; Time; Space; Genotype]

Page 4: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Standardized description of expression patterns

Hierarchical structure:
• Extensibility
• Hierarchical searches
• Integrated description of expression patterns from assays with differing spatial resolution

Anatomical Ontology for Mouse Development: developed by the Edinburgh Mouse Atlas Project (EMAP); maintained and expanded by EMAP and GXD

Anatomical Ontology for the Adult Mouse: developed and maintained by GXD
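
The hierarchical searches listed on this slide can be illustrated with a small Python sketch. The structure names, the `PARENT` mapping, and the sample annotations are hypothetical and are not taken from the actual EMAP or GXD ontologies. The sketch only shows the idea that a query for a parent structure also returns results annotated to its substructures, regardless of the spatial resolution of the assay.

```python
# Hypothetical toy anatomy hierarchy: child -> parent (not the real ontology).
PARENT = {
    "brain": "central nervous system",
    "forebrain": "brain",
    "diencephalon": "forebrain",
    "telencephalon": "forebrain",
    "hindbrain": "brain",
}

# Hypothetical annotations: (gene, structure the result was annotated to).
ANNOTATIONS = [
    ("GeneA", "diencephalon"),  # fine-grained annotation
    ("GeneB", "brain"),         # coarse annotation
    ("GeneC", "hindbrain"),
]


def is_part_of(structure, ancestor):
    """Return True if `structure` is `ancestor` or lies below it in the hierarchy."""
    while structure is not None:
        if structure == ancestor:
            return True
        structure = PARENT.get(structure)  # None once the root is passed
    return False


def hierarchical_search(query_structure):
    """Return genes annotated to the queried structure or any of its substructures."""
    return sorted(
        {gene for gene, s in ANNOTATIONS if is_part_of(s, query_structure)}
    )
```

In this toy data, a search for "forebrain" finds the gene annotated to "diencephalon", while the gene annotated only to "brain" is not returned because the annotation is too coarse to place it in the forebrain.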

Page 5: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Integrated access to complex and heterogeneous data to facilitate the use of the mouse as an experimental model to study human development and disease.

Integration with all the other data in MGI

Genotype Phenotype Expression

Function!

Many links to other resources:

• PubMed
• OMIM
• GenBank/EMBL/DDBJ
• Entrez Gene
• UniProt
• InterPro
• EMAGE
• GenePaint
• GEO
• ArrayExpress
• IMSR
• Other species DB

Page 6: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

MGI Home Page: www.informatics.jax.org

Page 7: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

GXD Home Page

Page 8: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Recent Progress

• Data Acquisition and Current Data Content

• New Search and Display Features

Page 9: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Data Acquisition for GXD

• curation of expression data from literature

• electronic submission from laboratories: small- and large-scale data

• collaboration with projects that generate data at a large scale

Page 10: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

First step of literature curation: each article is indexed with regard to
- Genes
- Assay types
- Embryonic ages
- Bibliographic information
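
As a rough illustration of what such an index entry might hold, here is a minimal Python sketch. The `IndexEntry` class, its field names, and the sample values are assumptions made for illustration; they do not describe GXD's actual schema.

```python
from dataclasses import dataclass, field


@dataclass
class IndexEntry:
    """Hypothetical index record: one gene in one reference."""

    reference: str  # bibliographic information (placeholder string here)
    gene: str       # standard gene symbol
    # embryonic age -> set of assay types reported for that age
    assays_by_age: dict = field(default_factory=dict)

    def add(self, age, assay_type):
        """Record that the article reports `assay_type` at `age`."""
        self.assays_by_age.setdefault(age, set()).add(assay_type)

    def matches(self, age=None, assay_type=None):
        """True if the entry records the given age and/or assay type."""
        for entry_age, assays in self.assays_by_age.items():
            if age is not None and entry_age != age:
                continue
            if assay_type is None or assay_type in assays:
                return True
        return False
```

An entry built this way can answer questions such as "does this article report Western blot data for this gene at any age?" without the full expression results having been annotated yet.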

Page 11: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

As of 6/15/13:
• 149,941 entries
• 20,996 references
• 15,033 genes
• up-to-date, complete from 1993 (1990) to the present

Page 12: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Page 13: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Superior to PubMed:
• Manual annotation of whole manuscript
• Use of standard gene nomenclature
• Indexing of assay types and embryonic ages

Page 14: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Primary Image Data

Example: RT-PCR

Page 15: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Primary Image Data

Example: Immunohistochemistry

Sections

Page 16: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Antibody detail

Gene

Specimens

Mutant alleles

Results

Link to images

Page 17: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Data Quality Control

• Standard nomenclature
• Extensive use of controlled vocabularies
• Manual and computational consistency checks
• Editorial Interface and QC reports
• Detailed and regularly updated editorial guidelines

Page 18: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Data Quality Control

• Text-based annotations complemented by primary image data
• Annotations are NOT based on our own interpretation of the images. They strictly rely on the statements of the authors.
• Resolution of annotations is determined by details provided in the text of the manuscript.
• We notify authors once data for their publications have been entered. Authors can provide comments and additional information.

Page 19: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Gene Expression Data – Result Annotations

Page 20: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Large-scale Gene Expression Data Sets

Page 21: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Incorporation of large-scale data sets

• Develop parsers to extract and evaluate data
• Manual and computational quality controls
  - verify gene identity: probe to gene mapping
  - verify probe identity: probe already in database?
  - map results to anatomical ontology and other controlled vocabularies
  - resolve ambiguities
  - complete annotations
• Bring data into a standardized format for data loads
• Bulk-load curated data into GXD
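
The computational part of these checks can be sketched in Python as follows. All lookup tables, identifiers, and field names below are hypothetical placeholders, not GXD's actual data or load format. Rows that fail a check are set aside for manual review, which mirrors the "resolve ambiguities" step only in spirit.

```python
# Hypothetical lookup tables (placeholders, not real GXD data).
PROBE_TO_GENE = {"probe-1": "GeneA", "probe-2": "GeneB"}
KNOWN_PROBES = {"probe-1"}  # probes assumed to be in the database already
ANATOMY_TERMS = {"forebrain": "TERM:0001", "heart": "TERM:0002"}


def curate(raw_rows):
    """Split raw rows into load-ready records and rows needing manual review."""
    ready, review = [], []
    for row in raw_rows:
        # Verify gene identity via the probe-to-gene mapping.
        gene = PROBE_TO_GENE.get(row["probe"])
        # Map the free-text structure name to a controlled vocabulary term.
        term = ANATOMY_TERMS.get(row["structure"].strip().lower())
        if gene is None or term is None:
            review.append(row)  # ambiguity left for a curator to resolve
            continue
        ready.append(
            {
                "gene": gene,
                "probe": row["probe"],
                # Verify probe identity: is the probe already known?
                "probe_is_new": row["probe"] not in KNOWN_PROBES,
                "structure_id": term,
                "detected": row["detected"],
            }
        )
    return ready, review
```

The `ready` list stands in for the standardized records that would go into a bulk load; the `review` list stands in for the cases that need a curator.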

Page 22: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

GXD adds value to large-scale data sets from other databases

• data are integrated with all the other data in GXD and MGI
• data are accessible via many new search parameters
• data and data connections are maintained and kept up-to-date

Page 23: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

GXD: Current Data Content

• 249,010 Expression Images
• 1,394,685 Annotated Expression Results
• 63,374 Expression Assays
• 13,751 Genes
• 1,820 Mouse Mutants with Expression Data

Page 24: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Improved Search and Display Capabilities

• Gene Expression Data Query Forms

• Expression Data Summaries

• Expression Assay Details

• Images

Page 25: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

MGI

Gene Detail Page

Page 26: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Page 27: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Page 28: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Query Form - Standard Search

Function (GO), Phenotype, Disease

Anatomy, Dev. Stage, Age

Wild-type / mutant

Assay type

Page 29: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Query Form - Differential Expression Search

Page 31: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Auto-fill function

• 1824 genes annotated to DNA binding
• Expression data are available for this gene set (otherwise ‘DNA binding’ would be greyed out).

Page 32: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

DNA binding genes detected in diencephalon at TS 17-20 by Immunohistochemistry
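
A query of this shape combines four kinds of criteria: function, anatomy, stage range, and assay type. The Python sketch below shows how such a combined filter might work over in-memory records. The gene names, records, and the `standard_search` function are hypothetical. For brevity the sketch matches the anatomical structure exactly rather than searching the hierarchy.

```python
# Hypothetical function annotations: gene -> set of GO terms.
GO_ANNOTATIONS = {
    "GeneA": {"DNA binding"},
    "GeneB": {"DNA binding"},
    "GeneC": {"kinase activity"},
}

IHC = "Immunohistochemistry"

# Hypothetical expression results; "stage" is a Theiler stage (TS) number.
RESULTS = [
    {"gene": "GeneA", "structure": "diencephalon", "stage": 18, "assay": IHC, "detected": True},
    {"gene": "GeneA", "structure": "diencephalon", "stage": 17, "assay": IHC, "detected": False},
    {"gene": "GeneB", "structure": "diencephalon", "stage": 22, "assay": IHC, "detected": True},
    {"gene": "GeneB", "structure": "diencephalon", "stage": 17, "assay": "RNA in situ", "detected": True},
    {"gene": "GeneC", "structure": "diencephalon", "stage": 19, "assay": IHC, "detected": True},
]


def standard_search(go_term, structure, stage_range, assay, results=RESULTS):
    """Return results that satisfy every criterion and report detected expression."""
    lo, hi = stage_range
    return [
        r
        for r in results
        if go_term in GO_ANNOTATIONS.get(r["gene"], set())
        and r["structure"] == structure
        and lo <= r["stage"] <= hi
        and r["assay"] == assay
        and r["detected"]
    ]
```

Calling `standard_search("DNA binding", "diencephalon", (17, 20), IHC)` on this toy data keeps only the records that pass all four criteria at once.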

Page 33: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Summary – Assay Results

• 4 sortable data summaries: genes, assays, assay results, images
• links to detailed annotations and images
• summary data can be downloaded and exported to other applications

Sort
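
Sorting a summary and exporting it for use elsewhere can be sketched with Python's standard `csv` module. The column names and the tab-delimited layout are assumptions for illustration; the slide does not specify the actual download format.

```python
import csv
import io


def export_summary(rows, sort_key, fieldnames):
    """Sort rows by one column and return them as tab-delimited text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(sorted(rows, key=lambda r: r[sort_key]))
    return buffer.getvalue()
```

The returned text can be written to a file and opened in a spreadsheet or read by another analysis tool.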

Page 34: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Summary – Assays

Page 35: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Summary – Genes

Page 37: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Previous Assay Details

reference to 1H, 1J; link to Figure 1

all specimen information displayed upfront

reference to 1E, 1F; link to Figure 1

Page 38: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Links to 3-D mapped images in EMAGE

Page 39: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Assay Details

focus on most important specimen information

images displayed together with result annotations

Page 40: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Summary – Images

Search directly for images using many different query criteria

Page 41: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Summary – Images

Page 42: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New Assay Details

Page 43: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Improved Search and Display Capabilities

• Gene Expression Data Query Forms
  - improved layout
  - new query capabilities

• Strongly enhanced query performance

• Expression Data Summaries
  - more flexible and interactive
  - option to download and export data
  - image summaries

• Expression Assay Details
  - integration of images and annotations
  - improved layout: focus on essential data

Page 44: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

New ways to access GXD Data

• MGI Batch Query

• GXD BioMart

Page 45: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

• Enter list of gene symbols or IDs and look up associated expression data

• Download data and export data to other applications
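
The lookup step described above can be sketched in Python. The symbol table, the identifiers, and the `batch_lookup` function are hypothetical and do not reflect the actual MGI interface. The sketch accepts a mixed list of symbols and IDs and reports any entries it cannot match.

```python
# Hypothetical lookup tables (placeholders, not real MGI data).
SYMBOL_TO_ID = {"genea": "ID:1", "geneb": "ID:2"}  # lower-cased symbol -> gene ID
EXPRESSION_BY_ID = {
    "ID:1": ["assay result 1", "assay result 2"],
    "ID:2": [],  # gene is known but has no expression data
}


def batch_lookup(identifiers):
    """Map each input symbol or ID to its expression data; collect unmatched input."""
    found, unmatched = {}, []
    for raw in identifiers:
        token = raw.strip()
        if token in EXPRESSION_BY_ID:  # input is already a gene ID
            gene_id = token
        else:                          # otherwise treat it as a symbol
            gene_id = SYMBOL_TO_ID.get(token.lower())
        if gene_id is None:
            unmatched.append(token)
        else:
            found[token] = EXPRESSION_BY_ID[gene_id]
    return found, unmatched
```

Reporting unmatched input separately lets a user spot misspelled or outdated symbols instead of silently losing them.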

Page 46: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

GXD BioMart

Find expression data
• for a gene
• for a list of genes
• for an anatomical structure
• for a mutant
• for a reference

Integrated searches across different BioMarts

Page 47: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

GXD BioMart: Query Results (default view)

Export Data

Link to Images

Link to Assay Details

Page 48: Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013

Acknowledgements

Constance Smith, Jacqueline Finger, Terry Hayamizu, Ingeborg McCright, Jingxia Xu, David Shaw, Joanne Berghout, MGI Software Group, Jim Kadin, Joel Richardson, Janan Eppig

GXD is supported by NICHD