69
The STRING database Lars Juhl Jensen EMBL Heidelberg

The STRING database

Embed Size (px)

DESCRIPTION

14th International Conference on Intelligent Systems for Molecular Biology, Software demo, Fortaleza Conference Center, Fortaleza, Brazil, August 6-10, 2006

Citation preview

Page 1: The STRING database

The STRING database

Lars Juhl Jensen

EMBL Heidelberg

Page 2: The STRING database

data integration

Page 3: The STRING database
Page 4: The STRING database

functional interactions

Page 5: The STRING database
Page 6: The STRING database

179 proteomes

Page 7: The STRING database

Ensembl

Page 8: The STRING database

SWISS-PROT

Page 9: The STRING database

genomic context methods

Page 10: The STRING database

phylogenetic profiles

Page 11: The STRING database
Page 12: The STRING database
Page 13: The STRING database
Page 14: The STRING database
Page 15: The STRING database

Cell

Cellulosomes

Cellulose

Page 16: The STRING database

gene fusion

Page 17: The STRING database
Page 18: The STRING database

gene neighborhood

Page 19: The STRING database
Page 20: The STRING database

questionable reliability

Page 21: The STRING database

raw quality scores

Page 22: The STRING database

gene neighborhood

Page 23: The STRING database

sum of intergenic distances

Page 24: The STRING database
Page 25: The STRING database

many types of evidence

Page 26: The STRING database

raw quality scores

Page 27: The STRING database

not directly comparable

Page 28: The STRING database

benchmarking

Page 29: The STRING database

calibrate against KEGG

Page 30: The STRING database
Page 31: The STRING database

curated knowledge

Page 32: The STRING database

KEGGKyoto Encyclopedia of Genes and Genomes

Page 33: The STRING database

Reactome

Page 34: The STRING database

MIPSMunich Information center

for Protein Sequences

Page 35: The STRING database

STKESignal Transduction Knowledge Environment

Page 36: The STRING database

primary experimental data

Page 37: The STRING database

many sources

Page 38: The STRING database

parsers

Page 39: The STRING database

co-expression

Page 40: The STRING database

GEOGene Expression Omnibus

Page 41: The STRING database

SMDStanford Microarray Database

Page 42: The STRING database

physical protein interactions

Page 43: The STRING database

BINDBiomolecular Interaction Network Database

Page 44: The STRING database

MINTMolecular Interactions Database

Page 45: The STRING database

GRIDGeneral Repository for Interaction Datasets

Page 46: The STRING database

DIPDatabase of Interacting Proteins

Page 47: The STRING database

HPRDHuman Protein Reference Database

Page 48: The STRING database

literature mining

Page 49: The STRING database

different gene identifiers

Page 50: The STRING database

synonyms lists

Page 51: The STRING database

MEDLINE

Page 52: The STRING database

SGDSaccharomyces Genome Database

Page 53: The STRING database

The Interactive Fly

Page 54: The STRING database

OMIMOnline Mendelian Inheritance in Man

Page 55: The STRING database

co-mentioning

Page 56: The STRING database

NLPNatural Language Processing

Page 57: The STRING database

Gene and protein namesCue words for entity recognitionVerbs for relation extraction

[nxgene The GAL4 gene]

[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]]is controlled by[nxpg HAP1]

Page 58: The STRING database
Page 59: The STRING database

combine all evidence

Page 60: The STRING database

spread over many species

Page 61: The STRING database

transfer by orthology

Page 62: The STRING database
Page 63: The STRING database

orthologous groups

Page 64: The STRING database
Page 65: The STRING database

fuzzy orthology

Page 66: The STRING database

?

Source species

Target species

Page 67: The STRING database

Bayesian scoring scheme

Page 68: The STRING database
Page 69: The STRING database

Acknowledgments

The STRING team (EMBL)– Christian von Mering

– Berend Snel

– Martijn Huynen

– Sean Hooper

– Samuel Chaffron

– Julien Lagarde

– Mathilde Foglierini

– Peer Bork

Literature mining project(EML Research)– Jasmin Saric

– Rossitza Ouzounova

– Isabel Rojas