Chemical Interoperability Egon Willighagen BiGCaT – Dept of Bioinformatics 2012-02-17, Utrecht

Chemical Interoperability - Utrecht / NBIC


DESCRIPTION

Presentation given at a parallel NBIC meeting in Utrecht on 17 Feb 2012. This presentation is not Open; figures are available from the cited papers.


Page 1: Chemical Interoperability - Utrecht / NBIC

Chemical Interoperability

Egon Willighagen

BiGCaT – Dept of Bioinformatics

2012-02-17, Utrecht

Page 2: Chemical Interoperability - Utrecht / NBIC

Interoperability in Chemistry

• = Communication, meaning

• Why?
  – Reproducibility
  – Understanding molecular properties

• Interoperability Needs
  – Open Data, Open Standards, Open Source

Page 3: Chemical Interoperability - Utrecht / NBIC

What's that Open fetish?

• Gives
  – Collaboration. Credit. Ownership.
  – Sustainability.

• Through rights
  – Redistribution (under same rights)
  – Modification (not with Open Standards)

Page 4: Chemical Interoperability - Utrecht / NBIC

The Blue Obelisk Movement

R. Guha et al. J. Chem. Inf. Mod. 2006
N. O'Boyle et al. J. Cheminf. 2011

Page 5: Chemical Interoperability - Utrecht / NBIC

1. Open Standards

• Allows:
  – Software to talk to each other

• For:
  – Well-defined formats
  – Concepts (aromaticity!)
  – Practices (licenses, peer review, ...)

Page 6: Chemical Interoperability - Utrecht / NBIC

CML: Exchange of Chemical Structures

P. Murray-Rust et al., J. Chem. Inf. Comput. Sci., 1999
E.L. Willighagen, Internet J. Chem., 2000
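
As a rough illustration of what exchanging a structure in CML can look like, the sketch below parses a small hand-written CML fragment for water using only Python's standard library. The fragment, the atom ids, and the helper name `read_cml` are assumptions made for this example; they are not taken from the cited papers.

```python
import xml.etree.ElementTree as ET

# Hand-written CML fragment for water (illustrative, not from the cited papers).
CML_WATER = """
<molecule xmlns="http://www.xml-cml.org/schema" id="water">
  <atomArray>
    <atom id="a1" elementType="O"/>
    <atom id="a2" elementType="H"/>
    <atom id="a3" elementType="H"/>
  </atomArray>
  <bondArray>
    <bond atomRefs2="a1 a2" order="1"/>
    <bond atomRefs2="a1 a3" order="1"/>
  </bondArray>
</molecule>
"""

# Namespace mapping used in the element paths below.
NS = {"cml": "http://www.xml-cml.org/schema"}


def read_cml(text):
    """Return (atoms, bonds) from a CML molecule string (hypothetical helper)."""
    root = ET.fromstring(text.strip())
    # Map atom id -> element symbol.
    atoms = {a.get("id"): a.get("elementType")
             for a in root.findall("cml:atomArray/cml:atom", NS)}
    # Each bond becomes (first atom id, second atom id, bond order).
    bonds = [tuple(b.get("atomRefs2").split()) + (b.get("order"),)
             for b in root.findall("cml:bondArray/cml:bond", NS)]
    return atoms, bonds


atoms, bonds = read_cml(CML_WATER)
print(atoms)
print(bonds)
```

Because the atoms and bonds are explicit XML elements, any tool with an XML parser can read the same structure without guessing at column layouts.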

Page 7: Chemical Interoperability - Utrecht / NBIC

CML in RSS news feeds

P. Murray-Rust et al., J. Chem. Inf. Comput. Sci, 2004

Page 8: Chemical Interoperability - Utrecht / NBIC

Resource Description Framework

• Family of W3C technologies
  – RDF / RDFS: basic framework
  – Serialization formats
    • RDF/XML, Turtle, RDFa, RDF/JSON
  – OWL: ontologies (BioPortal, OLS, …)
  – SPARQL
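
To make the triple model behind RDF concrete, here is a toy sketch in plain Python. It is not an RDF library: the `ex:` prefix, the property names, and the `match` helper are all made up for illustration. A query such as `match(p="rdf:type", o="ex:Molecule")` plays roughly the role of the SPARQL pattern `?s rdf:type ex:Molecule`.

```python
# Toy triple store: each fact is a (subject, predicate, object) tuple.
# The prefixes and property names are hypothetical, chosen for illustration.
TRIPLES = {
    ("ex:aspirin", "rdf:type", "ex:Molecule"),
    ("ex:aspirin", "rdfs:label", "aspirin"),
    ("ex:aspirin", "ex:formula", "C9H8O4"),
    ("ex:ethanol", "rdf:type", "ex:Molecule"),
    ("ex:ethanol", "rdfs:label", "ethanol"),
}


def match(s=None, p=None, o=None, triples=TRIPLES):
    """Return sorted triples matching the pattern; None acts as a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


# Which subjects are typed as molecules?
molecules = [s for s, _, _ in match(p="rdf:type", o="ex:Molecule")]
print(molecules)

# Everything known about aspirin.
aspirin_facts = match(s="ex:aspirin")
print(aspirin_facts)
```

Real deployments would use a proper RDF store and SPARQL endpoint; the point here is only that every statement has the same three-part shape, which is what makes data from different sources mergeable.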

Page 9: Chemical Interoperability - Utrecht / NBIC

Semantic Web

Page 10: Chemical Interoperability - Utrecht / NBIC

IUPAC International Chemical Identifier

N. Day. InChI FAQ, http://wwmm.ch.cam.ac.uk/inchifaq/
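
An InChI string is built from slash-separated layers, which is part of what makes it usable as an exchange identifier. The sketch below splits the standard InChI of ethanol into those layers. The helper name `split_inchi` is an assumption for this example, and the parsing is deliberately simplified: it assumes a formula layer is present and that no layer prefix repeats, which does not hold for every InChI.

```python
def split_inchi(inchi):
    """Split an InChI string into version, formula, and prefixed layers.

    Simplified sketch: assumes a formula layer and unique layer prefixes.
    """
    if not inchi.startswith("InChI="):
        raise ValueError("not an InChI string")
    parts = inchi[len("InChI="):].split("/")
    version, formula, rest = parts[0], parts[1], parts[2:]
    # Each remaining layer starts with a one-letter prefix, e.g. 'c' or 'h'.
    layers = {layer[0]: layer[1:] for layer in rest}
    return {"version": version, "formula": formula, "layers": layers}


ethanol = split_inchi("InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3")
print(ethanol)
```

Here the `c` layer carries the atom connectivity and the `h` layer the hydrogen positions.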

Page 11: Chemical Interoperability - Utrecht / NBIC

2. Open Data

• What is the average C=N bond length?
  – Bioinformatics: force fields

• What is the LogP of aspirin?
  – Drug discovery

• What metabolite is that?
  – Metabolomics

Page 12: Chemical Interoperability - Utrecht / NBIC

Linked Open Drug Data

M. Samwald et al., J. Cheminf, 2011

Page 13: Chemical Interoperability - Utrecht / NBIC

Blue Obelisk Data Repository

● Which two tools calculate the same molecular weight?

● Isotope abundances and weights!

● IUPAC reports

R. Guha et al. J. Chem. Inf. Mod. 2006
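
The molecular weight question can be made concrete with a small sketch: the same formula gives different weights depending on which atomic-weight table a tool ships with. The two tables below are illustrative assumptions (abridged standard atomic weights versus values rounded to two decimals), and `molecular_weight` is a hypothetical helper that handles only simple formulas without brackets or charges.

```python
import re

# Two illustrative atomic-weight tables (assumptions for this example):
# abridged standard values, and the same values rounded to two decimals.
WEIGHTS_PRECISE = {"C": 12.011, "H": 1.008, "O": 15.999}
WEIGHTS_ROUNDED = {"C": 12.01, "H": 1.01, "O": 16.00}


def molecular_weight(formula, weights):
    """Sum atomic weights for a simple formula such as 'C9H8O4'."""
    total = 0.0
    # Each match is (element symbol, optional count); a missing count means 1.
    for symbol, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += weights[symbol] * (int(count) if count else 1)
    return total


mw_precise = molecular_weight("C9H8O4", WEIGHTS_PRECISE)  # aspirin
mw_rounded = molecular_weight("C9H8O4", WEIGHTS_ROUNDED)
print(round(mw_precise, 3), round(mw_rounded, 3))
```

A shared, openly curated table of isotope and element data removes this source of disagreement between tools.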

Page 14: Chemical Interoperability - Utrecht / NBIC

3. Open Source

• What is aromaticity?
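
One reason the question is hard to answer is that different tools can apply different aromaticity models. As a single, deliberately simple illustration (not the algorithm of any particular toolkit), the sketch below checks Hückel's 4n + 2 rule on a pi-electron count. The function name is an assumption for this example, and the rule alone ignores ring planarity and conjugation.

```python
def satisfies_huckel(pi_electrons):
    """True if the count equals 4n + 2 for some integer n >= 0."""
    return pi_electrons >= 2 and (pi_electrons - 2) % 4 == 0


print(satisfies_huckel(6))  # benzene has 6 pi electrons
print(satisfies_huckel(4))  # cyclobutadiene has 4 pi electrons
```

With open source code, the exact model a tool uses can be read and compared rather than guessed.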

Page 15: Chemical Interoperability - Utrecht / NBIC

Chemistry Development Kit

• Cheminformatics
  – Aromaticity
  – ...

• Many derived tools
  – Taverna, KNIME, Bioclipse, Cinfony, ...

C. Steinbeck et al. J. Chem. Inf. Mod. 2003
C. Steinbeck et al. Curr. Pharm. Des. 2006

Page 16: Chemical Interoperability - Utrecht / NBIC

Scripting ...

O. Spjuth et al. BMC Bioinf. 2009
T. Kuhn et al. BMC Bioinf. 2010

Page 17: Chemical Interoperability - Utrecht / NBIC

Predictive Toxicology I

O. Spjuth et al. J. Chem. Inf. Mod. 2011
E.L. Willighagen et al. BMC Research Notes. 2011

Page 18: Chemical Interoperability - Utrecht / NBIC

Predictive Toxicology II

E.L. Willighagen et al. J. Biomed Sem. 2011

Page 19: Chemical Interoperability - Utrecht / NBIC

Conclusions

• Interoperability via ODOSOS
  – Open Data
  – Open Standards
  – Open Source

Page 20: Chemical Interoperability - Utrecht / NBIC

Thanx / More info

• Blue Obelisk movement

• CDK developers (Miguel!)

• http://scholar.google.com/citations?user=u8SjMZ0AAAAJ

• http://chem-bla-ics.blogspot.com/