Miguel Angel Mayer MD PhD MPH Research Programme on Biomedical Informatics (GRIB) Institut de Recerca de l’Hospital del Mar (IMIM) i Universitat Pompeu Fabra (IMIM-UPF) Coordinator ICT Working Group of CAMFiC Big Data: Big Opportunity? Club Salud y Farma ESADE Alumni Club Digital Business & ICT ESADE Alumni 22nd April 2015, Barcelona

Big Data: Big Opportunity?


Page 1: Big Data: Big Opportunity?

Miguel Angel Mayer MD PhD MPH Research Programme on Biomedical Informatics (GRIB) Institut de Recerca de l’Hospital del Mar (IMIM) i Universitat Pompeu Fabra (IMIM-UPF) Coordinator ICT Working Group of CAMFiC

Big Data: Big Opportunity?

Club Salud y Farma ESADE Alumni Club Digital Business & ICT ESADE Alumni 22nd April 2015, Barcelona

Page 2: Big Data: Big Opportunity?

BIG… WHAT?

Image from InformationWeek

Page 3: Big Data: Big Opportunity?

What is the profile of DM2 (type 2 diabetes mellitus) patients in terms of age, comorbidities, treatment received, genetics and environmental data in different areas?

Surveillance of adverse events in Social Media?

What are the best preventive activities and treatment for a particular rare familial disorder?

Page 4: Big Data: Big Opportunity?

“Big Data are data whose scale, diversity and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it”

IMIA working group on “Data Mining and Big Data Analytics”. From R. Bellazzi, IMIA Yearbook of Medical Informatics 2014

Image from forbes.com

Page 6: Big Data: Big Opportunity?

Integrative Bioinformatics

Integration of heterogeneous biomedical information to gain a more complete and powerful view on diseases and therapeutics

• Clinical Data
• Biomedical Literature
• Drugs & Other Chemicals
• ‘Omics & Systems Biology
• Biomedical Imaging
• Social Media

Modified from Ferran Sanz - GRIB (IMIM-UPF)

Page 7: Big Data: Big Opportunity?

Integrative Bioinformatics

Integration of heterogeneous biomedical information to gain a more complete and powerful view on diseases and therapeutics

• Clinical Data: 40+ million European clinical records will be reused for research in the EMIF project (www.emif.eu)
• Biomedical Literature: 23 million scientific papers referenced in PubMed®, and more than 700,000 are added each year
• Drugs & Other Chemicals: ChEMBL: > 10K targets; > 1.4M compounds; > 12.8M activities
• ‘Omics & Systems Biology: the genome of a person contains > 3,000 M base pairs {G,A,T,C}
• Biomedical Imaging: estimated biomedical imaging worldwide in 2020: 3.5·10^22 bytes (S. Sarcar, GE Healthcare, http://es.slideshare.net/sarcar/data-explosion-in-medical-imaging)
• Social Media

Modified from Ferran Sanz - GRIB (IMIM-UPF)
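To give a feel for the genome figure above (> 3,000 M base pairs over the four-letter alphabet {G,A,T,C}), here is a minimal back-of-the-envelope sketch in Python. It assumes a naive fixed-width encoding of 2 bits per base and ignores everything a real sequencing file carries (both chromosome copies, ambiguity codes, quality scores, metadata). The function names `packed_size_bytes` and `pack` are hypothetical and only serve this illustration.

```python
import math

# Four-letter alphabet as written on the slide; the letter order is arbitrary.
ALPHABET = "GATC"

# Bits needed to tell 4 symbols apart: log2(4) = 2.
BITS_PER_BASE = math.ceil(math.log2(len(ALPHABET)))


def packed_size_bytes(n_bases: int) -> int:
    """Bytes needed to store n_bases at BITS_PER_BASE bits each (rounded up)."""
    return (n_bases * BITS_PER_BASE + 7) // 8


def pack(sequence: str) -> bytes:
    """Pack a short sequence into bytes, 2 bits per base (illustration only)."""
    value = 0
    for base in sequence:
        # Shift the bits collected so far and append this base's 2-bit index.
        value = (value << BITS_PER_BASE) | ALPHABET.index(base)
    return value.to_bytes(packed_size_bytes(len(sequence)), "big")


if __name__ == "__main__":
    # Lower bound taken from the slide: 3,000 million base pairs.
    genome_bases = 3_000_000_000
    print(packed_size_bytes(genome_bases))  # size in bytes under this encoding
    print(pack("GATC"))                     # four bases fit into a single byte
```

Under these assumptions one genome sequence fits in roughly 750 MB, so the scale problem comes less from a single genome than from storing many of them together with the other data sources listed above.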

Page 8: Big Data: Big Opportunity?

Social Media

Groups of healthy eating on Facebook: content and features Leis A, Mayer MA et al. Gaceta Sanitaria 2013

Exploring Brand-Name Drug Mentions on Twitter for Pharmacovigilance Carbonell P, Mayer MA, Bravo A. Proceedings MIE 2015 (in press)

Page 9: Big Data: Big Opportunity?

GRIB participating in different IMI projects

• Exploitation of millions of electronic patient records for contributing to the advancement of biomedical research.
• Budget: 56.4 M€ (24.1 M€ of industrial contribution).
• Duration: January 2013 – December 2017.
• Partners: 9 pharma companies, 7 SMEs, 36 academic institutions and 3 patient organizations.

• Information sharing and integration for the development of advanced predictive models of drug toxicity.
• Budget: 18.7 M€ (10 M€ of industrial contribution).
• Duration: January 2010 – December 2016 (5 years plus 2 years extension).
• Partners: 13 pharma companies, 6 SMEs and 11 academic institutions.
• GRIB: academic coordinator.

Page 10: Big Data: Big Opportunity?

• A comprehensive resource on gene-disease associations
• Integrates information from publicly available databases and the literature (text mining)

Sources: CTD human, CTD mouse & rat, UniProt, GAD, MGD, RGD, LHGDN, BeFree

Source types: Curated, Predicted, Literature

http://ibi.imim.es/DisGeNET

Page 11: Big Data: Big Opportunity?

Challenges and Critical Issues

To implement specific extraction software, controlled processing of data within software environments, and novel analytical methods

To overcome the risk related to evolving ethical and legal regulations around the world

Database ownership and full control over the data

To manage different languages and coding systems and versions such as ICD9-CM, ICD10, ICPC, READ, etc.

The bigger the data, the bigger the likelihood we will interpret it wrong

To assure that all data sources share a common understanding of the required data

Data anonymisation techniques are critical
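The challenge of managing several coding systems can be illustrated with a small sketch. It assumes a hand-written lookup table that maps (coding system, code) pairs to one shared concept label, here type 2 diabetes. The table, the label `T2DM` and the function `harmonise` are hypothetical, and the example codes are for illustration only: a real project would rely on official, versioned mapping tables rather than a dictionary typed by hand.

```python
from typing import Optional

# Illustrative mapping only (an assumption, not an official cross-walk):
# codes that different systems use for type 2 diabetes mellitus.
CODE_TO_CONCEPT = {
    ("ICD9-CM", "250.00"): "T2DM",
    ("ICD10", "E11"): "T2DM",
    ("ICPC", "T90"): "T2DM",
    ("READ", "C10F"): "T2DM",
}


def harmonise(system: str, code: str) -> Optional[str]:
    """Return the shared concept label for a source code, or None if unmapped."""
    key = (system.strip().upper(), code.strip().upper())
    return CODE_TO_CONCEPT.get(key)


if __name__ == "__main__":
    # Records as they might arrive from three differently coded sources.
    records = [("ICD10", "E11"), ("icpc", "t90"), ("ICD10", "I10")]
    for system, code in records:
        print(system, code, "->", harmonise(system, code))
```

Returning `None` for unmapped codes, instead of guessing, keeps gaps in the mapping visible, which matters given the risk of misinterpretation noted above.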

Page 12: Big Data: Big Opportunity?

Final thoughts

Image: The Blue Marble, NASA

Page 13: Big Data: Big Opportunity?

Massive Open Online Medicine resources (MOOMs) Eric J. Topol. Nature Reviews Genetics Vol. 16, May 2015

Big Data (science-based approach)…

…is better data

…the reuse of data

…for helping people

Page 14: Big Data: Big Opportunity?

Contact: Miguel Angel Mayer @mmayerp [email protected] Research Programme on Biomedical Informatics (GRIB) IMIM-UPF http://grib.upf.edu