Biomedicine and Big Data Analyzing spatio-temporal patterns in biomedical data Normal Stiff Wavy

Embed Size (px)

DESCRIPTION

Our Mission High-throughput biomedical data analysis

Citation preview

Biomedicine and Big Data Analyzing spatio-temporal patterns in biomedical data Normal Stiff Wavy My Research Group Dr. Chakra Chennubhotla Ph.D. Computer Science University of Toronto Andrej Savol B.S. Applied Mathematics University of Pittsburgh Virginia Burger M.S. Mathematics University of Vienna Shannon Quinn B.S. Computer Science Georgia Tech Our Mission High-throughput biomedical data analysis Problem and Solution Biomedical and biological data are BIG MapReduce! C0C1C2C3 M0M1M2M3 IO0IO1IO2IO3 R0R1 FO0FO1 chunks mappers Reducers Map Phase Reduce Phase Shuffling Data Specifically Clustering ! Requirements Java Apache Hadoop or Amazon EC2 Apache Mahout Comfortable with linear algebra Ax = b X = U U T Hive, HBase, Giraph, GraphLab, etc optional but awesome Final Thoughts Distributed computing Open source development Programming at scale Large project management Software engineering principles, tools Biomedical context Biological data is huge Diagnostics: helping people Questions? Comments? Interested? || spq