Biomedical Informatics Hub

  • Upload
    jaron

  • View
    67

  • Download
    0

Embed Size (px)

DESCRIPTION

Biomedical Informatics Hub. www.exeter.ac.uk/biomedicalhub. Wellcome Trust ISSF Biomedical Informatics Hub. Dr Konrad Paszkiewicz. Contents. Wellcome Trust ISSF Background Seed Corn Fund Aims Biomedical Informatics Hub Purpose Expertise Resources Training Seminars - PowerPoint PPT Presentation

Citation preview

Wellcome Trust ISSF Biomedical Informatics Hub

www.exeter.ac.uk/biomedicalhubBiomedical Informatics Hub

Wellcome Trust ISSF Biomedical Informatics HubDr Konrad PaszkiewiczContentsWellcome Trust ISSFBackgroundSeed Corn FundAimsBiomedical Informatics HubPurposeExpertiseResourcesTrainingSeminarsProject booking/PrioritisationSustainability

Institutional Strategic Support FundA 1.5 million Wellcome Trust award.. .....match funded by the University

to enhance institutional strategies for the biomedical sciences to support scientific progress, translation and interdisciplinary collaboration to encourage greater efficiency, effectiveness and accountability in the stewardship of Wellcome Trust funding

University of Exeter ISSF projectCreate a virtual Biomedical Informatics Hubto maximise output from emerging technologies and large datasetsEstablish an ISSF Seed Corn Fund to support highly-talented researchers to generate preliminary data in support of independent fellowship and grant applications8 projects supported in 2012University match-funding = new academic posts that will utilise the Hub to enable major advances in interdisciplinary biomedical research

Seed Corn FundRound 3: Who?outstanding postdoctoral researchers - to generate preliminary data to support independent Fellowship applications early career academics - to generate preliminary data in support of research grant applicationsmid-career and senior academics seeking pump priming support for new activity that will lead to a Wellcome Trust application (first time applicants to the Trust particularly encouraged) Round 3: What?Must be in the Wellcome Trusts remit5 to 7 projects (up to 12 months)Up to 20k (30k for senior researchers)Closing date = 1st Feb 2013Assessment criteriatrack recordproposed projectoutputs Contact Allison McCaig ([email protected])

Biomedical Informatics HubPurpose1. Supply informatics expertise in bioinformatics, mass-spectrometry, imaging and statistics2. Train researchers through existing and newly developed programs to make best use of their data3. Supply PhD students, postdocs and junior researchers with the tools and expertise to produce 4* research 4. Provide leadership, governance and coordination to leverage and extend Exeter's IT research infrastructure5. Support researchers with access to centralised computational infrastructure and equipment6. Act as a central point of contact for initial research ideas

ExpertiseBioinformaticsMedical - Marcus Tuke, Anna Murray & Mike Weedon (Prof. Tim Frayling)Bioscience - Dr Christine Sambles (Sequencing Service)Metabolomics/ProteomicsDr Venura Perera (Dr Hannah Florance)Image analysisDr Jeremy Metz (starting November) (Prof. Rob Beardmore)Tomography and EMTBC (Prof. Gero Steinberg)StatisticsTBC (Prof. Tim Frayling)Computational modelling/sensor-based networksDr Mahmood Javaid (Prof. Ed Watkins)ResourcesSequencing Service (Konrad Paszkiewicz, Karen Moore)Illumina HiSeq 2000 Mass-spectrometry Service (Hannah Florance)Agilent QQQAgilent QTOFMetabolomics/ProteomicsImaging (Martin Schuster, Gero Steinberg)Confocal, EM Tomography, SEM, TEMComputational ResourcesZeus cluster

ResourcesZeus Systems Biology cluster

19 nodes with 144 coresNodes have between 24Gb and 192Gb RAM150TB of storageDesigned for serial jobs, not MPI/OpenMP jobsUpcoming 150k upgrade

Contact Konrad for access

Hub Training CoursesTraining courses currently offered: Amazon EC2 cloud computingUnix & PerlShort read genomicsPhylogeneticsUse of R statistical packageRNA-seqNew courses in 2013

6. SeminarsMonthly seminars/workshop meetingsScope: bioinformatics, statistics, image analysis, programming

Submitting requestsYou can approach Hub members informally for initial discussions about projectsIf youre unsure email me

Submitting requestsBUT before they undertake any work for you:You must apply via bioinformatics-hub.ex.ac.ukAwait approval by the head of discipline (no more than a few days)You need to specify up-front outcomes in terms of grant applications and/or publications and when you expect to achieve theseAllison McCaig will review outputs. Users who use Hub time and do not publish or apply for grants outcomes will be penalised in future hub applications.

Some screenshots.....Project proposal submission

bio-informaticsHub.ex.ac.ukProject proposal submission

bio-informaticsHub.ex.ac.ukProject proposal tracking

bio-informaticsHub.ex.ac.ukPrioritisationWellcome Trust remit projects have priority to achieve extraordinary improvements in human and animal health....... we support the brightest minds in biomedical research and the medical humanities.See Research Challenges at www.wellcome.ac.uk

Seed Corn Fund projectsExisting Wellcome Trust funded projectsWellcome Trust remit researchOther research

SustainabilityFunding via the Wellcome Trust ISSF for up to three years (annual approval)To continue the Hub, we need to ensure members are costed into grant applications nowAny projects which use Hub expertise and subsequently apply for a grant need to cost relevant members Hub team presentationsVen PereraWho Am IDr Venura PereraLocated Room: M01 Mezzanine FloorBiosciencesCollege of Life and Environmental SciencesGeoffrey PopeBackground:PhD in Systems biology/BioinformaticsUsing mathematical methods for analysis of untargeted mass profiling data sets.Applied mathematicsUse of numerical methods for forecasting and predictive modeling of non-linear systemsData drive techniques for parameter optimizationOrdinary and partial differential equations for system modeling

My skillsApplied MathematicsUse of Numerical models for data modelingDevelopment of dynamic models using numerical methodsStatistical Modeling using non-parametric methodsLarge Data Sets AnalysisFocus on Untargeted Metabolite dataNew methods to solve any question Metabolite Pipeline ExSpecDevelopment of metabolite centers untargeted pipelineProgrammingJava Data ModelingR Mainly for data visualizationMatlab Data Modeling and VisualizationHTML, PHP, MySQL and basic Java Script

Further the use of MS techniquesNumerical methods for data modellingUsing probabilistic methods for data driven coupled systemsDevelopment of techniques for the MS fingerprintingUsing MS methods for polygenetic tree reconstructionMy research interests

Targeted / Quantitative Analysis

Multiple Reaction MonitoringStatistical analysis, Modelling and ClusteringSampleComparisonQuantification

Exploratory Non-targeted AnalysisExtract DataMolecular Feature Extraction (MFE) and data pre-processingMy WorkSmall Molecule analysis: Untargeted profiling and Targeted quantificationData Reduction extracted data loaded into the pipeline which Ven will explain29Targeted / Quantitative Analysis

Multiple Reaction MonitoringStatistical analysis, Modelling and ClusteringSampleComparisonQuantification

Exploratory Non-targeted AnalysisExtract DataMolecular Feature Extraction (MFE) and data pre-processing

Example ProjectTwo groups of patient types : HealthyCancerThree patientsProject Aim: Using a variety of techniques both targeted analysis as well as untargeted profiling to determine key components which are effected by the cancer

Example Project

Example ProjectPatient 24 had a large amount of inflammation causing the control and cancer tissue to share a great deal of features similarities.33http://biosciences.exeter.ac.uk/facilities/spectrometry/http://bio-massspeclocal.ex.ac.uk/

Nick Smirnoff (Director of Mass Spectrometry) [email protected] Florance (MS Facility Manager) [email protected] Perera (Bioinformatics and Mathematical Support) [email protected]

Numerical Dynamic ModelingReaction scheme:

AK: known substrate of B (can be removed if un-mappedAU: Unknown substrate of BB : Substrate to modelE : Enzyme(s) controlling the reactionC : Known product of BCU: Unknown product of BBU : Unknown substrate of CAkAUBCUCEBUExample of Substrate-Product combo

Substrate-Product combo: Switch profiles show delayed affect indicating possible reaction schemeProfiles visually illustrate similarityAkAUBCUCEBUAnna Murray & Mike Weedon

Mike Weedon, PhDLecturer in Bioinformatics and Statistics, Exeter Medical School, St Lukes [email protected] in Biochemistry and Molecular BiologyPhD in molecular genetics of type 2 diabetes, Peninsula Medical SchoolPostdoc on genetics of complex traits, Peninsula Medical SchoolCurrent research interests genome-wide investigations of monogenic and polygenic traits Anna Murray, PhDSenior Lecturer in Human Genetics, Exeter Medical School, St Lukes [email protected] in BiologyPhD in molecular biology of T cell receptor rearrangement in coeliac disease in SouthamptonPostdoc on population genetics of Fragile X syndrome in Salisbury/SouthamptonCurrent research interests Genetics of female reproductive lifespan

Whole exome/genome sequencingENCODE and other projects to annotate non-coding genome

Genome-wide SNP and expression array studiesBiological pathway analysis

Statistical analysis of large datasetsMarcus Tuke

BackgroundComputer Science Graduate University of Exeter Medical SchoolSupercomputer systems administrator8 months working on projects at the Wellcome Trust Centre for Human GeneticsMSc Bioinformatics

42Expertise I can offer

Human Next Generation Sequence data processing and analysisHuman RNA-Seq processing and analysisComputational expertise4343Expertise I can offer

Human RNA-Seq processing and analysisHuman Next Generation Sequence data processing and analysisComputational expertise4444

Computational expertise

Help with projects that could benefit from expertise in:

Programming/scripting

Database/web development

Linux command line and cluster computing

4545Expertise I can offer

Human RNA-Seq processing and analysisComputational expertiseHuman Next Generation Sequence data processing and analysis4646Human Next Generation Sequence data processing and analysis

Align Next Generation Sequencing reads to reference genome

Pipelines to process and call variation in aligned genome including:

Pipelines to filter false positives and QC analysis

Reference GenomegccgggcDeletionSNP g>c heterozygoteRealign reads mis-aligned in genome due to indels 4747Expertise I can offer

Computational expertiseHuman Next Generation Sequence data processing and analysisHuman RNA-Seq processing and analysis4848Human RNA-Seq processing and analysis

Align RNA-Seq reads to Human Genome Transcriptome assemblyDifferential expression analysis

4949Research Interests

Understanding the genetic basis of human diseases and traits

Genome Wide Association Studies (GWAS) have identified numerous genomic regions associated with several major diseases

However, these studies have focussed mostly on common single nucleotide genetic variants

How much more disease-associated genetic variation can we discover from 'higher resolution' technologies such as next generation sequencing (NGS)?

- Rarer- Structural- Non-autosomal- Sub-groups

5050Whole-genome sequencing in the Inchianti StudyOngoing projects

A longitudinal study of aging from the Chianti region of Tuscany, Italy1453 individuals have been followed up over 4 waves since 1998Extensive phenotypic and biomedical information collectedHundreds of circulating biomarkers (e.g. Interleukins, sex hormones, vitamins) measuredRNA expression profiling in lymphocytes (Illumina 46K array)Methylation profiling (450K array)Already had the Illumina 550K SNP Chip genotypedWe are performing low-pass (median 7X) whole genome sequencing on 680 of these individuals

http://www.inchiantistudy.netRNA-Seq whole-transcriptome assembly and analysisAssemble transcriptomes for 3 primary human microvascular endothelial cells 2 treated with cathepsin L and D respectively, and 1 controlAnalyse whether there are any differences in expression of RNA between two treatments and control

5151Christine SamblesChristine's preziJeremy MetzJeremy Metz, PhDPhD: Quantum computing - theory & simulations Imperial College LondonPostdoc:Biological image analysis & cellular modelling Einstein College of Medicine, NYResearch Interests Image processing and computer vision applied to biological systemsObject segmentation Tracking in range of dynamical scenariosQuantitative analysis

Modelling biological systems - analytical and numerical approachesMonte-carlo simulationsIntegration of data and insight from multiple scales of inquiry, e.g. AFM, single cell analysis, high throughput screeningSkills and ExpertisePython, C++, Java, MatlabImage processingMatlab - image processing toolkitPython - Scipy and OpenCV bindingsJava - ImageJ pluginsSimulationC/C++ for fast low-level routinesPython as "glue" code and visualizationLinux, shell scripting, Oracle Grid Engine Past Projects - Image processingObject tracking: Developed novel cross-correlation and Bayesian state estimation based object tracker

Object segmentation: (In progress) Object detection and segmentation based on Scale-space representation formalismSimulationBiological system: Kinesin diffusion-reaction during mitosis, spindle photo -bleached at t=0. Model using minimalist feature cell:

Check for presence of competition for binding sites between diffusing species

Quantum system: Noisy (open) atom-cavity system, laser illuminationHow can we work together?

With my expertise in the fields of biological data/image analysis, and mathematical & computational modeling, how can we combine our skills to produce an outstanding research project? ExperimentsExtract dataModel, Simulation TheoryMahmood JavaidArea of expertise:Computational ModellingSensor-based Networks and Embedded SystemsHigh Performance ComputingSoftware Development

Mahmood Javaid PhD Computer scienceExperimental Officer- Research ComputingWellcome Trust Biomedical Informatics HubComputational ModellingAgent-based modelling using Agent-based simulation frameworks such as FLAME and SWARMBenefits:Close association between the model entities and the real-world agentsHeterogeneous agents within an environmentInteraction between the agents through message passingPotential to uncover emergent behavioursPossibility of multi-scale modelling

Some examplesProspecting asteroid belt

Hypothesis testing

Economics and Systems biological applications of ABMEurace project (Dept. of Computer Science, University of Sheffield.)

System Understanding of Microbial Oxygen Responses (SUMO) Sensor-based networks, embedded systemsInvolving proximity sensors such as Ultrasonic, Infrared, and Laser-based.Inertial management units such Accelerometer, Gyroscope, and Magnetometer.Interfacing between sensors and computation units using interconnects such as Serial ports, Bluetooth, and I2C.PIC and AVR microcontroller based systems Sensory augmentation deviceIn order to move around in the world safely and quickly most of us are highly reliant on our visual sense. When vision is compromised, the problem of safely finding our way becomes much more difficult.Possibility of augmenting our existing senses with a form of remote touch generated by artificial distance sensors and tactile stimulus.

Movement is critical to how we use our tactile sense.We explore objects through touch by controlling the way that we move our sensory surfaces over themstroking with the fingertips to investigate texture, for instance, or palpating to investigate shape.

Active TouchInspired from biological and biomimetics

Physical Components

First Prototype

High Performance ComputingAgent-based models running on Linux HPC clustersAdministration of Linux HPC clusterPublishing legacy applications running on Linux clusters using web-servicesSoftware DevelopmentDesktop applications for Linux and Windows platforms including MatlabWeb-based applications and publishing legacy systems using web-servicesMobile Operating System applications

Online Depression and Mood Disorder Screener

www.exeter.ac.uk/biomedicalhub

Thanks for listening.....