Upload
bioexcel
View
22
Download
0
Embed Size (px)
Citation preview
bioexcel.eu
Partners Funding
How to choose compute resources for your team
Presenters: Lee LarcombeHost: Adam Carter
BioExcel Webinar Series
8 March, 201714:00 GMT / 15:00 CET
bioexcel.eu
Thiswebinarisbeingrecorded
bioexcel.eu
BioExcel Overview• Excellence in Biomolecular Software
- Improve the performance, efficiency and scalability of key codes
• Excellence in Usability- Devise efficient workflow environments
with associated data integration
• Excellence in Consultancy and Training- Promote best practices and train end users
DMI Monitor
DMI Enactor
DMI Executor
DMI Enactor
Data Delivery Point
Data Source
Monitoring flow
Data flow
Service Invocation
DMI Optimiser
DMI Planner
DMIValidator
DMI Gateway
DMI Gateway
DMI Gateway
DMI Enactor
Portal / Workbench
DMI Request
DADC Engineer
DMI Expert
Repository
Registry
DMI Expert
Domain Expert
bioexcel.eu
Interest Groups
• Integrative Modeling IG• Free Energy Calculations IG• Hybrid methods for biomolecular systems IG• Biomolecular simulations entry level users IG• Practical applications for industry IG• Training IG• Workflows IG
Support platformshttp://bioexcel.eu/contact
Forums Code Repositories Chat channel Video Channel
bioexcel.eu
Audience Q&A session
Please use the Questionsfunction in GoToWebinar
application
Any other questions or points to discuss after the live
webinar? Join the discussion the discussion at
http://ask.bioexcel.eu.
bioexcel.eu
Today’s PresenterLee Larcombe began his scientific career in the lab, with an undergrad degree in Genetics from QMUL and later, a PhD studying Chlamydia trachomatis at Cranfield University from where he worked himself up to the post of Lecturer in Genetics and Computational biology. Here he was the course director of the University's MSc in Applied Bioinformatics. He subsequently made the move to industry: first as bioinformatics lead for Lonza Biologics, and then most recently as an independent consultant.
His research has focused mainly on functional genomics, and data integration for the study of oncology and the discovery of novel biomarkers and targets.
He’s currently ELIXIR-UK's Training Coordinator for Research Science based at the MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh with Prof. Chris Ponting.
6
CHOOSING COMPUTE RESOURCES FOR YOUR TEAMDR LEE LARCOMBE
8
Resources – first thoughts
9
• Before thinking about the technical resources, best to consider the people
• Who do you have – who do you need – is recruitment or collaboration the best way forward
• Most important …
Working with IT• You will probably decide you need to buy a computer (or
maybe re-purpose one)• Talking to your IT department may not be the best way to
do this (although you might not have an option)
• Do not:
10
Expect them to understand your requirements
Expect them to install your software
Offer end-user support
Treat this as scientific equipment!• Computers often have some special purchasing
arrangements – that might not suit you
• Departing from this normal route can be a fight – both with the IT dept and Accounting dept!
• You need to make people understand that this is not just a computer – it is a piece of scientific equipment
11
Data transfer & storage
12
1TB over 100Mbps = 22 hours
Many tasks do not need huge storage – although you should have plenty for backups and long-term archiving
Molecular dynamics simulations and genomic analysis can involved large files
You might need to look into options like RAID
You need to be mindful of your networking!
Platform considerations
13
A basic laptop/desktop
14
What is possible?• Quite a lot!
• Everything web-based. Eg EBI resources – integrated
data and huge amounts of information
• Basic coding – PERL, Python, R etc
• A window to another more powerful system!
15
A workstation
16
What is possible?• Everything we had for the smaller machine, plus…• More & faster – analysis that can use multiple CPUs• Simulation – particularly molecular simulation• Some genome analysis – differential expression/RNAseq,
microarrays, limited metagenomics, re-sequencing, perhaps some prokaryotic de-novo assembly
• Data visualisation – molecular modelling, image analysis (microscopy etc)
17
Something bigger
18
Question why you think you need this?
19
What is really the bottleneck to your research? ie.
• Simple job but lots of data
• Complex job but simple data
• Simple job – but many repetitions
• Is time the key factor? Is this need or convenience?
• (sometimes getting more resource can be slower than getting monkeys to do the job on typewriters)
Accessing local HPC resource
20
You might have local HPC…
Have you checked?
There is a significant chance though that it is not running the software you want!
Accessing alternative resource
21
The key to this challenge is not computing – it’s people
22
Understanding computers and your computational need is important – so you can communicate that need
Most of the battle of acquiring more compute resource is working with those who can support you in getting/using it
Making the most of the resources you have or acquire means having a team who know how to use it
Questions?
23
bioexcel.eu
Audience Q&A session
Please use the Questionsfunction in GoToWebinar
application
Any other questions or points to discuss after the live
webinar? Join the discussion the discussion at
http://ask.bioexcel.eu.