24
bioexcel.eu Partners Funding How to choose compute resources for your team Presenters: Lee Larcombe Host: Adam Carter BioExcel Webinar Series 8 March, 2017 14:00 GMT / 15:00 CET

BioExcel Webinar Series #12: "How to choose compute resources for your team"

Embed Size (px)

Citation preview

Page 1: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

Partners Funding

How to choose compute resources for your team

Presenters: Lee LarcombeHost: Adam Carter

BioExcel Webinar Series

8 March, 201714:00 GMT / 15:00 CET

Page 2: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

Thiswebinarisbeingrecorded

Page 3: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

BioExcel Overview• Excellence in Biomolecular Software

- Improve the performance, efficiency and scalability of key codes

• Excellence in Usability- Devise efficient workflow environments

with associated data integration

• Excellence in Consultancy and Training- Promote best practices and train end users

DMI Monitor

DMI Enactor

DMI Executor

DMI Enactor

Data Delivery Point

Data Source

Monitoring flow

Data flow

Service Invocation

DMI Optimiser

DMI Planner

DMIValidator

DMI Gateway

DMI Gateway

DMI Gateway

DMI Enactor

Portal / Workbench

DMI Request

DADC Engineer

DMI Expert

Repository

Registry

DMI Expert

Domain Expert

Page 4: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

Interest Groups

• Integrative Modeling IG• Free Energy Calculations IG• Hybrid methods for biomolecular systems IG• Biomolecular simulations entry level users IG• Practical applications for industry IG• Training IG• Workflows IG

Support platformshttp://bioexcel.eu/contact

Forums Code Repositories Chat channel Video Channel

Page 5: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

Audience Q&A session

Please use the Questionsfunction in GoToWebinar

application

Any other questions or points to discuss after the live

webinar? Join the discussion the discussion at

http://ask.bioexcel.eu.

Page 6: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

Today’s PresenterLee Larcombe began his scientific career in the lab, with an undergrad degree in Genetics from QMUL and later, a PhD studying Chlamydia trachomatis at Cranfield University from where he worked himself up to the post of Lecturer in Genetics and Computational biology. Here he was the course director of the University's MSc in Applied Bioinformatics. He subsequently made the move to industry: first as bioinformatics lead for Lonza Biologics, and then most recently as an independent consultant.

His research has focused mainly on functional genomics, and data integration for the study of oncology and the discovery of novel biomarkers and targets.

He’s currently ELIXIR-UK's Training Coordinator for Research Science based at the MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh with Prof. Chris Ponting.

6

Page 7: BioExcel Webinar Series #12: "How to choose compute resources for your team"

CHOOSING COMPUTE RESOURCES FOR YOUR TEAMDR LEE LARCOMBE

Page 8: BioExcel Webinar Series #12: "How to choose compute resources for your team"

8

Page 9: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Resources – first thoughts

9

• Before thinking about the technical resources, best to consider the people

• Who do you have – who do you need – is recruitment or collaboration the best way forward

• Most important …

Page 10: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Working with IT• You will probably decide you need to buy a computer (or

maybe re-purpose one)• Talking to your IT department may not be the best way to

do this (although you might not have an option)

• Do not:

10

Expect them to understand your requirements

Expect them to install your software

Offer end-user support

Page 11: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Treat this as scientific equipment!• Computers often have some special purchasing

arrangements – that might not suit you

• Departing from this normal route can be a fight – both with the IT dept and Accounting dept!

• You need to make people understand that this is not just a computer – it is a piece of scientific equipment

11

Page 12: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Data transfer & storage

12

1TB over 100Mbps = 22 hours

Many tasks do not need huge storage – although you should have plenty for backups and long-term archiving

Molecular dynamics simulations and genomic analysis can involved large files

You might need to look into options like RAID

You need to be mindful of your networking!

Page 13: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Platform considerations

13

Page 14: BioExcel Webinar Series #12: "How to choose compute resources for your team"

A basic laptop/desktop

14

Page 15: BioExcel Webinar Series #12: "How to choose compute resources for your team"

What is possible?• Quite a lot!

• Everything web-based. Eg EBI resources – integrated

data and huge amounts of information

• Basic coding – PERL, Python, R etc

• A window to another more powerful system!

15

Page 16: BioExcel Webinar Series #12: "How to choose compute resources for your team"

A workstation

16

Page 17: BioExcel Webinar Series #12: "How to choose compute resources for your team"

What is possible?• Everything we had for the smaller machine, plus…• More & faster – analysis that can use multiple CPUs• Simulation – particularly molecular simulation• Some genome analysis – differential expression/RNAseq,

microarrays, limited metagenomics, re-sequencing, perhaps some prokaryotic de-novo assembly

• Data visualisation – molecular modelling, image analysis (microscopy etc)

17

Page 18: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Something bigger

18

Page 19: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Question why you think you need this?

19

What is really the bottleneck to your research? ie.

• Simple job but lots of data

• Complex job but simple data

• Simple job – but many repetitions

• Is time the key factor? Is this need or convenience?

• (sometimes getting more resource can be slower than getting monkeys to do the job on typewriters)

Page 20: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Accessing local HPC resource

20

You might have local HPC…

Have you checked?

There is a significant chance though that it is not running the software you want!

Page 21: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Accessing alternative resource

21

Page 22: BioExcel Webinar Series #12: "How to choose compute resources for your team"

The key to this challenge is not computing – it’s people

22

Understanding computers and your computational need is important – so you can communicate that need

Most of the battle of acquiring more compute resource is working with those who can support you in getting/using it

Making the most of the resources you have or acquire means having a team who know how to use it

Page 23: BioExcel Webinar Series #12: "How to choose compute resources for your team"

Questions?

23

Page 24: BioExcel Webinar Series #12: "How to choose compute resources for your team"

bioexcel.eu

Audience Q&A session

Please use the Questionsfunction in GoToWebinar

application

Any other questions or points to discuss after the live

webinar? Join the discussion the discussion at

http://ask.bioexcel.eu.