12
Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: An Amazon Machine Image with lots of handy bioinformatics software pre-installed. – Eucalyptus image –…

Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Embed Size (px)

Citation preview

Page 1: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Cloudbiolinux = Genomics resources for use on cloud platforms. Includes:– An Amazon Machine Image with

lots of handy bioinformatics software pre-installed.

– Eucalyptus image–…

Page 2: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Cloudbiolinux is good for…

• You need lots of bioinformatics software and don’t want to bother installing all of it.

• You need a cluster to compute your job and you need some/lots of bioinformatics software pre-installed.

Page 3: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Show software…

Page 4: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Using cloudbiolinux

1. Fire up an individual EC2 instance using the AMIs

Page 5: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Show AMIs…

Page 6: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Using cloudbiolinux

1. Fire up an individual EC2 instance using the AMIs

2. Create a cluster using biocloudcentral.

Page 7: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Demo cloudman…

Page 8: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Head node (“master”)

Transient NFS

Compute nodes

Instance storage

Compute jobs (SGE)

Page 9: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Head node (“master”)

Transient NFS

Compute nodes

Instance storage

Input data

Page 10: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Head node (“master”)

Transient NFS

Compute nodes

Instance storageOutput

data

Page 11: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Head node (“master”)

Transient NFS

Compute nodes

Instance storageOutput

data

Post-processed data

Page 12: Cloudbiolinux = Genomics resources for use on cloud platforms. Includes: – An Amazon Machine Image with lots of handy bioinformatics software pre-installed

Points to remember

• Select the right size worker nodes for your jobs.

• Job allocation scheme used in SGE can be adjusted for your needs.

• You can add EBS volumes if you want to save the data and compute again later.