8
The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014 – Omaha, Nebraska Jason Williams Cold Spring Harbor Laboratory / iPlant Collaborative

The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

Embed Size (px)

Citation preview

Page 1: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

The iPlant Collaborative Community Cyberinfrastructure for Life Science

Network for Integrating Bioinformatics into Life Sciences EducationApril, 2014 – Omaha, Nebraska

Jason Williams Cold Spring Harbor Laboratory / iPlant Collaborative

Page 2: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

Cyberinfrastructure is a key enabler for bioinformatics education

Page 3: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

What is Cyberinfrastructure?

iPlant Data Store100 GB allocation per user – up to ~10TB

Discovery Environment

100’s Bioinformatics Apps in WYSIWYG interface

AtmosphereOne-click access to configurable

virtual machines

DNA SubwayClassroom-friendly workflows

for Annotation, DNA Barcoding, RNA-Seq

Training and SupportOnline and onsite training, user forums, support

HPC ResourcesUA Condor Cluster + XSEDE resources at TACC

Science APIsLow-level system APIs + Public Science APIs

Page 4: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

Faculty identified guiding requirements that shape the development of iPlant’s educational platforms:

• Mix lecture and lab – have a wet bench “hook”

• Student-scientist partnerships – someone has to care about the data

• Co-investigation – projects should potentially lead to publications

• Scale – platforms should support projects multiple classrooms can join.

Guiding Principles

Page 5: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

DNA SubwayEducational workflows for Genomes, DNA Barcoding, RNA-Seq

Commonly used bioinformatics tools in streamlined workflows

Teach important concepts in biology and bioinformatics

Inquiry-based experiments for novel discovery and publication of data

Page 6: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

Sequencing

Towards an egalitarian biologyNA purification

Page 7: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

Towards an egalitarian bioinformatics

$ tophat -p 8 -G genes.gtf -o C1_R1_thout genome C1_R1_1.fq C1_R1_2.fq$ tophat -p 8 -G genes.gtf -o C1_R2_thout genome C1_R2_1.fq C1_R2_2.fq$ tophat -p 8 -G genes.gtf -o C1_R3_thout genome C1_R3_1.fq C1_R3_2.fq$ tophat -p 8 -G genes.gtf -o C2_R1_thout genome C2_R1_1.fq C1_R1_2.fq$ tophat -p 8 -G genes.gtf -o C2_R2_thout genome C2_R2_1.fq C1_R2_2.fq$ tophat -p 8 -G genes.gtf -o C2_R3_thout genome C2_R3_1.fq C1_R3_2.fq

$ cufflinks -p 8 -o C1_R1_clout C1_R1_thout/accepted_hits.bam$ cufflinks -p 8 -o C1_R2_clout C1_R2_thout/accepted_hits.bam$ cufflinks -p 8 -o C1_R3_clout C1_R3_thout/accepted_hits.bam$ cufflinks -p 8 -o C2_R1_clout C2_R1_thout/accepted_hits.bam$ cufflinks -p 8 -o C2_R2_clout C2_R2_thout/accepted_hits.bam$ cufflinks -p 8 -o C2_R3_clout C2_R3_thout/accepted_hits.bam

$ cuffmerge -g genes.gtf -s genome.fa -p 8 assemblies.txt

$ cuffdiff -o diff_out -b genome.fa -p 8 –L C1,C2 -u merged_asm/merged.gtf \./C1_R1_thout/accepted_hits.bam,./C1_R2_thout/accepted_hits.bam,\./C1_R3_thout/accepted_hits.bam \./C2_R1_thout/accepted_hits.bam,\./C2_R3_thout/accepted_hits.bam,./C2_R2_thout/accepted_hits.bam

Page 8: The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014

CI enables bioinformatics education

www.urbanbarcodeproject.org