30
+ Computer Science and Biology Arthi Ramachandran July 7, 2011

Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+

Computer Science and Biology Arthi Ramachandran July 7, 2011

Page 2: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Outline

n Phylogeny

n What is DNA? n Massive codes, tons of information

n What is Computational Biology?

n CS Unplugged

n Phylogenetics Activity

n Conclusion

Page 3: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Phylogeny

Page 4: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+

Page 5: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+What is DNA?

n Double helix polymer

n Backbone n Deoxyribose (sugar) n Phosphate

n Bases n Adenine n Cytosine n Guanine n Thiamine

Page 6: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+What is DNA?

n Instructions for n How to build a human

n Code for who and what we are

n 1 human = 3 billion bases

n Humans can’t analyze that much information by hand

Page 7: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+How much data is there?

n Moore’s law: n  # transistors on chip doubles

in 2 years

n Sequencing n  2001: 1st human genome

n  Billions of dollars n  10 year effort

n  Now: n  Thousands of dollars n  Several days

n  Opportunity for research

Productivity of DNA Synthesis & Sequencing

Base

s se

que

nced

per

per

son

per

day

. N

umb

er o

f tra

nsis

tors

per

chi

p.

Year

Page 8: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Computational Biology

n To analyze DNA, we need to use ideas from n Computer science

n Mathematics n Statistics

Page 9: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Computational Biology is everywhere!

n Phylogenetics n How different species are related

n Genomics n DNA sequence analysis

n Medical informatics n Data mining medical records

n Computational neuroscience

Page 10: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+CS Unplugged

n Activities to teach Computer Science at a middle-high school level n Without computers

n No programming

n Examples: n Binary arithmetic n Artificial Intelligence

n Cryptography n Graph Theory

Page 11: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+CS Unplugged: Computational Biology

n Biology: evolution

n Biology: phylogenetics

n Mathematics: matrices n Only High School level math needed

Page 12: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Play modified version of the children’s game “telephone”

n From final set of messages {A, B, C, D, E}, can we reconstruct who told whom and when?

Page 13: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+The Biology behind it

n Messages ~ DNA sequences

n Passing of a message ~ passing DNA from one generation to the next

n Changes in message ~ mutations

Page 14: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+The Biology behind it

n Phylogenetics n  From the end DNA sequences

(current living organisms) we want to construct the evolutionary tree

Page 15: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Set of similar sounding words: n  {chair, prepare, affair, stair}

n  {school, rule, cool, fool} n  {dream, stream, theme, beam}

Page 16: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Construct a message with 5-7 of those words

n Student 1 passes it on to Student 2, 3 and so on

Page 17: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity Rules

n Whisper the message you hear to the next person

n No repeats allowed if you didn’t hear the message properly

n Write what you heard down on the given paper

Page 18: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n A, B, C, D, E are the final messages

n Relabel them V, W, X, Y, Z

Page 19: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n V:

n W:

n X:

n Y:

n Z:

n How would you construct the tree structure?

Page 20: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Edit distance n How similar are two of the messages?

V W X Y

W - - -

X - -

Y -

Z

Page 21: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+The Biology behind it

n Maximal Parsimony n Occam’s Razor

n  Tend towards simpler theories until we can trade some simplicity for increased explanatory power

n  the simplest explanation is most likely the correct one

n Fewest mutations possible in the tree

Page 22: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Reconstructing the tree n Maximal parsimony

n Smallest entry in matrix n Group those messages together ?

?

Page 23: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Revise distance matrix

n Repeat process

- -

-

Page 24: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Revise distance matrix

-

Page 25: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Activity

n Is the final tree the same as the original tree?

n If not, why? n Random selection to break ties

n Neutral changes

Page 26: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+The Math behind it

n Matrix theory n Species distance matrix

n  Pairwise distances

Page 27: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+The Computer Science behind it

n Easy to solve with 5 messages of short length n What if there are hundreds of messages?

n What if each message is hundreds of words? Thousands? Millions?

Page 28: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Evolutionary tree example

Page 29: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Summary

n Just one example of what can be done using computer science in the biological sphere

n Computer Science is used everywhere!

Page 30: Computer Science and Biology - Columbia UniversityCS4HS/talks/Phylogenetics.pdf · CS Unplugged ! Activities to teach Computer Science at a middle-high school level ! Without computers

+Resources

n CS Unplugged n http://csunplugged.org/activities