71
1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS BIOINFORMATICS COURSE MTAT.03.239 05.09.2012

1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

  • Upload
    others

  • View
    17

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS

BIOINFORMATICS COURSE

MTAT.03.239

05.09.2012

Page 2: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

2 "Introduction to Bioinformatics"

Bioinformatics Course

Is a characteristic that distinguishes objects that have signaling and self-sustaining processes (i.e. living organism) to those that do not have it

Is a state of living characterized by capacity for metabolism, growth, reaction to stimuli, and reproduction

A diversity of life forms are found on Earth, eg. plants, animals, fungi, protists, archaea and bacteria

LIFE

Page 3: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

3 "Introduction to Bioinformatics"

Bioinformatics Course

Page 4: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

4 "Introduction to Bioinformatics"

Bioinformatics Course

WHAT IS BIOLOGY ?

Page 5: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

5 "Introduction to Bioinformatics"

Bioinformatics Course

Page 6: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

6 "Introduction to Bioinformatics"

Bioinformatics Course

BIOLOGY Is a study of life and living organisms

It brings together the structure, function, growth, origin, distribution, adaptation, interactions, taxonomy and evolution of living organism

AEROBIOLOGY, AGRICULTURE, ANATOMY, ASTROBIOLOGY, BIOCHEMISTRY, BIOENGINEERING, BIOINFORMATICS,

BIOMATHEMATICSOR, MATHEMATICAL BIOLOGY, BIOMECHANICS, BIOMEDICAL RESEARCH, BIOPHYSICS, BIOTECHNOLOGY, BUILDING BIOLOGY, BOTANY, CELLBIOLOGY, CONSERVATION BIOLOGY, CRYOBIOLOGY, DEVELOPMENTAL BIOLOGY, ECOLOGY, EMBRYOLOGY, ENTOMOLOGY, ENVIRONMENTAL BIOLOGY, EPIDEMIOLOGY, ETHOLOGY, EVOLUTIONARY BIOLOGY, GENETICS, HERPETOLOGY, HISTOLOGY, ICHTHYOLOGY, INTEGRATIVE BIOLOGY, LIMNOLOGY, MAMMALOGY, MARINE BIOLOGY, MICROBIOLOGY, MOLECULAR BIOLOGY, MYCOLOGY, NEUROBIOLOGY, OCEANOGRAPHY, ONCOLOGY, ORNITHOLOGY, POPULATION BIOLOGY, POPULATION ECOLOGY, POPULATION GENETICS, PALEONTOLOGY, PATHOBIOLOGY OR PATHOLOGY, PARASITOLOGY, PHARMACOLOGY, PHYSIOLOGY, PHYTOPATHOLOGY, PSYCHOBIOLOGY, SOCIOBIOLOGY, STRUCTURAL BIOLOGY, VIROLOGY

Page 7: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

Is a study of life and living organisms

It brings together the structure, function, growth, origin, distribution, adaptation, interactions, taxonomy and evolution of living organism

AEROBIOLOGY, AGRICULTURE, ANATOMY, ASTROBIOLOGY, BIOCHEMISTRY, BIOENGINEERING, BIOINFORMATICS,

BIOMATHEMATICSOR, MATHEMATICAL BIOLOGY, BIOMECHANICS, BIOMEDICAL RESEARCH, BIOPHYSICS, BIOTECHNOLOGY, BUILDING BIOLOGY, BOTANY, CELLBIOLOGY, CONSERVATION BIOLOGY, CRYOBIOLOGY, DEVELOPMENTAL BIOLOGY, ECOLOGY, EMBRYOLOGY, ENTOMOLOGY, ENVIRONMENTAL BIOLOGY, EPIDEMIOLOGY, ETHOLOGY, EVOLUTIONARY BIOLOGY, GENETICS, HERPETOLOGY, HISTOLOGY, ICHTHYOLOGY, INTEGRATIVE BIOLOGY, LIMNOLOGY, MAMMALOGY, MARINE BIOLOGY, MICROBIOLOGY, MOLECULAR BIOLOGY, MYCOLOGY, NEUROBIOLOGY, OCEANOGRAPHY, ONCOLOGY, ORNITHOLOGY, POPULATION BIOLOGY, POPULATION ECOLOGY, POPULATION GENETICS, PALEONTOLOGY, PATHOBIOLOGY OR PATHOLOGY, PARASITOLOGY, PHARMACOLOGY, PHYSIOLOGY, PHYTOPATHOLOGY, PSYCHOBIOLOGY, SOCIOBIOLOGY, STRUCTURAL BIOLOGY, VIROLOGY

7 "Introduction to Bioinformatics"

Bioinformatics Course

BIOLOGY COMPRISES AREAS OF STUDY THAT FOCUS ON LIFE AT A VARIETY OF LEVELS AND FROM A DIVERSITY OF PERSPECTIVES

BIOLOGY

Page 8: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

8 "Introduction to Bioinformatics"

Bioinformatics Course

Domain - Eukaryota Kingdom - Animalia Phylum - Chordata Vertebrata (Subphylum) Class - Mammalia Order - Primates Anthropoidea (Suborder) Hominoidea (Superfamily) Family - Hominidae Genus - Homo Species - sapiens

LIVING SYSTEMS

Page 9: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

9 "Introduction to Bioinformatics"

Bioinformatics Course

HUMANS Lineage (full): root; cellular organisms; Eukaryota; Opisthokonta; Metazoa; Eumetazoa; Bilateria; Coelomata; Deuterostomia; Chordata; Craniata; Vertebrata; Gnathostomata; Teleostomi; Euteleostomi; Sarcopterygii; Tetrapoda; Amniota; Mammalia; Theria; Eutheria; Euarchontoglires; Primates; Haplorrhini; Simiiformes; Catarrhini; Hominoidea; Hominidae; Homininae; Homo; Homo sapiens

http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=9606

Page 10: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

10 "Introduction to Bioinformatics"

Bioinformatics Course

SPECIES

Defined as a group of living organisms consisting of similar individuals capable of exchanging genes or interbreeding

http://www.nature.com/news/2011/110823/full/news.2011.498.html

Page 11: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,
Page 12: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

13 "Introduction to Bioinformatics"

Bioinformatics Course

NO. OF SPECIES

http://www.iucnredlist.org/documents/summarystatistics/2010_1RL_Stats_Table_1.pdf

Page 13: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

14

LEVELS OF ORGANISATION

http://www.nature.com/scitable/topicpage/biological-complexity-and-integrative-levels-of-organization-468

Page 14: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

15

LEVELS OF ORGANISATION

http://www.nature.com/scitable/topicpage/biological-complexity-and-integrative-levels-of-organization-468

Page 15: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

16 "Introduction to Bioinformatics"

Bioinformatics Course

BIOINFORMATICS ?

Page 16: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

17 "Introduction to Bioinformatics"

Bioinformatics Course

COMPUTER SCIENCE [CS]

STUDIES COMPUTABLE PROCESSES AND STRUCTURES ( WITH THE AID OF COMPUTERS )

Page 17: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

18 "Introduction to Bioinformatics"

Bioinformatics Course

BIOINFORMATICS AND COMPUTATIONAL BIOLOGY

The boundaries between the two diciplines are not well defined and can be distinguished by the problems they solve

BIOINFORMATICS – is the application of statistics and computer science to the field of molecular biology

COMPUTATIONAL BIOLOGY – actual process of analyzing and interpreting data

Page 18: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

19 "Introduction to Bioinformatics"

Bioinformatics Course

DEFINITION OF BIOINFORMATICS

The term bioinformatics was coined in 1978 Bioinformatics is the application of information technology and computer science to the field of molecular biology The science of using / developing computer software and algorithms to record, analyze and merge biologically related data Using computer technology to manage large amounts of biological data Bioinformatics involves the use of techniques including applied mathematics, informatics, statistics, computer science, artificial intelligence, chemistry, and biochemistry to solve biological problems usually on the molecular level http://www.google.com/search?q=define%3ABioinformatics

Page 19: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

20 "Introduction to Bioinformatics"

Bioinformatics Course

DEFINITION OF BIOINFORMATICS

The collection, organization, storage, analysis, and integration of large amounts of biological data using networks of computers and databases Bioinformatics involves the integration of computers, software tools, and databases in an effort to address biological questions In summary, the use of computer science to solve biological problems

http://www.google.com/search?q=define%3ABioinformatics

Page 20: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

21

BIOINFORMATIC FOCUS

http://www.nature.com/scitable/topicpage/biological-complexity-and-integrative-levels-of-organization-468

ORGANISM

ORGAN

TISSUE CELL

MOLECULES

Page 21: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

22

BIOINFORMATIC FOCUS

http://www.nature.com/scitable/topicpage/biological-complexity-and-integrative-levels-of-organization-468

ORGANISM

ORGAN

TISSUE CELL

MOLECULES

ANALYSIS AND INTERPRETATION OF VARIOUS TYPES OF BIOLOGICAL DATA INCLUDING: NUCLEOTIDE AND AMINO ACID SEQUENCES, PROTEIN DOMAINS, AND PROTEIN STRUCTURES.

Page 22: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

Development of new algorithms and statistics with which to assess biological information, such as relationships among members of large data sets.

23

BIOINFORMATIC FOCUS

http://www.nature.com/msb/journal/v3/n1/images/msb4100163-f4b.jpg

"Introduction to Bioinformatics" Bioinformatics Course

Page 23: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

24

BIOINFORMATIC FOCUS

http://www.jofwidata.com/images/database-design-development.jpg http://wolfson.huji.ac.il/expression/detective.jpg

Development and implementation of tools that enable efficient access and management of different types of information, such as various databases, integrated mapping information

"Introduction to Bioinformatics" Bioinformatics Course

Page 24: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

25

UNITS OF INFORMATION IN BIOINFORMATICS

DNA Sequence Pathways

RNA Structure Interactions

Protein Evolution Mutations

"Introduction to Bioinformatics" Bioinformatics Course

Page 25: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

26

UNITS OF INFORMATION IN COMPUTER SCIENCE

File Storage capacity by Bits and Bytes

Bit Byte Kilobyte Megabyte Gigabyte

bit 1 8 1024*8=

8,192 1024*8192=

8,388,608 1024*8388608= 8,589,934,592

byte 8 1 1024 1024*1024=

1,048,576 1024*1048576= 1,073,741,824

Kilobyte 8,192 1024 1 KB 1024 1,048,576

Megabyte 8,388,608 1,048,576 1024 1 MB 1024

Gigabyte 8,589,934,592 1,073,741,824 1,048,576 1024 1 GB

Terabyte 8,796,093,022,208

1TB 1,099,511,627,776 1,073,741,824 1,048,576 1024

Petabyte

9,007,199,254,740,990

1,125,899,906,842,620

1,099,511,627,776

1,073,741,824 1,048,576

1024 TB 1 PB

"Introduction to Bioinformatics" Bioinformatics Course

Page 26: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

27

File Storage capacity by Bits and Bytes

Bit Byte Kilobyte Megabyte Gigabyte

Petabyte

9,007,199,254,740,990

1,125,899,906,842,620

1,099,511,627,776

1,073,741,824 1,048,576

1024 TB 1 BO

Exabyte

9,223,372,036,854,780,000

1,152,921,504,606,850,000

1,125,899,906,842,620

1,099,511,627,776

1,073,741,824

1,048,576 TB 1024 PB 1 EB

Zettabyte

9,444,732,965,739,290,000,000

1,180,591,620,717,410,000,000

1,152,921,504,606,850,000

1,125,899,906,842,620

1,099,511,627,776

1,073,741,824 TB 1,048,576 PB 1024 EB 1 ZB

Yottabyte

9,671,406,556,917,030,000,000,000

1,208,925,819,614,630,000,000,00

0

1,180,591,620,717,410,000,000 KB

1,152,921,504,606,850,000 MB

1,125,899,906,842,620 GB

1,099,511,627,776 TB 1,073,741,824 PB 1,048,576 EB 1024 ZB 1 YB

"Introduction to Bioinformatics" Bioinformatics Course

UNITS OF INFORMATION IN COMPUTER SCIENCE

Page 27: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

28

CELL SIZES

"Introduction to Bioinformatics" Bioinformatics Course

http://learn.genetics.utah.edu/content/begin/cells/scale/

Page 28: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

29

HUMAN CELL

http://bhavanajagat.files.wordpress.com/2012/02/cell-structure-and-functions.jpg

Page 29: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

30

EXAMPLES OF BIOLOGICAL DATA

"Introduction to Bioinformatics" Bioinformatics Course

GENOME – DNA TRANSCRIPTOME – RNA PROTEOME – Proteins

The biological information contained in a genome is encoded in deoxyribonucleic acid (DNA) or, for many types of virus, in ribonucleic acid (RNA)

Page 30: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

31

NAME THE NUMBERS

"Introduction to Bioinformatics" Bioinformatics Course

1

3

4 5

2

NUCLEUS DNA GENES CHROMOSOME CELL

Page 31: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

32

EXAMPLES OF BIOLOGICAL DATA

"Introduction to Bioinformatics" Bioinformatics Course

Page 32: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

33

CENTRAL DOGMA OF MOLECULAR BIOLOGY

http://compbio.pbworks.com/f/central_dogma.jpg

DNA is transcribed into RNA and RNA is translated into proteins

Page 33: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

34

EXAMPLES OF BIOLOGICAL DATA

"Introduction to Bioinformatics" Bioinformatics Course

GENOME – DNA TRANSCRIPTOME – RNA PROTEOME – Proteins

The biological information contained in a genome is encoded in deoxyribonucleic acid (DNA) or, for many types of virus, in ribonucleic acid (RNA)

Page 34: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

35

GENOME

"Introduction to Bioinformatics" Bioinformatics Course

Is the entirety of an organism’s hereditary information

The genome includes both the genes and non-coding sequences of DNA/RNA

In 1995, Haemophilus influenzae or was the first genome of a living organism to be sequenced in July 1995

1 830 140 base pairs of DNA in single circular chromosome that contains 1740 protein-coding gene, 58 transfer RNA genes and 18 other RNA genes

http://www.sciencemag.org/content/269/5223/local/front-matter.pdf

Page 35: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

36

WHOLE GENOMES

"Introduction to Bioinformatics" Bioinformatics Course

Page 36: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

37

GENOME SIZES

"Introduction to Bioinformatics" Bioinformatics Course

Escherichia coli 4,600,000 bp (1997)

Saccharomyces cerevisiae 12,100,000 (1996)

Arabidopsis thaliana 157,000,000 (2000)

Homo sapiens 3,200,000,000 (2000)

Page 37: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

38

GENOME SIZES

"Introduction to Bioinformatics" Bioinformatics Course

Japanese flower Paris japonica 130 billion base pairs – 50 times the human genome

Page 38: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

39

COMPLETELY SEQUENCED GENOMES

"Introduction to Bioinformatics" Bioinformatics Course

Page 39: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

40

HUMAN GENOME

"Introduction to Bioinformatics" Bioinformatics Course

Human body

• 1014 cells

• (100 trillion)

One cell

• 23 pairs of chromosomes

DNA

• ≈21,000 to 23,000 genes

RNA

• 3 billion pairs of DNA bases

Protein

• ≈100 000 different proteins

Page 40: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

41

Relative proportions (%) of bases in DNA

"Introduction to Bioinformatics" Bioinformatics Course

CURRENT SCIENCE, VOL. 85, NO. 11, 10 DECEMBER 2003

Page 41: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

42

DNA

DNA with high GC-content is more stable than DNA with low GC-content, 3 hydrogen bonds

"Introduction to Bioinformatics" Bioinformatics Course

Page 42: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

43

DNA vs RNA

DNA – deoxyribonucleic acid Sugar is deoxyribose DNA is a polymer of deoxyribonucleotides Bases are adenine (A), guanine (G), cytosine (C) and thymine (T)

RNA –ribonucleic acid Sugar is ribose RNA is a polymer of ribonucleotides Bases are adenine (A), guanine (G), cytosine (C) and uracil (U)

http://www2.chemistry.msu.edu/faculty/reusch/VirtTxtJml/Images3/dna_rna1.gif

Page 43: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

44

DNA SEQUENCE

"Introduction to Bioinformatics" Bioinformatics Course

Raw DNA sequence Coding or non-coding Parses into genes 4 nucleotide bases ATGC

>ENST00000539570 cdna:known chromosome:GRCh37:15:63889592:63893885:1 gene:ENSG00000259662 gene_biotype:protein_coding transcript_biotype:protein_coding ATGTGGCCACTGCTCACCATGCACATAACCCAGCTCAACCGGGAGTGCCTGCTGCACCTCTTCTCCTTCCTAGACAAGGACAGCAGGAAGAGCCTTGCCAGGACCTGCTCCCAGCTCCACGACGTGTTTGAGGACCCCGCACTCTGGTCCCTGCTGCACTTCCGTTCCCTCACTGAACTCCAGAAGGACAACTTCCTCCTGGGCCCGGCACTCCGCAGCCTCTCCATCTGCTGGCACTCCAGCCGCGTGCAGGTGTGCAGCATTGAGGACTGGCTCAAGAGTGCCTTCCAGAGAAGCATCTGCAGCCGGCACGAGAGCCTGGTCAATGATTTCCTCCTCCGGGTGTGCGACAGGCTTTCTGCTGTGCGCTCCCCACGGAGGCGGGAGGCGCCTGCACCGTCCTCGGGGACTCCGATCGCCGTTGGACCGAAATCACCTCGGTGGGGAGGACCTGACCACTCGGAGTTCGCCGACTTGCGCTCGGGGGTGACGGGGGCCAGGGCTGCCGCGCGCAGGGGTCTGGGGAGCCTCCGGGCGGAGCGACCCAGCGAGACCCCGCCGGCTCCCGGAGTGTCCTGGGGACCGCCACCTCCAGGAGCCCCGGTGGTGATCTCGGTGAAGCAGGAGGAGGGGAAGCAGGGGCGCACGGGCAGAAGGAGCCACCGAGCCGCTCCTCCTTGCGGTTTTGCCCGCACGCGCGTCTGCCCGCCCACCTTTCCTGGGGCGGATGCGTTCCCGCAGTGA

Page 44: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

45

A GENE

"Introduction to Bioinformatics" Bioinformatics Course

http://www.down-syndrome.org/updates/2054/updates-2054-figure1-400w.png

Page 46: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

47

GENE EXPRESSION REGULATORS - EPIGENETICS

"Introduction to Bioinformatics" Bioinformatics Course

http://scienceblogs.com/pharyngula/2008/07/22/epigenetics/

Page 47: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

48

EXAMPLES OF BIOLOGICAL DATA

"Introduction to Bioinformatics" Bioinformatics Course

GENOME – DNA TRANSCRIPTOME – RNA PROTEOME – Proteins Transcriptome is a set of all RNA molecules including mRNA, rRNA, tRNA, and non-coding RNA produced in one or a population of cells

http://www.bio.miami.edu/~cmallery/150/gene/c7.17.7b.transcription.jpg

Page 48: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

49

TRANSCRIPTION

"Introduction to Bioinformatics" Bioinformatics Course

http://www.youtube.com/watch?v=ztPkv7wc3yU

Page 49: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

50

TRANSCRIPTION

"Introduction to Bioinformatics" Bioinformatics Course

http://www.bio.miami.edu/~cmallery/150/gene/c7.17.7b.transcription.jpg

Page 50: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

51

ALTERNATIVE SPLICING

http://www.nature.com/scitable/content/a-schematic-representation-of-alternative-splicing-95777

"Introduction to Bioinformatics" Bioinformatics Course

Page 51: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

52

TYPES OF RNA

"Introduction to Bioinformatics" Bioinformatics Course

http://csls-text.c.u-tokyo.ac.jp/images/fig/fig03_4.gif

mRNA – messenger RNA:

encodes amino acid sequences of a polypeptide

tRNA – transfer RNA: brings

amino acids to ribosomes during translation

rRNA – ribosomal RNA: with

ribosome proteins makes up the ribosomes, the organelles that translate the mRNA

snRNA – small nuclear

RNA: forms complexes with proteins that are used in RNA processing in eukaryotes

Page 52: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

53

TYPES OF RNA

"Introduction to Bioinformatics" Bioinformatics Course

http://finchtalk.geospiza.com/2009_05_01_archive.html

Page 53: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

54

EXAMPLES OF BIOLOGICAL DATA

"Introduction to Bioinformatics" Bioinformatics Course

GENOME – DNA TRANSCRIPTOME – RNA PROTEOME – Proteins

The proteome is the entire set of proteins expressed by a genome, cell, tissue or organism.

http://artavanis-tsakonas.med.harvard.edu/research_images/figure_harsha_proteome.jpg

Page 54: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

55

FROM TRANSCRIPTION TO TRANSLATION

"Introduction to Bioinformatics" Bioinformatics Course

http://www1.cs.columbia.edu/~cleslie/cs4761/microarray/central-dogma.png

Page 55: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

56

TRANSLATION

"Introduction to Bioinformatics" Bioinformatics Course

http://0.tqn.com/d/chemistry/1/0/G/m/mrnatranslation.jpg

Page 56: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

57

TRANSLATION INITIATION

"Introduction to Bioinformatics" Bioinformatics Course

http://bioap.wikispaces.com/Ch+17+Collaboration

Page 57: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

58

TRANSLATION TERMINATION

"Introduction to Bioinformatics" Bioinformatics Course

http://kvhs.nbed.nb.ca/gallant/biology/translation_termination.html

Page 58: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

59

UNIVERSAL GENETIC CODE

"Introduction to Bioinformatics" Bioinformatics Course

http://www.biogem.org/codon.jpg

Page 59: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

60

AMINO ACIDS

Page 60: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

61

PROTEIN

Proteins consists of long chains of amino acid sequences 20 letter alphabet (IUPAC nomenclature)

IUPAC amino acid code

Three letter code

Amino acid

A Ala Alanine

C Cys Cysteine

D Asp Aspartic Acid

E Glu Glutamic Acid

F Phe Phenylalanine

G Gly Glycine

H His Histidine

I Ile Isoleucine

K Lys Lysine

L Leu Leucine

IUPAC amino acid code

Three letter code

Amino acid

M Met Methionine

N Asn Asparagine

P Pro Proline

Q Gln Glutamine

R Arg Arginine

S Ser Serine

T Thr Threonine

V Val Valine

W Trp Tryptophan

Y Tyr Tyrosine

"Introduction to Bioinformatics" Bioinformatics Course

Page 61: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

62

PROTEIN SEQUENCE

>sp|P48431|SOX2_HUMAN Transcription factor SOX-2 OS=Homo sapiens GN=SOX2 PE=1 SV=1 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS GPVPGTAINGTLPLSHM

"Introduction to Bioinformatics" Bioinformatics Course

Page 62: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

63

PROTEIN SIZE

http://www.quora.com/Protein-nutrition-1/Whats-the-average-size-of-a-human-protein-in-kDa

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1150220/

Page 63: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

64

PROTEIN STRUCTURE

"Introduction to Bioinformatics" Bioinformatics Course

Page 64: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

65

PROTEIN DOMAINS

Page 65: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

66

PROTEIN SEQUENCE

>sp|P48431|SOX2_HUMAN Transcription factor SOX-2 OS=Homo sapiens GN=SOX2 PE=1 SV=1 MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLM KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY PQHPGLNAHGAAQMQPMHRYDVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS GPVPGTAINGTLPLSHM

Proteins are divided into domains

DNA BINDING DOMAIN

http://www.uniprot.org/

Page 66: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

67

GENE TRANSCRIPTION, TRANSLATION AND PROTEIN SYNTHESIS

http://compbio.pbworks.com/f/central_dogma.jpg

Page 67: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

68

CENTRAL DOGMA

http://www.youtube.com/watch?v=983lhh20rGY

Page 68: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

69

BIOINFORMATIC APPLICATIONS

"Introduction to Bioinformatics" Bioinformatics Course

The integrative approaches are useful and applied in Agricultural Higher yield in crops or fruits Disease or drought resistance crops

Medical To understand processes in healthy and disease individuals Genetic diseases

Pharmaceutical To find or develop new and better drugs Gene based drugs Structure based drug designing

Page 69: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

70

BIOINFORMATIC QUESTIONS 1

"Introduction to Bioinformatics" Bioinformatics Course

To identify an unknown gene of interest

Sequence matching

Is there a match to known sequence in the database

Which protein family does it match to

How to identify more family members

I have an similar structure, how to identify its potential ligands

How to identify if my gene/protein is found present also in other species

How can I identify genes that are inherited together in a specific region

Page 70: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

71

BIOINFORMATIC QUESTIONS 2

"Introduction to Bioinformatics" Bioinformatics Course

I have to constructed a artificial gene, how do I design the primers, how to check if I have the right sequence?

To know structure of an poorly expressed RNA sequence

To identify the structure and function of a protein sequence

To cluster protein sequences into families of related sequences and develop models

To generate phylogenetic trees to identify the evolutionary relationships using similar proteins/DNA

To identify which other proteins interacts with sequence of interest.

Page 71: 1. INTRODUCTION TO BIOLOGY AND BIOINFORMATICS · 6 "Introduction to Bioinformatics" Bioinformatics Course BIOLOGY Is a study of life and living organisms It brings together the structure,

72

BIOINFORMATIC QUESTIONS 3

"Introduction to Bioinformatics" Bioinformatics Course

Find genes that have similar expression in specific conditions

Find transcription factors that regulate specific genes

Vizualise different gene and protein networks

Describe the regulation of genes