20
© Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

© Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

  • View
    216

  • Download
    1

Embed Size (px)

Citation preview

Page 1: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

© Wiley Publishing. 2007. All Rights Reserved.

Biological Sequences

Page 2: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Learning Objectives

Crash course in molecular biologyKnowing the basic properties of the main

biological sequences: DNA, RNA, and

proteins

Page 3: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Outline

1. Protein sequences

2. DNA sequences

3. RNA sequences

4. Entire genomes

Page 4: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Proteins

Proteins are like small machines in the cell.Proteins carry out most of the work in a cell.Proteins are synthesized from RNA sequences.

Page 5: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Amino Acids

Proteins are made of 20 amino acids. Each amino acid is small molecule made up of fewer than

100 atoms. The 20 amino acids have similar terminations; they can be

chained to one another like Lego bricks.

Page 6: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Protein Sequences

Proteins are made of amino acids chained by peptide bonds. Protein sequences are written from the N to the C-terminus. Your average protein is 400 amino acids long. The longest protein is 30,000 amino acids long.

Page 7: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Protein Structures

Proteins have well-defined

3-dimensional structures. Hydrophobic amino acids

are in the protein’s core. Hydrophilic amino acids

are on the protein’s

surface.

Page 8: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Techniques for Bioinformatic Analysis

of Proteins

Browsing protein databases (Ch. 4) Predicting transmembrane segments (Ch. 6) Predicting secondary structural elements (Ch. 11) Predicting the 3D structure (Ch. 11) Predicting the domains (Ch. 6) Predicting the physico-chemical properties (Ch. 6)

• Weight • Isoelectric point (pI) • Digestion patterns

Page 9: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Bioinformatic Analysis of Proteins: More Techniques

Searching protein databases with BLAST (Ch. 7)Comparing protein sequences with alignments (Ch. 8)Making multiple-sequence alignments to look for conserved

patterns (Ch.9)Visualizing 3D protein structures using the PDB database (Ch.

11)Reconstructing the phylogenetic tree of a protein family (Ch. 13)

Page 10: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

DNA

DeoxyriboNucleic AcidGenomes and genes are made of DNADNA is the main support of heredity

Page 11: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

DNA Sequences

DNA sequences are made of 4 nucleotides• Adenine A

• Guanine G

• Cytosine C

• Thymine T

DNA Sequences can be very long• Human chromosomes contain hundreds of millions of

nucleotides

• A tiny bacterium can contain a genome of several million

nucleotides

Page 12: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Nucleotides

Nucleotides have similar terminations. Nucleotides are meant to be chained like Lego bricks. Nucleotides can interact with each other:

• Adenine with thymine (A with T)• Guanine with cytosine (G with C)

Page 13: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Double-strand DNA

DNA sequences always come in two strands. The strands are complementary and opposite in orientation. By convention, biologists write only the 5’ and 3’ strands. Database-search programs search both strands automatically .

Page 14: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

RNA

RiboNucleic AcidRNA is a close relative of DNARNA has many functions

• Provides coding for proteins• Helps synthesize proteins• Helps many basic processes in the cell

RNA is not very stable• RNA is synthesized and very often degraded• DNA, by contrast, is very stable

Page 15: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

The RNA Sequence

RNA contains 4 nucleotides:• A, G, C, U• U is Uracil

RNA does not contain Thymine (T)Uracil replaces Thymine in RNARNA is single-stranded

Page 16: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

RNA Secondary Structures

RNA can make secondary structures RNA can make 1 strand with itself as a secondary structure Secondary structures are made of stems and loops

Page 17: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

What Is the Length of My Sequence ?

Protein sizes are expressed in amino acids or in Daltons • 115 Daltons ~ 1 amino acid

DNA and RNA sequences length are expressed in• Base-pairs (bp)• One Kbp or Kb: 1 thousand base pairs• One Mbp or Mb: 1 million base pairs• One Gbp or Gb: 1 billion base pairs

The following terms often have the same meaning:• Base• Base-pair (bp)• Nucleotide (nt)• Positions, nucleotides, residues

Page 18: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Turning DNA into Proteins:The Genetic Code

DNA gets transcribed into RNA

using nucleotide complementarity.

RNA gets translated into proteins

using the genetic code:• UCU UAU GCG UAA • SER-TYR-ALA-STOP

Page 19: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Some Bioinformatics Applications

For RNA and DNAUsing DNA-sequence databases (Ch.2,3)Identifying restriction sites (Ch. 5)Designing PCR primers (Ch. 5)Predicting RNA secondary structures (Ch. 12)Comparing DNA sequences (Ch. 7, 8, and 9)

Page 20: © Wiley Publishing. 2007. All Rights Reserved. Biological Sequences

Some Bioinformatics Applications

For Entire GenomesFinding your favorite genome online in public

databases (Ch. 3)Comparing your genome with others (Ch. 3, Ch. 7)Predicting the genes in your genome (Ch. 5)Building a phylogeny with your genome (Ch. 3)