55
The Human Genome HAGenetics.org Dr. Hasan Alhaddad Guest lecturer: Molecular Basis of Human Diseases October 12 th , 14 th , 16 th 2014 Room 244 (1 PM)

The Human Genome Project - Part I

Embed Size (px)

Citation preview

Page 1: The Human Genome Project - Part I

The Human Genome

HAGenetics.org

Dr. Hasan Alhaddad Guest lecturer: Molecular Basis of Human Diseases

October 12th, 14th, 16th 2014 Room 244 (1 PM)

Page 2: The Human Genome Project - Part I

Lectures structure

HAGenetics.org

•  Part I (Sunday Oct 12th): •  The book of life (Matt Ridely’s analogy with

modifications). •  Introduction to the technologies at the time.

•  Part II (Tuesday Oct 14th): •  Why sequencing genomes/the human genome? •  Genome war (public and private projects). •  Sequencing the genome. •  Genome assembly. •  Genome annotation.

•  Part III (Thursday Oct 16th): •  Genome outcome. •  The Genomic era.

Page 3: The Human Genome Project - Part I

AIMS (part I)

•  Learn the basics of the human genome and understand how to simplify the concept of a genome to the public.

•  Understand the technologies of the time and how they affect the project.

•  Understand the limitations of the technologies.

•  Understand some the general motives for sequencing the human genome.

HAGenetics.org

Page 4: The Human Genome Project - Part I

What is the genome?

HAGenetics.org

The book of life The code to our existence

The instructions to make who we are The map to how we look, feel, think, and behave

The genome is ourselves in a chemical language

Page 5: The Human Genome Project - Part I

HAGenetics.org

The human book of life – an analogy

Page 6: The Human Genome Project - Part I

The human book of life – an analogy

HAGenetics.org

I have two books

The human book of life is called

THE GENOME

Page 7: The Human Genome Project - Part I

The human book of life – an analogy

HAGenetics.org

Can you define the genome?

What cells in your body have one copy of the genome and what cells have two?

Page 8: The Human Genome Project - Part I

Some details on the human book of life

HAGenetics.org

•  Humans are diploid organisms - two books (genomes) in most of their cells.

•  Each book (genome) is composed of 23 chapters (chromosomes).

•  The total number of chapters (chromosomes) in a humans is 46.

•  Approximately 3 billion letters in the book!

•  Sex determination depends on chapter 23 (the sex chromosomes).

•  XX ! female. •  XY ! male.

Autosomal chromosomes

Sex chromosomes

Page 9: The Human Genome Project - Part I

The human book of life

HAGenetics.org

How did we learn about the number of chromosomes in the human genome?

Page 10: The Human Genome Project - Part I

Sex determination (mostly)

HAGenetics.org

X X X Y

What are the 23rd chapters of your books of life?

Page 11: The Human Genome Project - Part I

HAGenetics.org

Genome size How did we learn about the size of the human genome?

Page 12: The Human Genome Project - Part I

HAGenetics.org

Genome size

How did we learn about the size of the human genome?

Page 13: The Human Genome Project - Part I

More details about the book

HAGenetics.org

•  The number of pages, word,

and letters differ in each

chapter (chromosome).

•  Chapters (chromosomes)

are numbered based on

their size. (chr 1 is the

largest).

•  The instructions/readable

sections (genes) are not

equally distributed over the

chromosomes.

•  Many sections of the book

(genome) are not readable.

•  Many sections of the book

are of repeated letter,

words, or sentences.

•  The book is written in a

chemical language

composed of four letters

(A,T,G,C).

•  Sentences are made of

words made of three

letters (AAC, ATG, etc.).

Page 14: The Human Genome Project - Part I

What is written and how to read it?

HAGenetics.org

ATGCCATCACAAATGCGGCTATGCCATGACAAATGCGGCTAATGCCATGACAAATGCGGCTAATGCCATGACAAATGCGGCTAATGCCATGACAAATGCGGCTACATGACAAATGCGGCTAATGCCATGACAAATGCGGCTAATCATGACAAATGCGGCTACATGACCATGACAAATGCGGCTACATGACAACAACAACAACAACAACAACAACAACAACATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATATAT

Met-Pro-Ser-Met-Arg-Lue-Cys-His-Asp-Lys-Cys-Gly-stop

Bla bla bla bla bla

More bla bla bla bla

Make brown eyes and stop

Page 15: The Human Genome Project - Part I

The genome in a cell

HAGenetics.org

•  The Wellcome collection in London. •  The human genome printed using font

size (5)! •  If we print the genome using font size

12 and stretch the letter, it would go ~ from Kuwait to Spain!

•  A lot of information. How is it packaged in a 100 trillion tiny little human cells (1-100 um)?

Page 16: The Human Genome Project - Part I

HAGenetics.org

Genome packaging

The genome is packaged via the interaction of DNA with

proteins.

Page 17: The Human Genome Project - Part I

Lost?

HAGenetics.org

Are you lost with these analogy?

If yes, do not worry.

You will learn all the details during upcoming lectures "

Page 18: The Human Genome Project - Part I

HAGenetics.org

What was known at the time? A summary: •  Humans have diploid genome (2n = 46 chromosomes) in

their somatic cells.

•  Humans have haploid genome (n = 23 chromosomes) in their germ-line cells.

•  Humans have 22 pairs of autosomal chromosomes and 1 pair of sex chromosomes (X and Y)

•  Human genome contains heterochromatic regions (highly condensed – few active genes) and Euchromatic region (lightly condensed – many active genes).

•  Human genome size is ~ 3 billion bp.

Page 19: The Human Genome Project - Part I

HAGenetics.org

Molecular technologies of the time

How were genes/genomes studied at the time?

We will go over some key technologies that are relevant to the human genome project.

Page 20: The Human Genome Project - Part I

HAGenetics.org

Molecular technologies of the time Polymerase Chain Reaction

Biochomicals.com

Page 21: The Human Genome Project - Part I

Polymerase chain reaction

•  Polymerase Chain Reaction (PCR) allows the amplification (copying) of small amounts of DNA millions of copies.

•  The method was developed by Kary Mullis (1983) and he was awarded the Nobel Prize for his invention.

•  The process of PCR is similar to the process of DNA replication except it is done in tubes rather than living cells.

•  It is considered in many cases the first step before any genetic analysis.

•  Many methods and applications involve PCR.

HAGenetics.org

Page 22: The Human Genome Project - Part I

DNA replication and PCR •  DNA replication in the cells involves making an identical

copy of the genome (DNA).

•  PCR uses the same procedure but to generate millions of copies of a small section of the genome in a tube!

HAGenetics.org

Page 23: The Human Genome Project - Part I

Components

HAGenetics.org

What do we need to replicate (copy) DNA?

1.  DNA template.

2.  Building block of DNA (dNTPs).

3.  DNA copier (an enzyme).

4.  3’OH (primer).

Page 24: The Human Genome Project - Part I

Components: (1) DNA template

HAGenetics.org

•  The DNA sample you collect from a crime scene or the one under investigation is the DNA template.

G A

C T

A

T

T

A

G

C

G

C

A

T

C

G

C

G

G

C

T

A

A

T

T

A

G

C

A

T

C

G

A

T

C

G

5’

5’

3’

3’

C T T A C C T G G C A T A C T G T G 5’ 3’

G A A T G G A C C G T A T G A C A C 5’ 3’

Each strand serves as a template for copying.

Remember complementary base-pairing!

Page 25: The Human Genome Project - Part I

Components: (2) dNTP (building blocks)

HAGenetics.org

Do you remember? DNA is made of nucleotides!

Page 26: The Human Genome Project - Part I

Deoxyribonucleoside triphosphate (dNTP)

HAGenetics.org

Four dNTPs serve as the building blocks of DNA (dATP, dTTP, dGTP, dCTP)

Remember Nucleotides!

Components: (2) dNTP (building blocks)

H H

Page 27: The Human Genome Project - Part I

Deoxyribonucleoside triphosphate (dNTP)

HAGenetics.org

Why triphosphate?

For the energy required to for the phosphodiester bond

Components: (2) dNTP (building blocks)

+

H H

Page 28: The Human Genome Project - Part I

Components: (3) DNA copier (polymerase)

HAGenetics.org

• DNA polymerase is the DNA copier in the cell.

•  Uses the dNTPs (DNA building blocks) to make a complementary strand to the template.

•  Uses the available 3’-OH of a previous nucleotide and 5’phsphate from dNTP to form a phosphodiester bond.

•  Each time DNA Pol finds the correct complementary dNTP and catalyzes the reaction linking the new nucleotide.

Remember DNA Pol needs 3’-OH

Page 29: The Human Genome Project - Part I

Primers are short piece of polynucleotide

HAGenetics.org

G

C T

A

T

T

A

G

C C T G G C A T A C T G T G

5’

5’

OH-3’

3’

In order for the DNA copying machine to work and add nucleotides,

a 3’-OH needs to be available to form a phosphodiester bond!

Components: (4) primer (3’ OH)

Page 30: The Human Genome Project - Part I

PCR Process

•  Three steps are involved in PCR:

1.  DNA template denaturation: separation of the two strands of DNA.

2.  Primers annealing: small oligonucleotide attaches to each separated strand providing the 3’OH for DNA polymerase.

3.  DNA polymerization (extension): DNA polymerase extends the primers on both strands and adds nucleotides.

HAGenetics.org

Page 31: The Human Genome Project - Part I

HAGenetics.org

PCR Process

1.  DNA denaturation 2.  Primers annealing 3.  DNA extension

Page 32: The Human Genome Project - Part I

HAGenetics.org

PCR cycles

1.  DNA denaturation 2.  Primers annealing 3.  DNA extension

Tem

p (C

)

94 C

55-65 C

72 C

What happens if we repeat this cycle many times?

Cycle 1 Cycle 2 Cycle 3 Cycle 4 Cycle 5

21 22 23 24 25

Exponential growth in the number of copies generated. The number of copies you get at the end of your PCR will be

2#cycles (236 cycles = 68 billion copies)

Page 33: The Human Genome Project - Part I

HAGenetics.org

PCR – how many copies?

Threshhold

Not enough DNA template

No chemicals left in the reaction tube

# D

NA

copi

es

# cycles

Page 34: The Human Genome Project - Part I

HAGenetics.org

Problems!

There were some difficulties with this system:

1. Three water-baths with three different temperature.

2. DNA polymerase denatures at 94 C.

Page 35: The Human Genome Project - Part I

HAGenetics.org

Problems!

DNA denature (94 C)

Primer annealing (55-65 C)

Extension (72 C)

Adding DNA Pol

DNA Pol denatures

•  The sample has to be transferred into multiple water baths to accommodate the needed temperature.

•  DNA polymerase needs to be added in every cycle because DNA polymerase denatures at high temperature.

Page 36: The Human Genome Project - Part I

Improvement 1

•  Using Thermus aquaticus (Taq) polymerase.

•  Taq polymerase is heat stable and the cycles can take place without the polymerase being destroyed during the denaturation phase.

HAGenetics.org

Page 37: The Human Genome Project - Part I

Improvement 1 •  Using Thermus aquaticus (Taq) polymerase.

HAGenetics.org

Biochomicals.com

Page 38: The Human Genome Project - Part I

Improvement 2

Replacing old machine (water baths) with a thermocycler.

HAGenetics.org

Page 39: The Human Genome Project - Part I

Improvement 2 Replacing old machine (water baths) with a thermocycler.

HAGenetics.org

Page 40: The Human Genome Project - Part I

To consider

•  Length and GC content of your primer.

•  Compatibility of your forward and reverse primers.

•  Primer’s sequences do not complement each other (primer dimer).

•  Annealing temperature of both primers should be the same.

•  Length of the target DNA piece ( the longer the target the longer the extension time).

•  DNA polymerase, primers and other chemicals’ concentration should be precisely calculated.

HAGenetics.org

Page 41: The Human Genome Project - Part I

Problem with PCR

What if we do not know the sequence of DNA to design primers?

How can I get enough copies of DNA?

HAGenetics.org

Page 42: The Human Genome Project - Part I

Problem with PCR

HAGenetics.org

•  Copying unknown DNA using a biological system (bacteria).

•  The approach used is called “Molecular Cloning”.

•  It makes use Endonucleases (restriction enzymes) and plasmids of bacteria to copy a specific unknown piece of DNA.

•  The plasmid’s DNA can serve as a primer for PCR and DNA sequencing.

Page 43: The Human Genome Project - Part I

Why copy DNA?

HAGenetics.org

Why molecular cloning or PCR matters in genome studies?

DNA CANNOT be sequenced using one copy!

Page 44: The Human Genome Project - Part I

What is DNA sequencing?

HAGenetics.org

It is reading the letters of the book. It is reading the exact nucleotide sequence of the genome.

Page 45: The Human Genome Project - Part I

Sequencing methods available now!

HAGenetics.org

1. Maxam and Gilbert chemical degradation method (extinct).

2. Sanger sequencing (dideoxy or chain termination method).

3.  Illumina sequencing.

4. SOLiD sequencing.

5. Pyrosequencing.

6.  Ion Torrent method.

7. Single molecule sequencing.

Page 46: The Human Genome Project - Part I

Sanger sequencing (the great method)

HAGenetics.org

•  Fredrick Sanger has developed a sequencing method and received a Noble prize for it.

•  Sanger sequencing method is also called Chain Termination Method and Dideoxy sequencing method.

•  Employs: •  specific primers

•  dNTPs

•  ddNTPs

•  DNA polymerase

•  DNA template

Page 47: The Human Genome Project - Part I

DNA synthesis

HAGenetics.org

+

H H

DNA synthesis requires the availability of a 3’-OH and energy

H H

Page 48: The Human Genome Project - Part I

DNA synthesis

HAGenetics.org

Difference in OH location in sugar and consequences

Page 49: The Human Genome Project - Part I

DNA synthesis

HAGenetics.org

The absence of OH group on the 3’ carbon of the sugar blocks further addition of nucleotides

Page 50: The Human Genome Project - Part I

Sanger sequencing procedure

HAGenetics.org

ddGTP ddCTP ddATP ddTTP

DNA Template

Polymerase

Excess dNTPs

Primer

Page 51: The Human Genome Project - Part I

HAGenetics.org

A G T A T G A A T T C 3� 5�

3� 5�

5� G

5� G A

5� A G A

5� A G A T

5� A G T A T

A G T A T 5� C

C 5� A G C T A T

A G C T A T T 5� C

5� 5� G 5� G A 5� A G A 5� A G A T 5� A G T A T A G T A T 5� C C 5� A G C T A T A G C T A T T 5� C

Sanger sequencing procedure

Page 52: The Human Genome Project - Part I

HAGenetics.org

A T G C

+

3�

5�

•  Analysis using high resolution polyacrylamide gel electrophoresis.

•  Fragments are detected using radioactive markers and autoradiography.

ACTGGTCAATCGATCGTA

Sanger sequencing procedure

_

Page 53: The Human Genome Project - Part I

Sanger sequencing - Gel

HAGenetics.org

•  Reading these gels for the human genome is a lot of work.

•  We need a faster more efficient method.

Page 54: The Human Genome Project - Part I

HAGenetics.org

•  Each dideoxy nucleotide is attached to a florescent marker.

•  At the end of each cycle, a laser beam can detect the florescent marker and thus record the position of the nucleotide.

Sanger sequencing - Automated

Page 55: The Human Genome Project - Part I

Chromatogram - Automated

HAGenetics.org