46
Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.

Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Embed Size (px)

Citation preview

Page 1: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Genome organization

Eukaryotic genomes are complex and DNA amounts and organization vary

widely between species.

Page 2: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Genome OrganizationG

Page 3: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

C value paradox:

The amount of DNA in the haploid cell of an organism is not related to its evolutionary complexity or number of genes.

Page 4: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Highly Repeated Sequences

Page 5: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 6: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 7: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

There are different classes of eukaryotic DNA based on sequence complexity.

Page 8: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

105 106 107 108 109 1010 1011 1012

basepairs

Amount of DNA in a Genome Does Not Correlate with

Complexity

Page 9: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

How many genes do humans have?

Original estimate was between 50,000 to 100,000 genes

We now think humans have ~ 20,000 genes

How does this compare to other organisms?

Mice have ~30,000 genes

Pufferfish have ~35,000

Nematodes (C. elegans), have ~19,000

Yeast (S. cerevisiae) has ~6,000

The microbe responsible for tuberculosis has ~4,000

Page 10: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 11: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Single Copy Sequences

Exome

Page 12: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Even the Amount of DNA a Gene Spans Differs Among Species

Page 13: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Problems?• Some gene products are RNA (tRNA,

rRNA, others) instead of protein

• Some nucleic acid sequences that do not encode gene products (noncoding regions) are necessary for production of the gene product (protein or RNA).

• Eukaryotic genes are complex!

Page 14: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Gene Identification• Open reading frames• Sequence conservation

– Database searches– Synteny

• Sequence features– CpG islands

• Evidence for transcription– ESTs, microarrays

• Gene inactivation– Transformation, RNAi

Page 15: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Unique genes

Page 16: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Noncoding regions

• Regulatory regions– RNA polymerase binding site

– Transcription factor binding sites

• Introns

• Polyadenylation [poly(A)] sites

Page 17: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Splice Sites

• Eukaryotes only • Removal of internal parts of the newly

transcribed RNA.• Takes place in the cell nucleus• Splice sites difficult to predict

Page 18: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

One gene, many proteins via alternative splicing , 3’ cleavage and polyadenlyation

Page 19: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 20: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Exon Shuffling

Page 21: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Trans-Splicing in Higher Eukaryotes

Gingeras, Nature (2009) 461, 206-211

Page 22: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Non-contiguous Transcription Generates An Enormous Number of Possible

Transcripts

Gingeras, Nature (2009) 461, 206-211

Six 2-exon co-linear combinations from four exons

325 combinations of 3-exons, non-colinear

• Reassortment of exons coding for ncRNA or protein domains could dramatically increase number of functional products beyond the number of ‘genes’

• Trans-splicing exists in higher eukaryotes as well as in lower ones like Trypanosomes

Blue: only co-linearRed: all combinations

Page 23: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Why genome size isn’t the only concern (size doesn’t matter?)

• More sophisticated regulation of expression?

• Proteome vastly larger than genome?

– Alternate splicing

– RNA editing

• Postranslational modifications?

• Cellular location?

• Moonlighting

Page 24: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Gene families

• E.g. globins, actin, myosin• Clustered or dispersed• Pseudogenes

Page 25: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 26: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Pseudogenes

• Nonfunctional copies of genes

• Formed by duplication of ancestral gene, or reverse transcription (and integration)

• Not expressed due to mutations that produce a stop codon (nonsense or frameshift) or prevent mRNA processing, or due to lack of regulatory sequences

Page 27: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Duplicated genes• Encode closely related (homologous) proteins• Formed by duplication of an ancestral gene

followed by mutation

Five functional genes and two pseudogenes

Page 28: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Paralogs vs Orthologs

• Different members of the globin gene family are paralogs, having evolved one from another through gene duplication. Paralogs are separated by a gene duplication event.

• Each specific gene family member (e.g. a specific gene in human) is an ortholog of the same family member in another species (e.g. mouse). Both evolved from an ancestral globin gene. Orthologs are separated by a speciation event.

• It is not always easy to distinguish true orthologs from paralogs , especially in polyploid organisms!

Page 29: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Protein - coding sequences less than 1.5% of the genome in humans!

Page 30: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Noncoding RNAs (ncRNA)

• Do not have translated ORFs• Small• Not polyadenylated

Page 31: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 32: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 33: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 34: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Functions of Known lncRNAs

Ponting et al, Cell (2009) 136, 629-641

• Transcriptional interference -lncRNA transcription turns off transcription of nearby gene

• Initiation of chromatin remodeling - lncRNA transcription turns on transcription of nearby gene

• Promoter inactivation - lncRNA binds to TFIIB and to promoter DNA

• Activation of an accessory protein - lncRNA binds to allosteric effector protein TLS and inhibits histone acetyltransferase, decreasing transcription

Page 35: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Functions of Known lncRNAs

• Activation of transcription factors - binding of lncRNA to Dlx2 activates Dlx5/6 activity

• Oligomerization of an accessory protein - lncRNA induces heat shock factor trimerization

• Epigenetic silencing of gene clusters -Xist RNA inactivates X chromosome

• Epigenetic repression of genes in trans -HOTAIR binds PRC2, leading to methylation and silencing of several genes in HOXD locus

• Transport of transcription factors -lnRNA NRON keeps NFAT out of nucleus

Ponting et al, Cell (2009) 136, 629-641

Page 36: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

• ~97-98% of the transcriptional output of the human genome is ncRNA– Introns– Transfer RNAs (tRNA)

• ~ 500 tRNA genes in human genome– Ribosomal RNAs

• Tandem arrays on several chromosomes• 150-200 copies of 28S – 5.8S – 18S

cluster• 200-300 copies of 5S cluster

ncRNA

Page 37: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 38: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

• The level of transcription from human chromosomes 21 and 22 is an order of magnitude higher than can be accounted for by known or predicted exons

• Almost half of all transcripts from well-constructed mouse cDNA libraries are ncRNAs (identified because they do not code for an open reading frame of larger than 100 codons)

Genome Organization - ncRNA

Page 39: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Repeat sequences – 50% or more of the genome

Page 40: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Repetitive DNA

• Moderately repeated DNA– Tandemly repeated rRNA, tRNA and histone genes

(gene products needed in high amounts)– Large duplicated gene families– Mobile DNA

• Simple-sequence DNA– Tandemly repeated short sequences– Found in centromeres and telomeres (and others)– Used in DNA fingerprinting to identify individuals

Page 41: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Segmental duplications

• Found especially around centromeres and telomeres

• Often come from nonhomologous chromosomes

• Many can come from the same source• Tend to be large (10 to 50 kb)• Unique to humans?

Page 42: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Repetitive DNA - Segmental duplications

Page 43: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Mobile DNA

• Moves within genomes• Most of the moderately repeated DNA

sequences found throughout higher eukaryotic genomes– L1 LINE is ~5% of human DNA (~50,000

copies)– Alu is ~5% of human DNA (>500,000 copies)

• Some encode enzymes that catalyze movement

Page 44: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species
Page 45: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species

Repetitive DNA – Highly repetitive satellite DNA

Page 46: Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species