35
Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from father. In transmitting a chromosome to an offspring, however, the physical process of recombination (crossing over) results in a chromosome that contains part of the maternal chromosome and part of the paternal chromosome. Recombination also makes possible a number of different analytical strategies in genetics: linkage, ancestry tracing, and some forms of association. Key terms: polymorphism, recombination, crossing over, linkage, linkage analysis, association design, haplotype, linkage equilibrium/disequilibrium, GWAS (genome-wide association study).

Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Embed Size (px)

Citation preview

Page 1: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Gene Hunting:Linkage and Association

We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from father. In transmitting a chromosome to an offspring, however, the physical process of recombination (crossing over) results in a chromosome that contains part of the maternal chromosome and part of the paternal chromosome.

Recombination also makes possible a number of different analytical strategies in genetics: linkage, ancestry tracing, and some forms of association.

Key terms: polymorphism, recombination, crossing over, linkage, linkage analysis, association design, haplotype, linkage equilibrium/disequilibrium, GWAS (genome-wide association study).

Page 2: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Recombination(Crossing Over)

In meiosis, homologous chromosomes join together at a section and exchange genetic

material.

Homologous chromosomes: chromosomes with the same genes on them. E.g., yourpaternal chromosome number 1 and your maternal chromosome number 1.

Page 3: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Recombination:

LinkageAnalysis

PopulationHaplotypeAnalysis

Genome-wideAssociation

(Current “Hot”Technology)

AncestryTracing

Association

Page 4: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Ab

C

d

aB

c

D

Ab

C

d

aB

c

D

Ab

c

D

C

d

aB

Example:

Page 5: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Key Point about Recombination:

• If two alleles are separated by 8 nucleotides, then there are “8 chances” of a recombination event between the two..

• If two alleles are separated by 257 nucleotides, then then are “257 chances” of a recombination event between the two.

• Therefore, alleles on the same DNA strand that are far away are more likely to be broken up by recombination than alleles that are close together.

Recombination is a function ofphysical distance.

Page 6: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

ATCGGCTAGCCTGACATT

ATTGGCTAGCCTGACGAT

CCTTAA

Pair Up

ATCGGCTAGCCTGACATT

ATTGGCTAGCCTGACGAT

Dad Mom

OriginalChromosomes:

ATCGGCTAGCCTGACGAT

ATTGGCTAGCCTGACATT

Exchange Material

NewChromosomes:

Allele 1

Allele 2

Page 7: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

ATCGGCTAGCCTGACATTGC

ATCGGCTAGCCTGACATTGC

3 chances

10 chances

17 chances

Page 8: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

In other words:

Alleles close together on the same DNA strand(i.e., the same chromosome) tend to be transmittedas a unit.

Alleles far away on the same DNA strand tend to bebroken up.

Page 9: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Definitions:

Linkage: Biological phenomenon that close to one another tyend to transmitted as a unit.

Linkage Analysis:(1) tracing the co-segregation of(2) one or more marker genes with a trait gene within pedigrees(3) within families

Page 10: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Definitions:

Trait gene: A gene that contributes to the trait of interest, e.g., schizophrenia.

Marker Gene: A polymorphic “gene” with that does not contribute to the trait but has a known location in the genome.

Page 11: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Rationalefor Linkage Analysis

Can I predict who gets the disorder (trait)by knowing the marker genes in a family?

YES: A trait gene is close to a marker.

NO: No trait genes are close to the marker.

Page 12: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Aa Aaaa AaAa

Aa

aaaa

aa

Father’s chromosomes are AD

ad

Linkage Analysis

Page 13: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Aa Aaaa AaAa

Aa

aaaa

aa

Father’s chromosomes areA

D

a

d

Linkage Analysis

Page 14: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Aa aaI.1 I.2

aa AaII.5 II.6

Aa aaII.7 II.8

Aa

AA

AA

Aa

II.1 II.2

III.1 III.2

aaAaII.3 II.4

Aa aaIII.3 III.4

AaaaIII.5 III.6

Aa aaIII.7 III.8

Page 15: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

ADad

adad

I.1 I.2

ADad

AdAd

II.1 II.2

adad

Adad

II.3 II.4

adad

Adad

II.5 II.6

ADad

adad

II.7 II.8

AD

adad

ad

III.8III.7

ad

Adad

ad

III.6III.5

Ad

adad

ad

III.4III.3

Ad

AdAD

ad

III.2III.1

Page 16: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

ATCTGCCTCGCCATAAAGTCATTCGCTCATATCTGCCTCGCCATAAAGTCATTCGCTGATATCAGCCTCGCCATAAAGTCATTCGCTCATATCAGCCTCGCCATAAAGTCATTCGCTGAT

DNA Strand: Haplotype:

TCTGACAG

Haplotype

Series of alleles along a short section of the same strand of DNA

T alleleA allele

position 4:

C alleleG allele

position 28:

Page 17: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Linkage Equilibrium & Disequilibrium

If I know the first allele in a haplotype,can I predict the second allele?

Yes No

LinkageDisequilibrium

LinkageEquilibrium

Page 18: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Linkage Equilibrium & Disequilibrium

In other words:

Equilibrium: Frequency of a haplotype is due to chance.

Disequilibrium: Frequency of a haplotype differs from chance frequency.

Page 19: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Haplotype

Chance: If the frequency of allele T is .2

and the frequency of allele C is .4, then the frequency of haplotype TC is .2*.4 = .08.

Nonchance: If the frequency of allele T

is .2 and the frequency of allele C is .4, then the frequency of haplotype TC is significantly different from .08.

(Graduate)

Page 20: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Position 4:

Position 28:

T

A

TC TG

AC AG

C G

Haplotype(Graduate)

Page 21: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Position 4:

Position 28:

T

A

C G

.2

.8

.6.4

.08

.32

.12

.48

Equilibrium(Graduate)

Page 22: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Position 4:

Position 28:

T

A

C G

.2

.8

.6.4

.16

.14

.04

.56

Disequilibrium(Graduate)

Page 23: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Position 4:

Position 28:

T

A

C G

p1

q1

q2p2

X11

X21

X12

X22

Statistics for Equilibrium(Graduate)

Page 24: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

= X11X22 - X12X21= cov(L1, L2)

D = X11- p1p2 = X22 - q1q2

If D > 0, D = D/Dmax where Dmax = min(p1q2, p2q1)

D and R2 are the most often used stats.

R2 = 2 / (p1p2q1q2)

Statistics for Equilibrium(Graduate)

If D > 0, D = D/Dmax where Dmax = min(p1p2, q1q2)

Page 25: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

1. Mutation occurs and creates a new spelling variation (polymorphism).

2. This creates linkage disequilibrium with those polymorphisms along the same DNA strand with the mutation.

3. Over generations, recombination will break up the disequilibrium with polymorphisms that are far away from the mutation.

4. Polymorphisms close to the original mutation, however, will remain in disequilibrium for a longer time.

5. Hence, polymorphisms close to the mutation will be in disequilibrium longer than polymorphisms farther away from the mutation.

Formation of Disequilibrium(Graduate)

Page 26: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

1. Is the norm rather than the exception for short sections of DNA (100,000 nucleotides).

2. Generates “haplotype blocks” (see next slide).

3. Haplotype Mapping Project (HapMap): provide a map of the haplotype blocks for the human genome.

4. Allows genome-wide association studies.

Disequilibrium:

Page 27: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Haplotype Blocks:

Section of DNA (vertical bar = polymorphism):

Block 1Block 2

Block 7

• Haplotype Block: Series of adjacent alleles in strongdisequilibrium.

• Logic: Instead of genotyping all 37 polymorphisms, genotype one in each block.

• If there is a “hit,” then go back and genotype the other polymorphisms in that block.

Page 28: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Haplotype block structure of the cytochrome P450 CYP2C gene cluster on chromosome 10.

From Walton et al. (2005), Nature Genetics 37, 915-0916.

Page 29: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Association Design

• Begins with KNOWN polymorphism theoreticallyexpected to be associated with the trait (e.g., DRD2 and schizophrenia).

• Genotypes people on the gene and phenotypes them on the trait.

•Tests whether the genotype is associated with the trait.

•Two types:

(1)Population-based (controls = general pop)(2)Family-based (controls = genetic relatives)

Page 30: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Population-basedAssociation Design

Phenotype:

Genotype:

Schiz:

Not Schiz:

AA Aa aa

Do 2 test for association.

Page 31: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from
Page 32: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Genome-wide Association Study(GWAS)

(1) Genotype one locus per haplotype block

(2) Do an association test for every gene.

(3) Number of genes that can be assayed changesfrom year to year.

Page 33: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

GWAS: Genome-wideAssociation Study

1. DNA arrays with 1,000s of SNPs scattered throughout the genome. (Current chips in 2009 has1,000, 000 different SNPs)

2. Select the SNPs so that they cover ALL the genome. (Some DNA chips concentrate on known protein coding regions rather than trying to cover all the genome)

3. Genotype patients and controls on all the SNPs.

4. Find the SNPs that differ.

5. Problem: number of statistical tests.

Page 34: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Problems with GWAS

(1) Expensive.

(2) Large number of statistical tests.

(3) Need very, very large samples (10,000 or more.

Page 35: Gene Hunting: Linkage and Association We humans are diploid (i.e., we have two copies of a gene), inheriting one chromosome from mother, the other from

Results from GWAS

(1) Good success in medicine.

(2) Limited success for psychiatric disorders

(3) Virtually no success for normal behavioral traits (personality, IQ)

(4) Genetics of behavior is hyper-polygenic: many, many, many genes