52
Introduction to Genetic Epidemiology HGEN619, 2006 Hermine H. Maes

Introduction to Genetic Epidemiology

  • Upload
    tacey

  • View
    74

  • Download
    4

Embed Size (px)

DESCRIPTION

Introduction to Genetic Epidemiology. HGEN619, 2006 Hermine H. Maes. Genetic Epidemiology. Establishing / Quantifying the role of genes and environment in variation in disease and complex traits ~ Answering questions about the importance of nature and nurture on individual differences - PowerPoint PPT Presentation

Citation preview

Page 1: Introduction to  Genetic Epidemiology

Introduction to Genetic EpidemiologyHGEN619, 2006

Hermine H. Maes

Page 2: Introduction to  Genetic Epidemiology

Genetic Epidemiology Establishing / Quantifying the role of genes and

environment in variation in disease and complex traits ~ Answering questions about the importance of nature and nurture on individual differences

Finding those genes and environmental factors

Page 3: Introduction to  Genetic Epidemiology

Genes & Environment How much of the variation in a trait is accounted

for by genetic factors? Do shared environmental factors contribute

significantly to the trait variation? The first of these questions addresses

heritability, defined as the proportion of the total variance explained by genetic factors

Page 4: Introduction to  Genetic Epidemiology

Nature-nurture question Sir Francis Galton: comparing the similarity of identical

and fraternal twins yields information about the relative importance of heredity vs environment on individual differences

Gregor Mendel: classical experiments demonstrated that the inheritance of model traits in carefully bred material agreed with a simple theory of particulate inheritance

Ronald Fisher: first coherent account of how the ‘correlations between relatives’ explained ‘on the supposition of Mendelian inheritance’

Page 5: Introduction to  Genetic Epidemiology
Page 6: Introduction to  Genetic Epidemiology
Page 7: Introduction to  Genetic Epidemiology
Page 8: Introduction to  Genetic Epidemiology

People and IdeasGalton (1865-ish)

CorrelationFamily Resemblance

TwinsAncestral Heredity

Mendel (1865)Particulate Inheritance

Genes: single in gamete double in zygote

Segregation ratios

Darwin (1858,1871)Natural SelectionSexual Selection

Evolution

Fisher (1918)Correlation & MendelMaximum Likelihood

ANOVA: partition of variance

Spearman (1904)Common Factor Analysis

Wright (1921)Path Analysis

Thurstone (1930's)Multiple Factor Analysis

Mather (1949) &

Jinks (1971)Biometrical Genetics

Model Fitting (plants) Joreskog (1960)Covariance

Structure AnalysisLISREL

Morton (1974)Path Analysis &

Family Resemblance

Watson &

Crick (1953)

Jinks & Fulker (1970)Model Fitting applied to humans

Martin & Eaves (1977)Genetic Analysis of

Covariance Structure

Elston etc (19..)Segregation

Linkage Rao, Rice, Reich,

Cloninger (1970's)Assortment

Cultural Inheritance

Neale (1990) MxMolecularGenetics

PopulationGenetics

2000

Page 9: Introduction to  Genetic Epidemiology
Page 10: Introduction to  Genetic Epidemiology

Biometrical Model

aa AA

Aa m

d-d

h

To make the simple two-allele model concrete, let us imagine that we are talking about genesthat influence adult stature. Les us assume that the normal range of height for males is from

4 feet 10 inches to 6 feet 8 inches; that is, about 22 inches. And let us assume that eachsomatic chromosome has one gene of roughly equivalent effect. Then, roughly speaking, we

are thinking in terms of loci for which the homozygotes contribute +- 1/2 inch (from themidpoint), depending on whether they are AA , the increasing homozygote, or aa , the

decreasing homozygote. In reality, although some loci may contribute greater effects thanthis, others will almost certaily contribute less; thus we are talking about the kind of model in

which any particular polygene is having an effect that would be difficult to detect by themethods of classical genetics.

in Biometrical Genetics chapter in Methodology for Genetic Studies of Twins and Families

Page 11: Introduction to  Genetic Epidemiology

0

1

2

3

1 Gene 3 Genotypes 3 Phenotypes

0

1

2

3

2 Genes 9 Genotypes 5 Phenotypes

01234567

3 Genes 27 Genotypes 7 Phenotypes

0

5

10

15

20

4 Genes 81 Genotypes 9 Phenotypes

Polygenic Traits

Page 12: Introduction to  Genetic Epidemiology

Stature in adolescent twins

Stature

190.0

185.0

180.0

175.0

170.0

165.0

160.0

155.0

150.0

145.0

Women700

600

500

400

300

200

100

0

Std. Dev = 6.40

Mean = 169.1

N = 1785.00

Page 13: Introduction to  Genetic Epidemiology

Physical attributes (height, eye color) Disease susceptibility (asthma, anxiety) Behavior (intelligence, personality) Life outcomes (income, children)

Individual differences

Page 14: Introduction to  Genetic Epidemiology

Polygenic Model Polygenic model: variation for a trait caused by a

large number of individual genes, each inherited in a strict conformity to Mendel’s laws

Multifactorial model: many genes and many environmental factors also of small and equal effect

Effects of many small factors combined > normal (Gaussian) distribution of trait values, according to the central limit theorem.

Page 15: Introduction to  Genetic Epidemiology

The normal distribution is to be expected whenever variation is produced by the addition of a large number of effects, non-predominant

This holds quite often

Quantitative traits

Central Limit Theorem

Page 16: Introduction to  Genetic Epidemiology

Body Mass Index vs “obesity” Blood pressure vs “hypertensive” Bone Mineral Density vs “fracture” Bronchial reactivity vs “asthma” Neuroticism vs “anxious/depressed” Reading ability vs “dyslexic” Aggressive behavior vs “delinquent”

Continuous or Categorical ?

Page 17: Introduction to  Genetic Epidemiology

unaffected affected

Disease liability

Single threshold

severe

Disease liability

Multiple thresholds

mildnormal mod

Multifactorial Threshold Model of Disease

Page 18: Introduction to  Genetic Epidemiology

Imprecise phenotype Phenocopies / sporadic cases Low penetrance Locus heterogeneity/ polygenic effects

Genetically Complex Diseases

Page 19: Introduction to  Genetic Epidemiology

Disease Phenotype

Commonenvironment

Marker Gene1

Individualenvironment

Polygenicbackground

Gene2

Gene3

Linkage

Linkagedisequilibrium

Mode ofinheritance Linkage

Association

Complex Trait Model

Page 20: Introduction to  Genetic Epidemiology

Causes of Variation pre-1990

estimation of ‘anonymous’ genetic and environmental components of phenotypic variation

genetic epidemiologic studies post-1990

identification of QTL’s: quantitative trait loci contributing to genetic variation of complex (quantitative) traits

linkage and association studies

Page 21: Introduction to  Genetic Epidemiology

Stages of Genetic Mapping Are there genes influencing this trait?

Genetic epidemiological studies Where are those genes?

Linkage analysis What are those genes?

Association analysis

Page 22: Introduction to  Genetic Epidemiology

Partitioning Variation phenotypic variance (VP) partitioned in genetic

(VG) and environmental (VE) VP = VG + VE Assumptions: additivity & independence of

genetic and environmental effects heritability (h2): proportion of variance due to

genetic influences (h2 = VG /VP) property of a group (not an individual), thus specific

to a group in place & time

Page 23: Introduction to  Genetic Epidemiology

Sources of Variance Genetic factors:

Additive (A)Dominance (D)

Environmental factors:Common / Shared (C)Specific / Unique (E)Measurement Error, confounded with E

Page 24: Introduction to  Genetic Epidemiology

Genetic Factors Additive genetic factors (A): sum of all the

effects of individual loci

Non-additive genetic factors: result of interactions between alleles at the same locus (dominance, D) or between alleles on different loci (epistasis)

Page 25: Introduction to  Genetic Epidemiology

Environmental Factors Shared [common or between-family]

environmental factors (C): aspects of the environment shared by members of same family or people who live together, and contribute to similarity between relatives

Non-shared [specific, unique or within-family] environmental factors (E): unique to an individual, contribute to variation within family members, but not to their covariation

Page 26: Introduction to  Genetic Epidemiology

Estimating Components Estimate phenotypic variance components from

data on covariances of related individuals Different types of relative pairs share different

amounts of phenotypic variance Biometrical genetics theory: specify amounts in

terms of genetic and environmental variances Three major types of study: family, adoption and

twin

Page 27: Introduction to  Genetic Epidemiology

Designs to disentangle G+E Resemblance between relatives caused by:

Shared Genes (G = A + D)

Environment Common to family members (C)

Differences between relatives caused by:

Non-shared Genes

Unique environment (E)

Page 28: Introduction to  Genetic Epidemiology

Informative Designs Family studies – G + C confounded

MZ twins alone – G + C confounded

MZ twins reared apart – rare, atypical, selective placement ?

Adoption studies – increasingly rare, atypical, selective placement ?

MZ and DZ twins reared together

Extended twin design

Page 29: Introduction to  Genetic Epidemiology

Classical Twin Study MZ and DZ twins reared together

MZ twins genetically identicalDZ twins share on average half their genes

Equal Environments AssumptionMZ and DZ twins share relevant

environmental influences to same extent

Page 30: Introduction to  Genetic Epidemiology

MZ and DZ twins: determining zygosity using ABI Profiler™ genotyping

(9 STR markers + sex)MZ DZ DZ

Identity at marker loci - except for rare mutation

Zygosity

Page 31: Introduction to  Genetic Epidemiology

MZ & DZ Correlations rMZ > rDZ: G (heritability) C: increase rMZ & rDZ Relative magnitude of the MZ and DZ

correlations > contribution of additive genetic (G) and shared environmental (C) factors

1-rMZ: importance of specific environmental (E) factors

Page 32: Introduction to  Genetic Epidemiology

Twin Correlations

*

*

MZDZ

.5

1.0

**

MZDZ

.6

.8

* *

MZDZ

.7

.8

*

*

MZDZ

.4

.8A

E

C

Page 33: Introduction to  Genetic Epidemiology

Example thus if, VP = VA + VC + VE = 2.0

CovMZ = VA + VC = 1.6CovDZ = 1/2VA + VC = 1.2

then, by algebra, VA = 0.8, VC = 0.8, VE = 0.4

but it isn’t always so simple, consider VP = 1.0, CovMZ = 0.6; CovDZ = 0.65

then VA = -0.1, VC = 0.7, VE = 0.4 nonsensical negative variance component

Page 34: Introduction to  Genetic Epidemiology

Observed Statistics Trait variance & MZ and DZ covariance as

unique observed statistics Estimate the contributions of additive genes (A),

shared (C ) and specific (E) environmental factors, according to the genetic model

Useful tool to generate the expectations for the variances and covariances under a model is path analysis

Page 35: Introduction to  Genetic Epidemiology

Path Analysis Allows us to diagrammatically represent linear

models for the relationships between variables Easy to derive expectations for the variances

and covariances of variables in terms of the parameters of the proposed linear model

Permits translation into matrix formulation

Page 36: Introduction to  Genetic Epidemiology

Phenotype

E C A D

UniqueEnvironment

AdditiveGenetic

SharedEnvironment

DominanceGenetic

e

ac

d

Variance ComponentsP = eE + aA + cC + dD

Page 37: Introduction to  Genetic Epidemiology

PT1

ACE

PT2

A C E

1

MZ=1.0 / DZ=0.5

e ac eca

ACE Model Path Diagram for MZ & DZ Twins

Page 38: Introduction to  Genetic Epidemiology

Model Fitting Evaluate significance of variance components -

effect size & sample size Evaluate goodness-of-fit of model - closeness of

observed & expected values Compare fit under alternative models Obtain maximum likelihood estimates

Page 39: Introduction to  Genetic Epidemiology

Mx Structural equation modeling package Software: www.vcu.edu/mx Manual: Neale et al. 2006 Free

Page 40: Introduction to  Genetic Epidemiology

Both continuous and categorical variables Systematic approach to hypothesis testing Tests of significance Can be extended to:

More complex questions Multiple variables Other relatives

Structural equation modeling

Page 41: Introduction to  Genetic Epidemiology

Are the same genes acting in males and females? (sex limitation)

Role of age on (a) mean (b) variance (c) variance components

Are G & E equally important in age, country cohorts? (heterogeneity)

Are G & E same in other strata (e.g. married/unmarried)? ( G x E interaction)

SEM: more complex questions I

Page 42: Introduction to  Genetic Epidemiology

Do the same genes account for variation in multiple phenotypes? (multivariate analysis)

Do the same genes account for variation in phenotypes measured at different ages? (longitudinal analysis)

Do specific genes account for variation/covariation in phenotypes? (linkage/association)

SEM: more complex questions II

Page 43: Introduction to  Genetic Epidemiology

Linkage & Association Analysis

Page 44: Introduction to  Genetic Epidemiology

Stages of Genetic Mapping Are there genes influencing this trait?

Epidemiological studies Where are those genes?

Linkage analysis What are those genes?

Association analysis

Page 45: Introduction to  Genetic Epidemiology

Sharing between relatives Identifies large regions

Include several candidates

Complex disease Scans on sets of small families popular No strong assumptions about disease alleles Low power Limited resolution

Linkage Analysis

Page 46: Introduction to  Genetic Epidemiology

Linkage Scan

Page 47: Introduction to  Genetic Epidemiology

Stages of Genetic Mapping Are there genes influencing this trait?

Epidemiological studies Where are those genes?

Linkage analysis What are those genes?

Association analysis

Page 48: Introduction to  Genetic Epidemiology

Sharing between unrelated individuals Trait alleles originate in common ancestor High resolution

Recombination since common ancestor Large number of independent tests

Powerful if assumptions are met Same disease haplotype shared by many patients

Sensitive to population structure

Association Analysis

Page 49: Introduction to  Genetic Epidemiology

Association Scan

Page 50: Introduction to  Genetic Epidemiology

Genome Scan Gene 1 Gene 2 Gene 3 Gene 4

Breast cancer DLC-1 Chr 8q Chr 13q

Lung cancer CD44 Chr 22q

Melanoma B-RAF

Type 2 diabetes PPAR PPP1R3A FOXA2 Chr 1q

HDL-C plasma level CETP LPL

Osteoarthritis AGC1

Schizophrenia DDC

Proof of Concept: Genes/Regions

Page 51: Introduction to  Genetic Epidemiology

First (unequivocal) positional cloning of a complex disease QTL !

Page 52: Introduction to  Genetic Epidemiology

From QTL to gene: the harvest begins: RKorstanje & B Paigen : Nature Genetics 31, 235 – 236 (2002)

Number of genes identified from QTL by year