Upload
olivia-campbell
View
216
Download
0
Tags:
Embed Size (px)
Citation preview
Charting The Genotype-Phenotype Map:Lessons From Drosophila
Trudy F. C. Mackay
Department of Genetics &
W. M. Keck Center for Behavioral Biology
North Carolina State University
Raleigh, NC 27695 USA
Evolutionary Quantitative Genetics
What maintains variation for quantitative traits within populations?
Do the same loci causing variation within populations also cause divergence between populations? Between species?
How predictable is evolution?
Quantitative Traits
Genetic Variation
Environmental Variation
Gene-Environment Interaction and Correlation
AACTGGCCTCCTCCTCCTCCTATACGTTACCGG………………….CGATTCCAAAAACTGGCCTCCT……………….ATACGTTACCGG………………….AAACCTTAGCAACTGGCCTCCT……………….ATACGTTACCGC………………….AAACCTTAGCAACTGGCCTCCT……………….ATACGTTACCGC……..3kb……..AAACCTTAGC
Genetic Resources
Model Organism
Genetic Manipulation
Living library of genetic variation
Community resource for association mapping complex traits
Replicated genotypes
Accurate phenotypes, multiple phenotypes (pleiotropy), multiple environments (GEI)
Freeze 1 genome sequences of 168 lines
192
Drosophila melanogaster Genetic Reference Panel (DGRP)
Mackay et al. 2012. Nature 482:173
Isofemale Lines
20 GenerationsFS Inbreeding
205 Inbred Lines
Natural PopulationRaleigh NC, USA
0
5
10
15
20
25
30
35
40
DGRP Line
Ch
ill
Co
ma
Rec
ove
ry T
ime
(min
)
0
5
10
15
20
25
30
35
40
45
DGRP Line
Sta
rtle
Res
po
nse
(se
c)
0
20
40
60
80
100
120
DGRP Line
Sta
rvat
ion
Res
ista
nce
(h
r)
H2 = 0.43
H2 = 0.56
H2 = 0.58
Freeze 2 Genotyping
205 lines sequenced on Illumina platform Initial variant discovery using 7 existing callers Global list of variants grouped into haplotype bins Reads re-aligned to haplotypes to count supporting and opposing reads Final genotypes called by the Joint Genotyper for Inbred Lines (JGIL), which computes a
quality score for each genotype and for each variant
Site Frequency Spectra
* Minor alleles are grouped to calculate MAF
The Numbers
SNP/MNPs3,976,011
Indels462,416
Polymorphic microsatellites23,754
Potentially damaging variants6,523
Freeze 2 Molecular Variation in DGRP Lines
Potentially damaging annotations: SPLICE_SITE, START_LOST, EXON_DELETED, FRAME_SHIFT, STOP_GAINED, STOP_LOST
Hundreds of genes affected per line Large gene families enriched for affected genes (e.g. olfactory receptors) Novel alleles for functional analysis
Potentially Damaging Variants in DGRP Lines
Variation in Genome Size in DGRP Lines
Reference Strain
167.5-
170.0
170.1-
172.5
172.6-
175.0
175.1-
177.5
177.6-
180.0
180.1-
182.5
182.6-
185.0
185.1-
187.5
187.6-
190.0
190.1-
192.5
192.6-
195.0
0
10
20
30
40
50
60
70
80
90
Genome Size (Mb)
Num
ber
of L
ines
Other Information
Wolbachia infection status of each DGRP line
Cytogenetic analysis of polymorphic inversions:In(2L)t
In(2R)Nova ScotiaIn(3L)Payne
In(3R) MouradIn(3R)PayneIn(3R)Kodani
In(3R)C
Inversions and Number of
Segregating Sites
Distribution of Pairwise Genomic Relationships
Inversions Account for Most High Genomic
Relatedness
Genomic Relatedness Among Lines with Same Inversions
5 50 95 140 185 230 275 320 365 410 455 500 545 590 635 680 725 770 815 860 905 950 9950
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
X
2L
2R
3L
3R
Distance (bp)
R s
qu
are
d
Mackay et al. 2012. Nature 482:173
Rapid Local LD Decay
Association Analyses
A T
Phe
noty
pic
Val
ue
A TP
heno
typ
ic V
alu
e
No Association Association
http://dgrp.gnets.ncsu.edu/Faster GWAS
GWAS includes Wolbachia, inversions, relationship matrixLatest FlyBase annotations of variants
GWAS in the DGRP: Starvation Resistance
Many significant SNPs (FDR < 0.1)Many sex-specific effectsNovel genes
Standardized effects not small
Most significant SNPs are low frequency
Local LD
Mackay et al. 2012. Nature 482:173
0 0.1 0.2 0.3 0.4 0.50
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
Minor Allele Frequency
a/σ
p
Relationship Between Effect Size and MAF
Mackay et al. 2012. Nature 482:173
R_Spider Analysis of Ethanol Sensitivity GWASwww.BioProfiling.de
Axon guidance
Protein phosphorylation
Purine metabolism
Neurotransmitter secretion
G-protein coupled receptor
TV Morozova et al., unpublished
Additive Multi-SNP Model Explains Most Variation
Startle Responser2 = 0.72
Starvation Resistancer2 = 0.81
Chill Coma Recoveryr2 = 0.84
Mackay et al. 2012. Nature 482:173
But Predictive Ability is Low
*GBLUP, 5-fold cross validation; ~2.5 million common SNPs
Trait Broad Sense H2 Predictive Ability*
Starvation Resistance 0.58 0.239 ± 0.008
Startle Response 0.56 0.230 ± 0.012
Chill Coma Recovery 0.43 -0.038 ± 0.008
Ober et al. 2012. PLoS Genetics 8: e1002685
Small sample size?
Rare alleles?
Alleles with large effects?
Non-additive genetic variation?
Testing Multi-Locus Predictions: Flyland
Round robin mating40 DGRP lines
Random mating70 generations
N= 800/generation
Flyland Population
Starvation resistanceStartle response
Chill coma recovery
6 libraries/pool300X depth/pool
Huang et al., PNAS 109: 15553
Complex Genetic Architecture in Flyland
276 SNPs, PBonferroni < 0.05
61 SNPs, PBonferroni < 0.05
320 SNPs, PBonferroni < 0.05
Huang et al., PNAS 109: 15553
SNP Effects Not Small
Huang et al., PNAS 109: 15553
No Replication between Flyland and DGRP
Huang et al., PNAS 109: 15553
No Replication Between Flyland and DGRP:Not From Different P-value Thresholds
Huang et al., PNAS 109: 15553
No Replication between Flyland and DGRP:Effects Not Correlated
Huang et al., PNAS 109: 15553
No Replication between Flyland and DGRP:SNPs not in LD
Huang et al., PNAS 109: 15553
No Replication between Flyland and DGRP:Epistasis
Trait
Trait-associated
common SNPs (DGRP
+ Flyland)
Significant epistatic
pairs
Trait-associated SNPs
with at least one
significant interacting
SNP (Flyland + DGRP)
Starvation resistance 411 (144 + 267) 19,629 288 (57 + 231)
Startle response 105 (52 + 53) 1,252 48 (11 + 37)
Chill coma recovery 472 (164 + 308) 57,863 314 (79 + 235)
Huang et al., PNAS 109: 15553
Context Dependence Due to Differences in Allele Frequency
Epistatic Networks Reveal Common Interacting Genes
Huang et al., PNAS 109: 15553
Interconnected Networks of Common Interacting Genes
Starvation Resistance Chill Coma Recovery
SNPs context-dependent, network architecture conserved Huang et al., PNAS 109: 15553
Epistatic Interaction Networks Enriched for Known Genetic Interactionswww.BioProfiling.de
Starvation Resistance P < 0.005
Lipid metabolism
Signal transduction
Huang et al., PNAS 109: 15553
Epistatic Interaction Networks Enriched for Known Genetic Interactionswww.BioProfiling.de
Chill Coma Recovery P < 0.05
Nervous system function
Nervous system development, locomotion
Huang et al., PNAS 109: 15553
General Strategy
1. Test for interactions between SNPs that do not replicate in different populations, reciprocally in each population
2. Within a population: pairwise search for epistasis using GWAS SNPs as focal SNPs
3. Within a population: Identify SNPs with different allele frequencies between extremes of quantitative trait distribution; use these as focal SNPs for pairwise GWAS
0
5
10
15
20
25
30
35
40
DGRP Line
Ch
ill
Co
ma
Rec
ove
ry T
ime
(min
)
0
5
10
15
20
25
30
35
40
45
DGRP Line
Sta
rtle
Res
po
nse
(se
c)
0
20
40
60
80
100
120
DGRP Line
Sta
rvat
ion
Res
ista
nce
(h
r)
H2 = 0.43
H2 = 0.56
H2 = 0.58
Is this a lot of variation?
Or too little given molecular diversity?(~4 Million SNPs + non-SNP variants)
Additive SNP effects may be underestimated if they interact epistatically but the model does not include epistasis
A Paradigm Shift?
Has the genome evolved to suppress effects of deleterious mutations?
Epistatic interactions = genetic basis of canalization (homeostasis)
Evidence for Epistasis From h2 (Realized) and H2
Observed Expected
Trait Realized h2 H2 (Inbred Lines)
H2 = 2h2/(1+h2)
Copulation latency
0.07 0.25 0.13
Startle response
0.16 0.58 0.28
Aggressive behavior
0.09 0.78 0.17
Ethanol knock-down time
0.08 0.24 0.15
Mackay et al. 2005, PNAS 102: 6622-6629; Edwards et al. 2006, PLoS Genetics 2: 3154; Jordan et al. 2007, Genome Biol. 8: R172; Morozova et al. 2009, Genetics 83: 733-745; Ayroles et al. 2009, Nat. Genet. 41: 299-307; Edwards et al. 2009, Genome Biol. 10: R76; Morozova et al. 2009, Genetics 83: 733-745
12
14
16
18
20
22
A1 A2
Ph
en
oty
pic
Val
ue
B1
B2
12
14
16
18
20
22
24
26
28
30
A1 A2
Ph
en
oty
pic
Val
ue
B1
B2
StarvationResistance
StartleResponse
Chill ComaRecovery
Evidence for Epistasis From h2 (GBLUP) and H2
Apparent additivity Estimates of VA include VAA
Epistasis from change of magnitude of effects in different backgrounds
Largely epistastic Epistasis from change of direction of
effects in different backgrounds
Hypothesis
IF
Extensive epistatic interaction networks suppress allelic effects in outbred populations
THEN EXPECT
Large numbers of induced mutations in a single inbred background affect quantitative traits (decanalization)
The same mutations have different effects in different genetic backgrounds
Mutations affecting the same trait perturb the same underlying network and interact epistatically
Natural variants suppress (buffer) effects of newly induced mutations
Response to selection from mutations affecting the same trait will be less than expected based on homozygous and heterozygous effects
1 2 3
Life Span (deviation from control)-60 -40 -20 0 20
Cum
ulat
ive
freq
uenc
y (%
)
0
20
40
60
80
100
Magwire et al. 2010, PLoS Genetics 6: e1001037
P-Element Mutagenesis in Drosophila
Large Numbers of Induced Mutations Affect Quantitative Traits?
21.9%
23.3%
6.2%
34.7%
32.7%
41.1%
37.1%
24.8%
Dilda & Mackay 2002, Genetics 162: 1655; Norga et al. 2003, Curr. Biol. 13: 1388-1397; Harbison et al. 2004, Genetics 166: 1807-1823; Sambandan et al. 2006, Genetics 174: 1349-1363; Yamamoto et al. 2008, PNAS 105: 12393-12398; Edwards et al. 2009, BMC Biology 7: 29; Magwire et al. 2010, PLoS Genetics 6: e1001037; Morozova et al. 2011, Genetics 187: 1193-1205
Same Mutations Have Different Effects in Different Genetic Backgrounds?
6-ph
osph
ofru
cto-2
-kin
ase
(F)
CG1099
0 (B
)
CG1122
6 (B
)
CG1333
4/CG13
333
(B)
CG1470
9 (B
)
CG9650
(A)
couc
h po
tato
(F)
Defen
se re
pres
sor 1
(A)
Gliota
ctin/
CG3793
(B)
leth
al(1
)G02
89/C
G3267
9 (B
)
mus
hroo
m-b
ody e
xpre
ssed
(F)
plex
us (B
)
spich
thyin
/CG57
76 (B
)
Vacuo
lar H
+ ATPas
e 16
kD su
buni
t (B)
-50.00
-40.00
-30.00
-20.00
-10.00
0.00
10.00
20.00
30.00
Mutation (Background)
Lif
es
pa
n (
da
ys
, de
via
tio
n f
rom
co
ntr
ol)
Magwire et al. 2010, PLoS Genetics 6: e1001037
Mutations Affecting the Same Trait Interact Epistatically?
×
1,2 1,3 1,4 1,5 1,6 1,7 1,8 1,9 1,10
2,3 2,4 2,5 2,6 2,7 2,8 2,9 2,10
3,4 3,5 3,6 3,7 3,8 3,9 3,10
4,5 4,6 4,7 4,8 4,9 4,10
5,6 5,7 5,8 5,9 5,10
6.7 6,8 6,9 6,10
7,8 7,79 7,10
8,9 8,10
9,10
Federowicz et al. 1998, Genetics 148: 1885-1891
Mutations Affecting the Same Trait Interact Epistatically?
Sambandan et al. 2006, Genetics 174: 1349-1363; Yamamoto et al. 2008, PNAS 105: 12393-12398; Magwire et al. 2010, PLoS Genetics 6: e1001037
CG10990
BG00817
BG00004
CG9238
esg
CG31531
pyd
mub
BG00297
crol
CG10990
BG00817
BG00004
CG9238
esg
CG31531
pyd
mub
BG00297
crol
Males Females
Startle Response
Olfactory Behavior
Life Span
Response to Selection From Mutations Less Than Additive?
Sema-1a lola
robo
emcCsp
HLHm7
CSB
CSB/Sem
a-1a
Sema-
1a
CSB/lola lol
a
CSB/robo
robo
CSB/em
cem
c
CSB/Csp Csp
CSB/HLH
m7
HLHm
70
10
20
30
40
50
Genotype
Sta
rtle
Re
sp
on
se
(s
ec
)
Serrano-Négron et al., unpublished data
Response to Selection From Mutations Less Than Additive?
p = 2/3; q = 1/3
VA = 2pqƩ[a+d(q-p)]2
VD = Ʃ(2pqd)2
h2 = VA/(VA+VD+VE)
h2EXP = 0.951
R1 = ih2σP
REXP = 15.5 Serrano-Négron et al., unpublished data
Response to Selection From Mutations Less Than Additive?
G0 G1 G2 G3 G4 G5 G6 G7 G8 G9 G1015
20
25
30
35
40
Generation
Star
tle R
espo
nse
(sec
)
h2 = 0.078
Serrano-Négron et al., unpublished data
Natural Variants Suppress Effects of Newly Induced Mutations?
X 2 3
Canton S wt
Canton S M
Canton S wt Canton S M
Ph
en
oty
pic
Va
lue
2a
X 2 3RAL_1 wt
RAL_2 wt
RAL_3 wt
RAL_4 wt
RAL_5 wt
RAL_i wt
RAL_1 M
RAL_2 M
RAL_3 M
RAL_4 M
RAL_5 M
RAL_i M
Exp.
RAL_1 wt - 2a
RAL_2 wt - 2a
RAL_3 wt - 2a
RAL_4 wt - 2a
RAL_5 wt - 2a
RAL_i wt - 2a
Obs.
RAL_1 M
RAL_2 M
RAL_3 M
RAL_4 M
RAL_5 M
RAL_i M
Expected – Observed = Estimate of Interaction Effect (I)
Natural Variants Suppress Effects of Newly Induced Mutations?
Sema-5cHLHm7
CspCrc
neur
-25
-20
-15
-10
-5
0
5
10
5-10
0-5
-5-0
-10--5
-15--10
-20--15
-25--20
Yamamoto et al. Genet. Res. 91: 373-382
Startle Response
Natural Variants Suppress Effects of Newly Induced Mutations?
Yamamoto et al. Genet. Res. 91: 373-382
-10
0
10
20
30
40
50
-10 0 10 20 30 40
Sta
rtle
Re
sp
on
se
(s)
Startle Response (s)
02468
1012141618
0 5 10 15 20 25
|I|
|2a|
Lessons Learned
Extensive epistasis for Drosophila quantitative traits
Epistatic interactions define genetic networks that are:
Highly interconnected
Enriched for biologically plausible GO categories, metabolic and cellular pathways
Novel
Implications of epistasis:
Additivity an emergent property of underlying epistatic networks
Population-specific genetic architecture for individual molecular variants, but common network
Individual risk prediction requires knowledge of interacting partners
Missing heritability – small additive effects even though interactions highly significant
Genomic basis of selection response: population-specific
Potential for speciation
(De)canalization and human disease
Team DGRP Freeze 2.0
Funding
Spencer JohnsonAaron Tarone
Stephen RichardsRichard Gibbs
Yi HanJeff Reid
Robert AnholtWen Huang
Yutaka InoueRichard LymanMike Magwire
Tanya MorozovaJason Peiffer
Yazmin Serrano-NégronEric Stone
Lavanya TurlipatiAki Yamamoto
Bart DeplanckeAndreas Massouras
Eileen FurlongJan Korbel
Thomas Zichner
David Mittelman