On genome-wide association studies (GWAS)

•association

•linkage disequilibrium

•population structure

•case/control design

•single nucleotide polymorphism data

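[Figure: the same stretch of double-stranded DNA on Chromosome 1 and Chromosome 2; the two positions where the sequences differ are each labeled SNP]

Chromosome 1

AAGTCAGTCTAGGAAGTCAGTCTAGGAATCGGGTCGGG

TTCAGTCAGATCCTTCAGTCAGATCCTTAGCCCAGCCC

Chromosome 2

TTCAGTCAGATCCTTCAGTCAGATCCCCAGCCCAGCCC

AAGTCAGTCTAGGAAGTCAGTCTAGGGGTCGGGTCGGG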

Population structure explained part of the significant +11.2% inflation of test statistics we observed in an analysis of 6,322 nonsynonymous SNPs in 816 cases of type 1 diabetes and 877 population-based controls from Great Britain. The remainder of the inflation resulted from differential bias in genotype scoring between case and control DNA samples, which originated from two laboratories, causing false-positive associations.

Nature Genetics 37, 1243-1246 (2005) | Published online: 9 October 2005 | doi:10.1038/ng1653

Population structure, differential bias and genomic control in a large-scale, case-control association study

David G Clayton, Neil M Walker, Deborah J Smyth, Rebecca Pask, Jason D Cooper, Lisa M Maier, Luc J Smink, Alex C Lam, Nigel R Ovington, Helen E Stevens, Sarah Nutland, Joanna M M Howson, Malek Faham, Martin Moorhead, Hywel B Jones, Matthew Falkowski, Paul Hardenbol, Thomas D Willis & John A Todd

•premise: population structure inflates the variance of the test statistic under the null

•Y_i^2 ~ chi-square(1) ideally

•Y_i^2 ~ inflation factor lambda * chi-square(1)

•so use T_i = Y_i^2/lambda.hat

•lambda.hat = median(Y_i^2)/[ null median ], where the null median of chi-square(1) is about 0.455

Genomic Control (Devlin and Roeder)
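The genomic control recipe above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name genomic_control, the simulated statistics and the inflation factor of 1.1 are all assumptions made for the example. The null median is computed as the square of the 75th percentile of a standard normal, which equals the median of a chi-square(1) variable.

```python
import random
from statistics import NormalDist, median

# Median of a chi-square(1) variable: the square of the 75th percentile
# of a standard normal, roughly 0.455.
NULL_MEDIAN = NormalDist().inv_cdf(0.75) ** 2


def genomic_control(chi2_stats):
    """Return (lambda_hat, corrected statistics) for 1-df test statistics.

    lambda_hat = median(Y_i^2) / null median, and T_i = Y_i^2 / lambda_hat.
    """
    lambda_hat = median(chi2_stats) / NULL_MEDIAN
    corrected = [y2 / lambda_hat for y2 in chi2_stats]
    return lambda_hat, corrected


# Simulated null statistics, inflated by a hypothetical factor of 1.1
# (an assumption for illustration, not a value from the slides).
rng = random.Random(0)
true_lambda = 1.1
stats = [true_lambda * rng.gauss(0.0, 1.0) ** 2 for _ in range(20000)]

lambda_hat, corrected = genomic_control(stats)
print(f"lambda.hat = {lambda_hat:.3f}")
print(f"median after correction = {median(corrected):.3f}")
```

The median is used rather than the mean so that a handful of truly associated SNPs with very large statistics have little influence on lambda.hat.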

•genomic control (Devlin & Roeder)

•structured association (Pritchard et al)

•principal components (Price et al)

Handling population structure

Article | Nature 447, 661-678 (7 June 2007) | doi:10.1038/nature05911; Received 26 March 2007; Accepted 11 May 2007

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls

The Wellcome Trust Case Control Consortium

•UK population; European ancestry

•seven diseases (BD, CAD, CD, HT, RA, T1D, T2D); 50 research groups

•2,000 cases per disease

•3,000 common controls (two distinct sets)

•Affymetrix 500K mapping array set

•16,179 samples included (809 dropped for reasons including contamination and non-Caucasian ancestry)

•469,557 SNPs included (93.8%)

•Average call rate 99.63%

•392,575 have MAF > 1%

Quality Control
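The per-SNP quantities quoted on this slide, call rate and minor allele frequency (MAF), can be computed as in the sketch below. The genotype coding (0, 1 or 2 copies of one allele, None for a missing call), the toy SNP names and the 0.95 call-rate threshold are assumptions for illustration; only the MAF > 1% cutoff comes from the slide.

```python
def call_rate(genotypes):
    """Fraction of samples with a non-missing genotype call."""
    called = [g for g in genotypes if g is not None]
    return len(called) / len(genotypes)


def minor_allele_frequency(genotypes):
    """MAF from genotypes coded as 0, 1 or 2 allele copies (None = missing)."""
    called = [g for g in genotypes if g is not None]
    freq = sum(called) / (2 * len(called))  # each sample carries two alleles
    return min(freq, 1.0 - freq)


# Hypothetical toy data: one list of per-sample genotypes per SNP.
snps = {
    "snp_common": [0, 1, 2, 1, 0, 1, 1, 2, 1, 0],
    "snp_rare": [0] * 199 + [1],
    "snp_poorly_called": [0, None, None, 1, None, 2, None, 0, 1, None],
}

MIN_CALL_RATE = 0.95  # assumed threshold, not taken from the slides
MIN_MAF = 0.01        # the MAF > 1% cutoff quoted on the slide

kept = [
    name
    for name, genotypes in snps.items()
    if call_rate(genotypes) >= MIN_CALL_RATE
    and minor_allele_frequency(genotypes) > MIN_MAF
]
print(kept)
```

In this toy data only snp_common passes both filters: snp_rare has a MAF of 1/400, and snp_poorly_called is missing half of its calls.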

There may be important population structure that is not well captured by current geographical region of residence. Present implementations of strongly model-based approaches such as STRUCTURE (refs 11, 12) are impracticable for data sets of this size, and we reverted to the classical method of principal components (refs 13, 14), using a subset of 197,175 SNPs chosen to reduce inter-locus linkage disequilibrium. Nevertheless, four of the first six principal components clearly picked up effects attributable to local linkage disequilibrium rather than genome-wide structure. The remaining two components show the same predominant geographical trend from NW to SE but, perhaps unsurprisingly, London is set somewhat apart.
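As a rough illustration of the principal components idea in the excerpt above, the sketch below simulates two subpopulations with different allele frequencies and extracts the leading component by power iteration. It is a toy under stated assumptions, not the consortium's pipeline: the function names, group sizes and allele frequencies are invented, the simulated SNPs are independent (so no pruning for linkage disequilibrium is needed), and only the first component is computed.

```python
import random


def standardize(genotypes):
    """Center and scale each SNP column of a samples-by-SNPs 0/1/2 matrix."""
    n = len(genotypes)
    columns = []
    for col in zip(*genotypes):
        mean = sum(col) / n
        sd = (sum((g - mean) ** 2 for g in col) / n) ** 0.5
        if sd > 0:  # skip SNPs with no variation in the sample
            columns.append([(g - mean) / sd for g in col])
    return [list(row) for row in zip(*columns)]


def first_pc_scores(rows, iterations=200, seed=0):
    """Leading eigenvector of X X^T (per-sample PC1 scores, up to scale)."""
    rng = random.Random(seed)
    gram = [[sum(a * b for a, b in zip(r1, r2)) for r2 in rows] for r1 in rows]
    vec = [rng.gauss(0.0, 1.0) for _ in rows]  # random start vector
    for _ in range(iterations):
        vec = [sum(g * v for g, v in zip(gram_row, vec)) for gram_row in gram]
        norm = sum(v * v for v in vec) ** 0.5
        vec = [v / norm for v in vec]
    return vec


def simulate(n_per_group=20, n_snps=200, seed=1):
    """Two hypothetical subpopulations with independent allele frequencies."""
    rng = random.Random(seed)
    freqs = [(rng.uniform(0.1, 0.9), rng.uniform(0.1, 0.9)) for _ in range(n_snps)]

    def draw(p):
        # Genotype = number of copies of the allele across two chromosomes.
        return (rng.random() < p) + (rng.random() < p)

    group_a = [[draw(pa) for pa, _ in freqs] for _ in range(n_per_group)]
    group_b = [[draw(pb) for _, pb in freqs] for _ in range(n_per_group)]
    return group_a + group_b


scores = first_pc_scores(standardize(simulate()))
print("group A:", [round(s, 2) for s in scores[:20]])
print("group B:", [round(s, 2) for s in scores[20:]])
```

With allele frequencies this strongly differentiated, the two groups should fall on opposite sides of the first component. Real populations such as the one in the excerpt differ far more subtly, which is why many more SNPs and samples are needed there.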

The overall effect of population structure on our association results seems to be small, once recent migrants from outside Europe are excluded. Estimates of over-dispersion of the association trend test statistics (usually denoted λ; ref. 15) ranged from 1.03 and 1.05 for RA and T1D, respectively, to 1.08–1.11 for the remaining diseases. Some of this over-dispersion could be due to factors other than structure, and this possibility is supported by the fact that inclusion of the two ancestry informative principal components as covariates in the association tests reduced the over-dispersion estimates only slightly (Supplementary Table 6), as did stratification by geographical region. This impression is confirmed on noting that P values with and without correction for structure are similar (Supplementary Fig. 9). We conclude that, for most of the genome, population structure has at most a small confounding effect in our study, and as a consequence the analyses reported below do not correct for structure. In principle, apparent associations in the few genomic regions identified in Table 1 as showing strong geographical differentiation should be interpreted with caution, but none arose in our analyses.
