24
ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Embed Size (px)

Citation preview

Page 1: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics

Research

Leslie Biesecker, M.D.National Human Genome Research

Institute, NIH

Page 2: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Translational genomics research space

Page 3: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Genetic architecture of disease

Common variants

Rare variants

Page 4: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Genetic architecture of disease

Common variants

Rare variants

The only way to assess both is by sequencing

Page 5: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Building a clinical genomics research program

1. Develop a robust infrastructure for the generation and use of genomic data in clinical research

2. Genomically dissect a phenotype with complex genetic architecture

3. To understand how to interact with subjects re genomic data

Page 6: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Initial approach

• 1,000 subjects recruited from general population

• Initial phenotype for atherosclerosis– Consented for follow-up sequencing– Consented for re-contact for phenotyping

• Sequence ~400 candidate genes– Selected for many reasons– Consented for WGS

• Associate variants with phenotypes• Return results

– A goal of the study is to learn from the subjects which results we should return

Page 7: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

ClinSeq GeneList V6 .

ABCA1 ABCG1 ABCG5 ABCG8 ACE ACTA2 ADH1C AGT AGTR1 ADIPOR1 ADIPOR2 ADIPOQ ALOX ALOX5AP ANGPTL3 ANGPTL4 ANRIL APOA1 APOA2 APOA5 APOB APOBEC1 APOC1 APOC2 APOC3 APOC4 APOE APOM AR ARF ARG1 ATF4 BDKRB2 BMI1 BSDL C11ORF2 CALM1 CAPG CAV1 CCL2 CCR2 CCR5 CD36

CD40 CD40LG CDKN1A CDKN2A CDKN2B CELSR2 CETP CFH CIITA CRP CXCR4 CYBA DGAT1 DGAT2 DUSP1 ENPEP ESR1 F13B F2 F5 F7 FABP2 FABP4 FAM5C FAS FGB FGG FLI1 FOS FTO GAL GALNT2 GAS6 GATA2 GCKR GCLC GCLM GDF5 GJA4 GP1BA GP1BB GP5 GP6 GPX1

HDAC2 HMGCR HMOX1 HMOX2 ICAM1 IL18 IL8 INSIG2 ITGA2 ITGB1 ITGB2 ITGB3 ITGB5 ITGB7 JAK3 KALRN KCNJ8 KCNMB1 KIF6 KL LCAT LDLR LDLRAP1 LGALS2 LIPC LIPE LIPG LPA LPL LRP6 LRP8 LTA LTA4 H LTC4S MBL2 MEF2A MLXIPL MMAB MMP3 MMP9 MOGAT1 MOGAT2 MPO

MTP MVK MYLK NCCT NFKB1 NKCC2 NOS1 NOS3 NOX1 NPC1 NPC1L1 NPR1 NRIH2 NRIH3 NRF1 OLR1 OR13G1 P2RY12 PAPPA PCSK9 PDGFB PDGFRB PER1 PITX2 PLA2G4A PLA2G7 PLAT PLTP PON1 PPARA PPARD PPARG PRDX2 PRDX3 PRKG1 PSMA6 PSRC1 PTGIS PTGS1 PTGS2 RAP1B RBKS ROMK

ROS1 RXRA SAR1B SCARB1 SCD SELL SELP SELS SERPINE1 SIRT1 SIRT3 SOAT1 SOAT2 SOCS3 SOD2 SOD3 SORT1 SREBF1 SREBF2 STAT1 TAS2R50 TBXA2R TCF1 TCF7L2 THBD THBS4 TIMP1 TLR4 TLR8 TNFRSF1A TNFSF4 TRIB1 TRIB3 UCP2 UCP3 USF1 VAMP8 VEGF VLDLR VNN1

Page 8: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Progress – the numbers

• Enrollment began January, 2007• 590 patients enrolled Feb. 1• 354 DNAs sequenced – PCR/3730

– 326 ClinSeq subject samples– 28 HapMap samples

• 219 genes– 3,500 genomic target sequences

• > 1.7 M sequence reads to date~825,000,000 bp of bidirectional genomic sequence

Page 9: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Quality measures

• Sample ID spike– Non-human genomic clone spike

• HapMap control DNAs– 1/30 ClinSeq DNAs

• Manual review of traces• CLIA sample confirmation

Page 10: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Visual quality

% coverage Q20

Page 11: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Yield snapshot

• Coverage– Sample 140 genes

• Targeted 402,907 bp ROI• Yielded 357,912 bp – 88.8% design coverage

• Variants– Total 3,353– Adjust sample coverage to exactly 500

chromosomes• Variants 2,271

– Extrapolate false positive rate• 1,984 variants

Page 12: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Uncommon alleles common

Page 13: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Data flow

Page 14: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Sub-projects underway

Positive controlsValidating recent assoc. of rare variants &

phenotypesSequencing genes under GWAS peaks for rare, high

penetrance variantsTesting associations of candidate genes with

phenotypeControl cohort for other sample setsSearch for miRNA variantscDNA sequencing pilot to measure expression &

isoformsCapture method refinementPatient motivations and preferences for results of

medical sequencingTesting automated vs. manual pedigree acquisition

Page 15: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Positive result example

• 65 yo female• High cholesterol diagnosed

at 25 years• RX: atorvastatin, ezetimibe,

hctz, lisinopril, niacin• Coro Ca++ 1,726• Chol 172, Trig 50, HDL 75,

LDLd 88

• LDLR p.Y188X

• Family members diagnosed & treatment started

Page 16: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Replications

• ANGPTL4 – (decr trigs)– 30 NS variants, one novel, p.E40K x 3– Not quite significant, await full sample

• SLC12A3, SLC12A1, & KCNJ1 (decr BP)– 15 NS variants– Not assoc with decr BP

Page 17: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Penetrance & frequency

Frequency

Pe

ne

tra

nc

e

Page 18: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Penetrance & frequency

Frequency

Pe

ne

tra

nc

e

We understand this work

Page 19: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Penetrance & frequency

Frequency

Pe

ne

tra

nc

e

We understand this work

Explore this

Page 20: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Penetrance & frequency

Frequency

Pe

ne

tra

nc

e

We understand this work

Explore this

This is probably not clinical genetics!

Page 21: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Classic translational paradigm

Sort Phenotype

Generate Hypothesis

Apply Assay

Correlate

Page 22: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Novel translational paradigm

Sort Phenotype

Generate Hypothesis

Apply Assay

Correlate

Sort Genotype

Generate Hypothesis

Apply Assay

Correlate(Sort) Phenotype

Page 23: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Implications

• Large numbers of patients are interested• Possible to consent subjects to WGS• Clinically relevant results can be sifted• Subjects can receive and act upon results• Going forward:

– Next-Gen: Exome -> WGS– More diverse population– Discovering associations of variants and

phenotype– Learning subject’s view in real setting

Page 24: ClinSeq: Piloting Large-Scale Medical Sequencing for Translational Genomics Research Leslie Biesecker, M.D. National Human Genome Research Institute, NIH

Collaborators• NISC

– Jim Mullikin, Bob Blakesly, Gerry Bouffard, Praveen Cherukuri, Pedro Cruz, Nancy Hanson, Morgan Park, Alice Young

• NHGRI– Eric Green, Flavia Facio,

Stephanie Brooks, Amy Linn, Paul Gobourne, Jennifer Johnston, Teri Manolio, Jamie Teer, Clesson Turner, Alec Wilson

• NHLBI– Richard Cannon, Andrew Arai,

Paul Hwang, Toren Finkel, Vandana Sachdev, Bob Shamburek

• NIDDK– Monical Skarulis, Kristina Rother

• NIHCC– Alan Remaley