Upload
pistoia-alliance
View
96
Download
0
Embed Size (px)
Citation preview
— APPLICATION OF MACHINE LEARNING TECHNIQUES TO PATIENT STRATIFICATIONALEXANDER IVLIEV, PHDOCTOBER 2016
—
2
—
Why is it important?Technical definition of the problemExample from our researchExample from published research
01020304
AGENDA
3
—
Why is it important?Technical definition of the problemExample from our researchExample from published research
01020304
AGENDA
4
—CASE STUDY - GEFITINIB
Early stratification would have prevented the withdrawal, but retrospective analysis did allow a new Marketing approval
1st line therapy for metastatic NSCLC with exon 19 deletions or exon 21 (L858R) substitutions
Launched with accelerated
approval
Withdrawn from market after phase III
study
NDA resubmission
with companion diagnostic
• Accelerate approval but post-marketing trial requested for full approval
• Phase III studies showed no improvement in overall survival
• Drug withdrawn from the market• Retrospective data analysis: 10% partial response in genotype unselected patients
>70% responders have activating mutations in EGFR
Best responses seen in East Asians, females and non-smokers
• NDA resubmitted with narrowed labelling and mutation-based companion diagnostic
• Unfortunately, lost market share to Tarceva during elucidation of patient segment
5
—TECHNICAL DEFINITION OF THE PROBLEM
Patients
Gene
s
Class 1,Disease type 1
Class 2,Disease type 2
Class ?
Patient
6
—AIM: TO DISCOVER PATIENT SELECTION BIOMARKERS FOR ERLOTINIB USING PRECLINICAL DATA SUCH AS CELL LINE
VIABILITY SCREENS
Clinical trial design for erlotinib
Gene expression240 cell lines
Sensitive Resistant
+
7
—INTEGRATED BIOMARKER DISCOVERY APPROACH
Statistical approaches
Pathway knowledgePathway-driven
approaches
8
—MACHINE LEARNING WORKFLOW
54,675 probesets
3,787 probesets
485 genes
191 genes
51 genes
9
—
ClinicalTrials.gov numbers: NCT00409968, NCT00411671, NCT00411632, NCT00410059, NCT00410189
10
—VALIDATION: INDEPENDENT CLINICAL TRIAL
Thanks to Bin Li at Takeda for slide
11
—VALIDATION: INDEPENDENT CLINICAL TRIAL
Thanks to Bin Li at Takeda for slide
12
—BIOLOGICAL INTERPRETATION
Network Building Algorithms
Analyze networkAnalyze network (transcription factors)Analyze network (receptors)Transcription regulationShortest pathsTrace pathwaysSelf regulationDirect interactionsAuto expandExpand by one interactionManual expand
13
—
Why is it important?Technical definition of the problemExample from our researchPublished research: community effort
01020304
AGENDA
14
—MAQC CONSORTIUM
Leming Shi et al. MAQC-II study of common practices for the development and validationof microarray-based predictive models. Nat. Biotech., 2010
• 6 datasets• 13 endpoints• 36 data analysis teams• 30,000 models
15
—LESSONS LEARNED
The number one factor which predicts how accurate predictions will be is endpoint complexity
Leming Shi et al. MAQC-II study of common practices for the development and validationof microarray-based predictive models. Nat. Biotech., 2010
It’s easy to tell boys from girls
It’s hard to predict outcome of cancer treatment
16
—LESSONS LEARNED
Leming Shi et al. MAQC-II study of common practices for the development and validationof microarray-based predictive models. Nat. Biotech., 2010
“The top-performing teams were mainly industrial participants with many years of experience in microarray data analysis, whereas bottom-performing teams were mainly less-experienced graduate students or researchers.”
17
—LESSONS LEARNED
Leming Shi et al. MAQC-II study of common practices for the development and validationof microarray-based predictive models. Nat. Biotech., 2010
“Applying good modeling practices appeared to be more important than the actual choice of a particular algorithm.”
“Many models with similar performance can be developed from a given data set.”
18
—
Thank you for your attention