Pistoia Alliance USA Conference 2016

— APPLICATION OF MACHINE LEARNING TECHNIQUES TO PATIENT STRATIFICATIONALEXANDER IVLIEV, PHDOCTOBER 2016

—

2

—

Why is it important?Technical definition of the problemExample from our researchExample from published research

01020304

AGENDA

3

—

Why is it important?Technical definition of the problemExample from our researchExample from published research

01020304

AGENDA

4

—CASE STUDY - GEFITINIB

Early stratification would have prevented the withdrawal, but retrospective analysis did allow a new Marketing approval

1st line therapy for metastatic NSCLC with exon 19 deletions or exon 21 (L858R) substitutions

Launched with accelerated

approval

Withdrawn from market after phase III

study

NDA resubmission

with companion diagnostic

• Accelerate approval but post-marketing trial requested for full approval

• Phase III studies showed no improvement in overall survival

• Drug withdrawn from the market• Retrospective data analysis: 10% partial response in genotype unselected patients

>70% responders have activating mutations in EGFR

Best responses seen in East Asians, females and non-smokers

• NDA resubmitted with narrowed labelling and mutation-based companion diagnostic

• Unfortunately, lost market share to Tarceva during elucidation of patient segment

5

—TECHNICAL DEFINITION OF THE PROBLEM

Patients

Gene

s

Class 1,Disease type 1

Class 2,Disease type 2

Class ?

Patient

6

—AIM: TO DISCOVER PATIENT SELECTION BIOMARKERS FOR ERLOTINIB USING PRECLINICAL DATA SUCH AS CELL LINE

VIABILITY SCREENS

Clinical trial design for erlotinib

Gene expression240 cell lines

Sensitive Resistant

+

7

—INTEGRATED BIOMARKER DISCOVERY APPROACH

Statistical approaches

Pathway knowledgePathway-driven

approaches

8

—MACHINE LEARNING WORKFLOW

54,675 probesets

3,787 probesets

485 genes

191 genes

51 genes

9

—

ClinicalTrials.gov numbers: NCT00409968, NCT00411671, NCT00411632, NCT00410059, NCT00410189

http://cancerdiscovery.aacrjournals.org/external-ref?link_type=CLINTRIALGOV&access_num=NCT00409968





10

—VALIDATION: INDEPENDENT CLINICAL TRIAL

Thanks to Bin Li at Takeda for slide

11

—VALIDATION: INDEPENDENT CLINICAL TRIAL

Thanks to Bin Li at Takeda for slide

12

—BIOLOGICAL INTERPRETATION

Network Building Algorithms

Analyze networkAnalyze network (transcription factors)Analyze network (receptors)Transcription regulationShortest pathsTrace pathwaysSelf regulationDirect interactionsAuto expandExpand by one interactionManual expand

13

—

Why is it important?Technical definition of the problemExample from our researchPublished research: community effort

01020304

AGENDA

14

—MAQC CONSORTIUM

Leming Shi et al. MAQC-II study of common practices for the development and validationof microarray-based predictive models. Nat. Biotech., 2010

• 6 datasets• 13 endpoints• 36 data analysis teams• 30,000 models

15

—LESSONS LEARNED

The number one factor which predicts how accurate predictions will be is endpoint complexity


It’s easy to tell boys from girls

It’s hard to predict outcome of cancer treatment

16

—LESSONS LEARNED


“The top-performing teams were mainly industrial participants with many years of experience in microarray data analysis, whereas bottom-performing teams were mainly less-experienced graduate students or researchers.”

17

—LESSONS LEARNED


“Applying good modeling practices appeared to be more important than the actual choice of a particular algorithm.”

“Many models with similar performance can be developed from a given data set.”

18

—

Thank you for your attention

Science

Pistoia Alliance USA Conference 2016