Cutpoint determination in continuous predictive variables in survival analysis

Santiago Pérez Hoyos
Unitat d’Estadística i Bioinformàtica (UEB)
Vall d’Hebron Institut de Recerca (VHIR), health research institute accredited by the Instituto de Salud Carlos III (ISCIII)

2014 Spanish Stata Users Group meeting
Barcelona, 23 October 2014



Index

• Motivation
• How to select a cutpoint
• Caveats
• Solutions
• Stata flowchart
• Examples
• Further work

Motivation

• Increasing interest in binary categorization of continuous variables in clinical or epidemiological data (gene expression, biomarkers, biochemical parameters, etc.)
• Main objective: to build prognostic scores for a follow-up event
• Easy to compute
• Classifies subjects as high or low risk
• Allows impact measures (hazard ratios) to be calculated

Examples

• Survival 60 days after leaving the ICU, depending on severity scores and biochemical indices
• Time to death or AIDS in HIV-infected subjects, depending on age, CD4 count, RNA viral load, etc.
• Cancer survival depending on gene expression or biomarkers

How to select a cutpoint

• Based on graphics of the relationship between the continuous variable and the outcome
• Median or percentiles
• Based on previous literature
• Based on the best fit or the most significant relation with the outcome (minimum p-value over all possible cutpoints)

Minimum p-value

• All values of the prognostic variable, except a proportion of extreme values, are candidate cutpoints
• The value that best separates the outcomes, according to the maximum test statistic or the minimum p-value, is chosen
• The maximum log-rank statistic is used in the R maxstat package
• The likelihood profile is used in our approach in Stata
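The scan described above can be sketched in a few lines. This is a minimal Python illustration of a maximally selected log-rank statistic, not the maxstat implementation and not the author's Stata code. The function names (`logrank_chi2`, `max_logrank_cutpoint`), the 5% trimming default and the toy data are assumptions made for the example. The p-value returned is the unadjusted one for the selected cutpoint.

```python
import math


def logrank_chi2(times, events, in_group1):
    """Two-sample log-rank chi-square statistic (1 degree of freedom)."""
    obs_minus_exp = 0.0
    variance = 0.0
    for t in sorted({ti for ti, e in zip(times, events) if e}):
        n = n1 = d = d1 = 0
        for ti, e, g in zip(times, events, in_group1):
            if ti >= t:                 # still at risk just before t
                n += 1
                n1 += g
                if ti == t and e:       # event exactly at t
                    d += 1
                    d1 += g
        obs_minus_exp += d1 - d * n1 / n
        if n > 1:                       # hypergeometric variance term
            variance += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    return obs_minus_exp ** 2 / variance if variance > 0 else 0.0


def max_logrank_cutpoint(x, times, events, trim=0.05):
    """Scan candidate cutpoints; return (cutpoint, chi2, unadjusted p-value)."""
    xs = sorted(x)
    n = len(xs)
    lo = xs[int(trim * n)]                       # lower trimming bound
    hi = xs[min(n - 1, int((1 - trim) * n))]     # upper trimming bound
    best = None
    for c in sorted({v for v in x if lo <= v < hi}):
        group = [int(v > c) for v in x]          # dichotomize at c
        chi2 = logrank_chi2(times, events, group)
        if best is None or chi2 > best[1]:
            best = (c, chi2)
    if best is None:
        raise ValueError("no candidate cutpoints after trimming")
    cut, chi2 = best
    p_value = math.erfc(math.sqrt(chi2 / 2))     # chi-square(1) upper tail
    return cut, chi2, p_value


# Made-up toy data: subjects with x > 10 fail early, the rest late or censored.
x = list(range(1, 21))
times = [40 + v if v <= 10 else v - 10 for v in x]
events = [1 if (v > 10 or v % 2 == 0) else 0 for v in x]
print(max_logrank_cutpoint(x, times, events))
```

Because the cutpoint is chosen to maximize the statistic, the printed p-value is optimistic and should not be reported without a correction.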

Caveats

• Inflation of type I error rates
• Overestimated measures of effect
• Loss of information when categorizing
• The cutpoint needs to be replicated in similar data

Solutions

• Bonferroni correction: $P_{bonf} = p_{min} \times n$, where $n$ is the number of candidate values
• Considering only data between P5 and P95: $P_{alt} = -3.13\, p_{min}\, (1 + 1.65 \ln p_{min})$
• Benjamini-Hochberg q-values (qqvalue in Stata)
• Cross-validation
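The three p-value adjustments listed above are simple to compute once the scan has produced one p-value per candidate cutpoint. The following Python sketch is an illustration only. The function names are hypothetical, results are clamped to the interval [0, 1] as a convenience, and the $P_{alt}$ formula is transcribed from the slide, where it is meant for small values of $p_{min}$.

```python
import math


def bonferroni(p_min, n_tests):
    """Bonferroni: minimum p-value times the number of candidate cutpoints."""
    return min(1.0, p_min * n_tests)


def p_alt(p_min):
    """Slide formula for data restricted to P5-P95:
    P_alt = -3.13 * p_min * (1 + 1.65 * ln(p_min))."""
    value = -3.13 * p_min * (1 + 1.65 * math.log(p_min))
    return min(1.0, max(0.0, value))   # clamp: the approximation breaks for large p_min


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg q-values, returned in the input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    q = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):       # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        q[i] = running_min
    return q


# Hypothetical example: minimum p-value 0.001 found among 20 candidate cutpoints.
print(bonferroni(0.001, 20))
print(p_alt(0.001))
print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

Bonferroni treats the candidate tests as independent, which is conservative here because neighbouring cutpoints give highly correlated tests.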

Stata Template

• Define local parameters (regression type, titles, variables)
• Delete missing values; select data between the 5th and 95th percentiles
• Store the null model
• Loop over the unique values of the quantitative variable and dichotomize the variable at each one
• Fit a regression model and calculate the likelihood ratio test
• Save the likelihood and p-value in a temporary file
• Calculate the Bonferroni and Benjamini-Hochberg corrections

Stata Template

• Select the minimum p-value (first observation after sorting with gsort)
• Plot the likelihood profile and the hazard ratio (HR) profile
• Top-ten table of p-values
• Fit the regression model with the selected cutpoint
• Use html functions for output
• Save results and graphs in an html file
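The core of the template (dichotomize, fit, likelihood ratio test, sort) can be sketched outside Stata as follows. This is a Python illustration under stated assumptions, not the author's template: it fits a Cox model with a single binary covariate by Newton-Raphson on the Breslow partial likelihood, and the function names, the step and coefficient clamps, and the 5% trimming default are all choices made for this example.

```python
import math


def risk_table(times, events, z):
    """Per distinct event time: (d, d1, n0, n1) = events, events with z=1,
    number at risk with z=0, number at risk with z=1."""
    rows = []
    for t in sorted({ti for ti, e in zip(times, events) if e}):
        d = d1 = n0 = n1 = 0
        for ti, e, zi in zip(times, events, z):
            if ti >= t:
                n1 += zi
                n0 += 1 - zi
                if ti == t and e:
                    d += 1
                    d1 += zi
        rows.append((d, d1, n0, n1))
    return rows


def loglik(beta, rows):
    """Breslow partial log-likelihood for one binary covariate."""
    w = math.exp(beta)
    return sum(d1 * beta - d * math.log(n0 + n1 * w) for d, d1, n0, n1 in rows)


def fit_binary_cox(times, events, z, max_iter=50, tol=1e-8):
    """Return (beta, likelihood ratio statistic, p-value) against the null model."""
    rows = risk_table(times, events, z)
    beta = 0.0
    for _ in range(max_iter):
        w = math.exp(beta)
        score = info = 0.0
        for d, d1, n0, n1 in rows:
            p = n1 * w / (n0 + n1 * w)
            score += d1 - d * p
            info += d * p * (1 - p)
        if info <= 0:
            break
        step = max(-2.0, min(2.0, score / info))    # damp large Newton steps
        beta = max(-20.0, min(20.0, beta + step))   # guard against separation
        if abs(step) < tol:
            break
    lr = 2 * (loglik(beta, rows) - loglik(0.0, rows))
    return beta, lr, math.erfc(math.sqrt(max(lr, 0.0) / 2))


def likelihood_profile(x, times, events, trim=0.05):
    """One row (cutpoint, hazard ratio, LR, p) per candidate, best first."""
    xs = sorted(x)
    n = len(xs)
    lo, hi = xs[int(trim * n)], xs[min(n - 1, int((1 - trim) * n))]
    profile = []
    for c in sorted({v for v in x if lo <= v < hi}):
        z = [int(v > c) for v in x]                 # dichotomize at c
        beta, lr, p = fit_binary_cox(times, events, z)
        profile.append((c, math.exp(beta), lr, p))
    profile.sort(key=lambda row: -row[2])           # largest LR, i.e. smallest p, first
    return profile
```

The first row of `likelihood_profile(...)` plays the role of the first observation after sorting, and the first ten rows give the top-ten table. When one group has no events at all, the coefficient runs into the clamp, so the hazard ratio in that row is not meaningful even though the likelihood ratio statistic still is.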

Example

• Data from the GEMES Spanish HIV seroconverters study
• 2257 HIV seroconverters
• Outcome of interest: time to death
• CD4 and age as covariates
• Find a cutpoint for CD4 and for age

[Slides 12 to 20 contain output and figures only. After one untitled slide, the slide titles are, in order: Top ten table; Profile plot; Survival after cutpoint; Cox after cutpoint; Survival after cutpoint; Profile plot; Profile plot; Cox after cutpoint.]

Further work

• Build an ado program in Stata
• Extend to more than one variable
• Extend to more than one cutpoint

References

• Mazumdar M, Glassman JR. Tutorial in Biostatistics: Categorizing a prognostic variable: review of methods, code for easy implementation and applications to decision-making about cancer treatments. Stat Med. 2000;19:113-132.
• Williams BA, Mandrekar JN, Mandrekar SJ, Cha SS, Furth AF. Finding optimal cutpoints for continuous covariates with binary and time-to-event outcomes. Technical Report Series #79. Department of Health Sciences Research, Mayo Clinic. 2006.
• Hothorn T, Lausen B. Maximally selected rank statistics with several p-value approximations. R package ‘maxstat’. July 2, 2014.

Thanks

Gràcies

Gracias