Upload
mckenzie-kemp
View
44
Download
3
Embed Size (px)
DESCRIPTION
Cervical Cancer Case Study. Eshetu Atenafu, Sandra Gardner, So-hee Kang, Anjela Tzontcheva University of Toronto Department of Public Health Sciences (Biostatistics) Acknowledgments: Professors P.Corey, J. Hsieh, W. Lou, J.Stafford. Outcome Variable. - PowerPoint PPT Presentation
Citation preview
Cervical CancerCase Study
Eshetu Atenafu, Sandra Gardner,
So-hee Kang, Anjela Tzontcheva
University of Toronto
Department of Public Health Sciences
(Biostatistics)
Acknowledgments: Professors P.Corey, J. Hsieh, W. Lou, J.Stafford
Outcome Variable
• Time to event calculated as recurrence date - surgery date; otherwise censored at death or last follow up date
• 4 cases where recurrence date > follow up date
• Decided there were no cases of left-censoring
• N=871, 68 recurrent events, 92% censored for a total of 3,573 person-years of follow up over the time period of 1984 to 2001
Covariate manipulation
PELLYMPH AGE - 40 SURGYR - 1993
0 1
- +
CLS
0 1
clear other
MARGINS
0 1
1cm >1cm
MAXDEPTH
HIST 1(SCC)
HIST 3 (AC)
HISTOLOG
0 1
3cm >3cm
SIZE
GRADE 2 GRADE 3
GRADE
0 1
non treated
ADJ_RAD
Covariate Summary (1)
• Age - median 40 years
• 3% with disease left after surgery
• 13% received radiation therapy
• 46% capillary-lymphatic space invasion
• 6% positive pelvic lymph nodes
• Histology– SCC 62%, AC 28%
Covariate Summary (2)
• Tumor grade (cell differentiation)– better 21%, moderate 52%, worst 27%
• Maximum depth of tumor– 22% greater than 1 cm
• Tumor size– 5% greater than 3 cm
• Median year of surgery is 1993
Methods
• Univariate log-rank tests
• Non-parametric survival trees (CART-SD)
• Semi-parametric (Cox regression)
• Parametric models (Exponential, Weibull, Log-normal)
Log-rank testsVariable % Missing
Log-rank Wilcoxon Log-rank Wilcoxon
ADJ_RAD 0.0004 0.0002 0.6 0.02 0.01AGE 0.69 0.71 0.0 0.37 0.46CLS <0.0001 <0.0001 11.9 <0.0001 0.0003GRAD 0.03 0.01 15.6 0.1 0.05HISTOLOG 0.11 0.04 0.1 0.65 0.42MARGINS 0.05 0.03 0.1 0.31 0.24MAXDEPTH <0.0001 <0.0001 14.6 <0.0001 <0.0001PELLYMPH <0.0001 <0.0001 0.0 0.0002 0.0002SIZE <0.0001 <0.0001 2.9 0.0002 <0.0001SURGYR 0.67 0.79 0.0 0.47 0.53
All Data (n=871) Complete Data (n=549)
Using all available data per variable
Complete data (n=549)
MAXDEPTH
• Loss of power concerns
• We are losing 23 recurrent event cases due to missing Maxdepth and only 4 for other missing covariates
• We developed models including and excluding Maxdepth
• Attempted imputation of all missing values (TRANSCAN and IMPUTE, Design and Hmisc S-plus/R libraries, F.Harrell)
Survival Trees
• Builds a binary decision tree and groups patients with similar prognosis
• Uses maximized version of Log-rank test to split the data into groups with different survival
• Advantages: non-parametric, “ranks” covariates by importance, captures interactions
• Disadvantages: non-interpretability of large trees, excludes cases with missing values
Survival Tree including Maxdepth
|maxdepth<17.95
cls:1
pellymph:1
size:1
549
491
273 218
197 21
58
49 9
low
moderate high
high high
0 1 2 3 4 5
Years
0.0
0.2
0.4
0.6
0.8
1.0 low
moderate
high
Survival Tree excluding Maxdepth
|size:1
pellymph:1
cls:1
645
610
569
322 247
41
35
low moderate
high
high
0 1 2 3 4 5
Years
0.0
0.2
0.4
0.6
0.8
1.0 low
moderate
high
Comparisons of Cox modelsStandard
Parameter Estimate Error p Exp(Estimate) Lower Upper
Including MaxdepthMAXDEPTH 1.14 0.33 0.0005 3.13 1.64 5.98CLS (1 or 2) 1.06 0.37 0.0047 2.89 1.40 5.96SIZE>3cm 1.02 0.49 0.0385 2.77 1.06 7.25
Excluding MaxdepthCLS (1 or 2) 1.28 0.31 <.0001 3.60 1.96 6.59SIZE>3cm 1.78 0.32 <.0001 5.91 3.17 11.03HISTOLOGY (SCC) -0.93 0.30 0.0021 0.39 0.22 0.71PELLYMPH(+) -0.28 0.65 0.6700 0.76 0.21 2.69SCC*PELLYMPH 1.57 0.73 0.0320 4.78 1.14 20.04
After imputationMAXDEPTH 1.20 0.27 <.0001 3.31 1.95 5.62CLS (1 or 2) 0.95 0.30 0.0014 2.59 1.44 4.66SIZE>3 cm 1.31 0.32 <.0001 3.71 1.98 6.94HISTOLOGY(SCC) -0.50 0.26 0.0500 0.61 0.36 1.01
95% CI
Using imputed data
Using all available data per variable
Model ComparisonStandard
Parameter Estimate Error p Exp(-Estimate) Lower Upper
ExponentialIntercept 4.31 0.26 <.0001 0.01 0.01 0.02CLS (1 or 2) -1.30 0.31 <.0001 3.69 2.01 6.77SIZE>3 cm -1.92 0.31 <.0001 6.83 3.69 12.65HISTOLOGY (SCC) 0.99 0.30 0.0010 0.37 0.20 0.67PELLYMPH(+) 0.36 0.65 0.5836 0.70 0.20 2.50SCC*PELLYMPH -1.74 0.73 0.0171 5.71 1.36 23.86
Cox Proportional Hazards Exp(Estimate)CLS (1 or 2) 1.28 0.31 <.0001 3.60 1.96 6.59SIZE>3 cm 1.78 0.32 <.0001 5.91 3.17 11.03HISTOLOGY(SCC) -0.93 0.30 0.0021 0.39 0.22 0.71PELLYMPH(+) -0.28 0.65 0.6700 0.76 0.21 2.69SCC*PELLYMPH 1.57 0.73 0.0320 4.78 1.14 20.04
Log NormalIntercept 5.12 0.50 <.0001 4.15 6.09CLS (1 or 2) -1.49 0.38 <.0001 -2.24 -0.75SIZE>3 cm -2.56 0.51 <.0001 -3.56 -1.57HISTOLOGY (SCC) 0.92 0.35 0.0084 0.24 1.61PELLYMPH(+) -1.22 0.48 0.0118 -2.17 -0.27Scale 2.28 0.23 1.87 2.78
95% CI
Exponential Model Prognostic Groups
PELLYMPH CLS 1 or 2 HISTOLOGY (SCC) SIZE>3 cm Rate/Pyr P(Survival=2) P(Survival=5)
High RiskY Y Y Y 0.497 0.37 0.08N Y N Y 0.336 0.51 0.19Y Y N Y 0.237 0.62 0.31Y N Y Y 0.135 0.76 0.51N Y Y Y 0.125 0.78 0.54N N N Y 0.092 0.83 0.63Y Y Y N 0.073 0.86 0.69 *
Medium RiskN Y N N 0.049 0.91 0.78Y Y N N 0.035 0.93 0.84N N Y Y 0.034 0.93 0.84
Low RiskY N Y N 0.020 0.96 0.91N Y Y N 0.018 0.96 0.91N N N N 0.013 0.97 0.94Y N N N 0.009 0.98 0.95N N Y N 0.005 0.99 0.98 *
* see plot
Log-normal Model Prognostic GroupsPELLYMPH CLS 1 or 2 HISTOLOGY (SCC) SIZE>3 cm Rate/Pyr P(Survival=2) P(Survival=5)
High RiskY Y Y Y 0.197 0.51 0.36N Y N Y 0.161 0.57 0.41Y Y N Y 0.394 0.36 0.22Y N Y Y 0.077 0.76 0.61N Y Y Y 0.090 0.72 0.57N N N Y 0.066 0.79 0.66Y Y N N 0.071 0.77 0.64
Medium RiskN Y N N 0.038 0.90 0.81Y Y Y N 0.044 0.88 0.78N N Y Y 0.041 0.89 0.79
Low RiskY N Y N 0.021 0.97 0.92N Y Y N 0.024 0.96 0.90N N N N 0.019 0.97 0.94Y N N N 0.033 0.92 0.84N N Y N 0.012 0.99 0.97
Comparison of Prognostic Groups
Conclusions (1)• Important prognostic factors are:
– tumor size >3cm– capillary-lymphatic space invasion– positive pelvic lymph nodes– Squamous cell carcoma type histology
• Missing values and imputation issues with respect to maximum depth of tumor are of concern
Conclusions (2)• We have selected 3 prognostic groups using non-
parametric and parametric methods• Parametric models appear to overestimate the 5 year
survival probability for the high risk group• Non-parametric and parametric 5 years survival estimates
for the prognostic groups are similar, but the parametric models group fewer patients for high and moderate risk compared to the survival tree
• We are concerned, however, that the predictive ability of these models is poor.
Another Cohort• Ishikawa H. et al. (1999) Prognostic Factors of Adenocarcinoma
of the Uterine Cervix, Gynecologic Oncology 73:42-46• Nakanishi T. et al. (2000) A Comparison of Prognoses of
Pathologic Stage 1b Adenocarinoma and Squamous Cell Carcinoma of the Uterine Cervix, Gynecologic Oncology 79:289-293
• Nakanishi T. et al. (2000) The significance of tumor size in clinical stage 1b cervical cancer: Can a cut-off figure be determined?, International Journal of Gynecologic Cancer 10:397-401
References• LeBlanc, M. and Crowley J. (1993) Survival Trees
by Goodness of Split. JASA 88: 457-467
• Segal, M. R.(1988) Regression Trees for Censored Data. Biometrics 44: 35-47
• Lausen B and Schumacher M. (1992) Maximally Selected Rank Statistics. Biometrics 48: 73-85
• Haupt G. Survival Trees in S-plus (library survcart demo)