30
Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Embed Size (px)

Citation preview

Page 1: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Empowering Evidence: Basic Statistics

June 3, 2015

Julian Wolfson, Ph.D.Division of BiostatisticsSchool of Public Health

Page 2: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

My goal for today

Introduce the major statistical “potholes” to be on the lookout for when interpreting published research.

Page 3: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

NOT my goal for today

● Cram an intro stat course into one hour

● Teach you how to analyze data● Give you a recipe for deciding whether

a statistical analysis has been conducted correctly

Page 4: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Leek and Peng, Nature (2015)

Page 5: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

3. Statistical vs. scientific significance

1. Selection bias and confounding

2. Multiple comparisons and post-hoc analysis

Page 6: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

What is statistics?

Statistics is the science which allows us to draw reliable conclusions from data.

For the purposes of evaluating evidence, we are mainly interested in statistical inference, which involves quantifying the uncertainty of our conclusions based on the data in hand.

Page 7: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health
Page 8: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

● In medical science, we mostly seek to understand cause-effect relationships.

● Randomized intervention studies are one important tool.

● But sometimes we are “stuck” with observational data.

All roads lead to causality

Page 9: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Selection bias & confounding

Page 10: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Selection bias: what to watch out for

● Unmeasured (unmeasurable?) risk factors

● Exclusion of observations with missing data

● Post-randomization comparisons in randomized studies:o “Compliers only” analyseso Surrogate endpoint analyses

Page 11: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health
Page 12: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health
Page 13: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Selection bias & confoundingRecommendations:1.Seek randomized trial evidence wherever possible, but be skeptical of non-ITT analyses.2.Look for how missing data and drop-out were handled3.Do a “mental sensitivity analysis” → how large would effect of selection bias / confounding have to be to change the scientific conclusion?

Page 14: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

The almighty p-value

For statistical inference, we seek to evaluate the plausibility of the null hypothesis about some characteristic of the population:●Mean SBP is 135●Probability of getting the flu over next 6 months is the same for two vaccines●Median time to progression is the same for two chemotherapeutic agents

Page 15: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

The almighty p-value

We evaluate the evidence for/against the null hypothesis on the basis of our sample.

The p-value tells us how surprised we should be to see data “as or more extreme” than that in our sample.

If the p-value based on our sample is small (we are very surprised!), we take a leap of faith and declare that the null hypothesis is false.

Page 16: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

http://xkcd.com/1478/

Page 17: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

P-value fishing

Hypothesis tests:●Are designed to control the Type I error rate, the probability of rejecting the null hypothesis when it is true.●Rely on the assumption that you are performing a single well-defined, repeatable experiment.

Page 18: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

P-value fishing

Problem: Common abuses of significance testing (“p-value fishing”) result in hypothesis testing procedures which do NOT control the Type I error rate.

One of the most common abuses is post-hoc subgroup analysis.

Page 19: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Example

● Suppose you perform a study to assess the effect of a tuberculosis vaccine vs. placebo

● Overall, there is no effect, but you notice that the vaccine appears more effective in Hispanic women.

● You test the null hypothesis of no vaccine effect for Hispanic women → p = 0.013

● You report “The vaccine offers protection against tuberculosis for Hispanic females (p = 0.013).”

Page 20: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health
Page 21: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Post-hoc subgroup analysisThe major problem with post-hoc subgroup analysis is that you are often using the same data to generate and test the hypothesis.

Think: What is the repeatable experiment here?

Page 22: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Post-hoc subgroup analysisRecommendation:When evaluating evidence, try to establish whether tested hypotheses were pre-specified. If not, be very cautious about interpretation.

Page 23: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Power

The power of a hypothesis test is the probability of rejecting the null hypothesis when it is false in some specified way.

e.g., “Our study has 80% power to detect a 20 mg/dl drop in total cholesterol.”

Page 24: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Sample size

Most study designs trade off power and sample size, while keeping the Type I error rate fixed.

In many studies, “sample size” is hard to pin down:●Longitudinal studies●Adaptive designs●Cluster randomization

Page 25: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Beware large sample sizes!Budding researchers are universally warned about drawing conclusions from small samples.

But inferential statistics is very good at quantifying uncertainty in these settings.

With bigger sample sizes now feasible to collect and analyze, we need to have a conversation about the dangers of large N.

Page 26: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Statistical vs. Scientific Significance

As sample sizes increase, smaller differences between groups become “detectable” (likely to yield p < 0.05).

Effects which are statistically significant may be so small as to be scientifically insignificant.

Page 27: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Obstet Gynecol. 2010 Feb; 115(2 Pt 1): 357–364.

Page 28: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Statistical vs. Scientific Significance

Recommendation: Look at the effect estimate (and corresponding confidence interval*) in addition to the p-value.

*Confidence interval: An estimated range for the effect which should contain the true effect size (90/95/99)% of the time.

Page 29: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Wrap-up

The major “statistical” issues which arise when evaluating evidence are non-technical, mostly scientific issues.

(Bio)statisticians are trained to recognize these issues and prevent/correct them or explain their possible impact on results.

Evaluating evidence requires a team-based approach which combines statistical and domain expertise.

Page 30: Empowering Evidence: Basic Statistics June 3, 2015 Julian Wolfson, Ph.D. Division of Biostatistics School of Public Health

Thank you!