33
Biostatis tics Lecture 8

Hypothesis - Biostatistics

Embed Size (px)

DESCRIPTION

Understanding Hypothesis in Biostatistics for more details check our website at http://www.helpwithassignment.com/statistics-assignment-help

Citation preview

Page 1: Hypothesis - Biostatistics

Biostatistics

Lecture 8

Page 2: Hypothesis - Biostatistics

Lecture 7 Review–

Using confidence intervals and p-values tointerpret the results of statistical analyses

• • 

• 

Null hypothesisP-value

Interpretation of confidence intervals & p- values

Page 3: Hypothesis - Biostatistics

Null hypothesis

•  A null hypothesis is one that proposes there isno difference in outcomes

•  We commonly design research to disprove a nullhypothesis

Page 4: Hypothesis - Biostatistics

P-value:- comparing two groups

What is theprobability (P-

value) of finding the

observed difference

How likely is itwe would see

adifference this big

IFIF

The null hypothesisis true?

There was NO realdifference betweenthe populations?

Page 5: Hypothesis - Biostatistics

Interpretation of p-values1!

Weak evidence againstthe null hypothesis0.1!

Increasing evidence againstthe null hypothesis with

decreasing P-value0.01!

0.001!Strong evidence against

the null hypothesis

0.0001!

P-v

alue

!

Page 6: Hypothesis - Biostatistics

Objective To assess the effect of combined hormone replacement therapyon health related quality of life.

Design Randomised placebo controlled double blind trial.

(HRT)

Ta ble 3EuroQoL Visual Analogue Scores (EQ-VAS) by treatment group.Figures are means (SE)

one year (95%

Combined HRT

(n=1043*)

Placebo

(n=1087*)Adjusted

difference at

CI)

P-value

EQ-VAS 77.9 (0.5) 78.5 (0.4) -0.59

(-1.66 to 0.47)

0.28

Page 7: Hypothesis - Biostatistics

Five trials of drugs to reduce serum cholesterol

A reduction of 0.5 mmol/L or more correspondsto a clinically important effect of the drug

Trial Drug Cost No. of patients

per group

Observed difference in mean

cholesterol (mmol/L)

s.e. of difference (mmol/L)

95% CI for population

difference in mean

cholesterol

P-value

1 A Cheap 30 -1.00 1.00 -2.96 to 0.96 0.32

2 A Cheap 3000 -1.00 0.10 -1.20 to -0.80 <0.001

3 B Cheap 40 -0.50 0.83 -2.13 to 1.13 0.55

4 B Cheap 4000 -0.05 0.083 -0.21 to 0.11 0.55

5 C Expensive 5000 -0.125 0.05 -0.22 to -0.03 0.012

Page 8: Hypothesis - Biostatistics

Lecture 8 – Proportions andintervals

Binary variables (RECAP)

confidence

• 

•  Single proportion–  Standard error, confidence interval

•  Incidence & prevalence

•  Difference in two proportions–  Standard error, confidence interval

Page 9: Hypothesis - Biostatistics

Categorical variables - Binary

Binary variable – two categories only

(also termed – dichotomous variable)

Examples:-

  Outcome – Diseased or Healthy; Alive

or Dead…

  Exposure - Male or Female; Smoker or non-smoker;

Treatment or control group….

Page 10: Hypothesis - Biostatistics

Inference

Proportion of population diseased – π??

Proportion of sample diseased, p=d/n

Number of subjects who do experience outcome (diseased) = dNumber of subjects who do not experience outcome (healthy) = h

Total number in sample = n = h + d

Page 11: Hypothesis - Biostatistics

Inference - example

Proportion of population with vivax malaria - π

Proportion of sample with vivaxp = d/n = 15/100 = 0.15 (15%)

malaria,

Number of sample with vivax malaria = d = 15Number of sample without vivax malaria = h = 85Total number in sample = n = 15 + 85 = 100

Page 12: Hypothesis - Biostatistics

Single proportion - Inference

• Obtain a sample estimate, p, of the population proportion, π

• REMEMBER different samples would give different estimatesof π (e.g. sample 1 p1, sample 2 p2,…)

• Derive:

– Standard error

– Confidence interval

Page 13: Hypothesis - Biostatistics

Standard error & confidence intervalof a single proportion

• Standard error (SE) for single proportion:-(from the Binomial distribution)

π (1 − π ) p(1 − p)s.e.( p ) = ~

n n

• 95% CI for single proportion:-(approximate method based on the normal distribution)

Lower limit = p - 1.96×s.e.(p)

Upper limit = p + 1.96×s.e.(p)

Page 14: Hypothesis - Biostatistics

Standard error & confidence intervala single proportion – malaria exampleof

Estimated proportion of vivax

Standard error of p

malaria (p) = 15/100 = 0.15

p(1 − p)

0.15(1 − 0.15)s e ( p ). . = = 0.036=

n 100

• 95% Confidence interval for population proportion (π)

Lower limit = p - 1.96×s.e.(p) = 0.15 – 1.96×0.036 = 0.079

Upper limit = p + 1.96×s.e.(p) = 0.15 + 1.96×0.036 = 0.221

Interpretation..

“We are 95% confident, the population proportion of people

vivax malaria is between 0.079 and 0.221

(or between 7.9% and 22.1%)”

with

Page 15: Hypothesis - Biostatistics

Definition of a confidenceREMEMBER…..

interval

If we were to draw several independent,

random samples (of equal size) from the

sample population and calculate 95%confidence intervals for each of them, 0.

4

0.35

0.3

Population0.2

5

then on average 19 out of every 20 (95%)

such confidence intervals would

contain the true population

proportion (π), and one of every 20

0.2

0.15

0.1

(5%) would not.0.05

0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Sam ple

Sam

ple

pro

po

rtio

n a

nd

95%

CI proportion = 0.16

(16%)

Page 16: Hypothesis - Biostatistics

WARNING….Confidence Interval of a

single proportionThe normal approximation method breaks

down1)

2)

if:Sample

Sample

size (n) is small

proportion (p) is close to 0 or 1

Require:

np ≥ 10 or n(1-p) ≥ 10

Stata lets you calculate an ‘exact’ CI

Page 17: Hypothesis - Biostatistics

Confidence Interval for a single proportion in Stata

• cii 100 15

•••

-- Binomial Exact --

[95% Conf. Interval]Variable | Obs Mean Std. Err.

-------------+---------------------------------------------------------------

| 100 .15 .0357071 .0864544 .2353075

• cii 100 1

••

-- Binomial Exact --

[95% Conf. Interval]Variable | Obs Mean Std. Err.

-------------+---------------------------------------------------------------

| 100 .01 .0099499 .0002531 .0544594

Page 18: Hypothesis - Biostatistics

Interpretation of proportions:

Incidence versus Prevalence

Page 19: Hypothesis - Biostatistics

Prevalence

Proportion of people in a defined population that have a given disease at a specified point in time

Prevalence = no. of people with the disease at particular point in time

no. of people in the population at a particular point in time

Examples:-

• Prevalenceliving in the

Prevalence

Thailand.

Prevalence

of chronic pain among people aged 25+ years andGrampian region, UK.

of typhoid among villagers living in Tak province,•

• of diagnosed asthma in individuals aged 15 to 50

years, registered with a particular general practice in Carlton.

Page 20: Hypothesis - Biostatistics

Incidence risk(Cumulative incidence)

Proportion of new cases in a disease free population in a given time period

Incidence risk = no. of new cases of disease in a given time period

no. of people disease-free at beginning of time period

Examples:-

• Incidence risk of death in five years following diagnosis withprostate cancer

Incidence risk of breast cancer over 10 years of follow-up in

women 40-69 years of age and free from breast cancer in

1990

Page 21: Hypothesis - Biostatistics

Incidence rate(NOT a proportion)

Number of new cases in a disease free population per person per unit time

• that occur

Incidence rate = no. of new cases of disease

total person-years of observation

Examples:-

• Incidence rate of all-cause mortality of men in the Melbourne

Collaborative Cohort Study = 9.0 per 1000 men per year

‘9 out of every 1000 men die each year’

(

Page 22: Hypothesis - Biostatistics

Comparing two proportions

Page 23: Hypothesis - Biostatistics

Comparing two proportions2×2 table

•••

ProportionProportionProportion

of all subjects experiencing outcome, p = d/nin exposed group, p1 = d1/n1

in unexposed group, p0 = d0/n0

Be alert (not alarmed): watch for transposing the table and swapping columns or rows

With outcome

(diseased)

Without outcome

(disease-free)

Total

Exposed

(group 1)

d1 h1 n1

Unexposed

(group 0)

d0 h0 n0

Total d h n

Page 24: Hypothesis - Biostatistics

Comparing two proportionsExample:- TBM trial (Thwaites GE et al 2004)

Adults with tuberculous meningitis randomly allocated intotreatment groups:

2

1.

2.

Dexamethasone

Placebo

Outcome measure: Death during nine months following start of

treatment.

Research question:

Can treatment with dexamethasone reduce the risk of deathadults with tuberculous meningitis?

among

Page 25: Hypothesis - Biostatistics

Comparing two proportionsExample – TBM trial

Death during 9 months post start of treatment

Treatment group Yes No Total

Dexamethasone

(group 1)

87 187 274

Placebo

(group 0)

112 159 271

Total 199 346 545

Page 26: Hypothesis - Biostatistics

Difference in two population proportions, π1-π0

Estimate of difference in population proportions = p1 – p0

Example:- TBM trial

Dexamethasone

p1 = d1/n1 = 87/274 = 0.318

Placebo

p0 = d0/n0 = 112/271 = 0.413

p1 – p0 = 0.318 – 0.413 = -0.095 (or -9.5%)

Page 27: Hypothesis - Biostatistics

Difference in two proportions - Inference

• Obtain a sample estimate, p1-p0, of the difference in population proportions, π1Dπ0

• REMEMBER different samplesof π1Dπ0 (e.g. sample 1 p11-p10,

would give different estimatessample 2 p21-p20,…)

• Derive:

– Standard error of difference in sample proportions

– Confidence interval of difference in population proportions

Page 28: Hypothesis - Biostatistics

Standard error & confidence intervalfor difference between two

proportions• Standard error (SE) for difference between sample proportions:-

[s.e.( p )]2 + [s.e.( p )]2s.e.( p ) =− p1 0 1 0

• 95% CI for difference between population

Lower limit = (p1-p0) - 1.96×s.e.(p1-p0)

Upper limit = (p1-p0) + 1.96×s.e.(p1-p0)

proportions:-

Page 29: Hypothesis - Biostatistics

Standard error & confidence interval

for difference between two proportions

Example:- TBM trial

Estimate of difference in population proportions

= p1-p0 = -0.095

s.e.(p1-p0) = 0.041

95% CI for difference in population proportions (π1-π0):

-0.095 ± 1.96×0.041

-0.175 up to -0.015 OR -17.5% up to -1.5%

Interpretation:-

“We are 95% confident, that the difference in population proportions is

between -17.5% (dexamethasone reduces the proportion of deaths by a

large amount) and -1.5% (dexamethasone marginally reduces the

proportion of deaths)”.

Page 30: Hypothesis - Biostatistics

Comparing proportions usingcsi 87 112 187 159

Stata

| Exposed Unexposed | Total-----------------+------------------------+------------

Cases |

Noncases |87

187112

159|

|199

346-----------------+------------------------+------------

Total |

||||

274 271 |

||||

545

Risk .3175182 .4132841 .3651376

Point estimate [95% Conf. Interval]|------------------------+------------------------

Risk difference

Risk ratio Prev.

frac. ex. Prev.

frac. pop

|

|||

-.0957659

.7682808

.2317192

.1164974

|

|||

-.1762352 -.0152966.6139856

.0386495.9613505

.3860144

+-------------------------------------------------chi2(1) = 5.39 Pr>chi2 = 0.0202

Remember the warning about how the table is presented-Stata requires presentation with outcome by rows and exposure by columns

Results are close to those obtained by hand

Page 31: Hypothesis - Biostatistics

Difference between two proportions:-

Risk difference

Example:- TBM trial

Outcome measure: Death during nine months

treatment.

following start of

Dexamethasone

p1 (incidence risk) = d1/n1 = 87/274 = 0.318

Placebo

p0 (incidence risk) = d0/n0 = 112/271 = 0.413

p1 – p0 (risk difference) = 0.318 – 0.413 = -0.095 (or -9.5%)

Page 32: Hypothesis - Biostatistics

Lecture 8 – Objectives

• Define binary variables, prevalence and incidence risk

• Calculate and interpret a proportion and 95% confidenceinterval for the population proportion

• Calculate and interpret the difference in sample proportionsand 95% confidence interval for difference in population proportions

Page 33: Hypothesis - Biostatistics

Thank You

www.HelpWithAssignment.com