Rates 2007

Embed Size (px)

Citation preview

  • 8/14/2019 Rates 2007

    1/38

    1

    Estimating and Comparing

    RatesIncidence Density

    Incidence Rate Difference and RatioConfidence Intervals

    Standardized Rates and Their Comparison

  • 8/14/2019 Rates 2007

    2/38

    2

    A Definition

    Kleinbaum, Kupper and Morgenstern, EpidemiologicResearch: Principles and Quantitative Methods (1982),p.97:

    A true rate is a potential for change in one quantity per unitchange in another quantity, where the latter quantity isusually time. () Thus, a rate is not dimensionless andhas no finite upper bound i.e., theoretically, a rate can

    approach infinity.

  • 8/14/2019 Rates 2007

    3/38

    3

    Rates

    A well-known example of rate is velocity, i.e., change ofdistance per unit of time (given, e.g., in km/h). In practice, it does (should?) have an upper-bound

    We can talk about instantaneous and average rates. Example instantaneous: your car velocity at a particular time-point

    (can depend on the time-point, e.g., city and highway).

    Example average: your average speed after travelling a particular

    distance (assumed constant across the whole trip).

    In epidemiology, we usually talk about average rates.

  • 8/14/2019 Rates 2007

    4/38

    4

    Incidence/Mortality Rate

    Kleinbaum, Kupper and Morgenstern, EpidemiologicResearch: Principles and Quantitative Methods (1982),p.100:

    The incidence rate of disease occurrence is the instantaneouspotential for change in disease status (i.e., the occurrence of

    new cases) per unit of time at time t, or the occurrence ofdisease per unit of time, relative to the size of the candidate

    (i.e., disease-free) population at time t.

    We could similarly define the mortality rate.

  • 8/14/2019 Rates 2007

    5/38

    5

    Incidence Rate

    Other terms:

    an instantaneous risk (or probability);

    a hazard (especially for mortality rates);

    a person-time incidence rate; a force of morbidity.

    It is expressed in units of 1/ time.

    It is sometimes confused with risk.

  • 8/14/2019 Rates 2007

    6/38

    6

    Rates and Risks

    Assume that the incidence rate is constant over time (=),and the same for all individuals.

    The risk(probability)of developing disease in time Twillthen be equal to 1-e-T.

    Risk is sometimes called a cumulative incidence.

    In a disease-free (at time 0) cohort ofNindividuals, you would

    thus expect N(1-e-T) new cases after time T. Similarly, we could talk about the risk of death.

    Thus, formally these are two different quantities.

  • 8/14/2019 Rates 2007

    7/387

    Estimating Rates

    Rates require observations of incidence in time. Thus,they are estimated from cohort studies.

    Instantaneous rates are seldom obtained. Rather, the

    average rates are computed. The most basic estimator is the incidence density(ID):

    timepopulationaccrued

    )t,(tperiodcalendarin thecasesnewofno. 10==PT

    IID

    PTis expressed in person-years, person-days etc.

  • 8/14/2019 Rates 2007

    8/388

    Incidence Density

    A hypothetical cohort of 12 subjects. Followed for the period of 5.5 years.

    7 withdrawals among non-cases

    three (7,8,12) lost to follow-up;

    two (3,4) due to death; two (5,10) due to study termination.

    PT= 2.5+3.5++1.5 = 26.ID=5/26=0.192 per (person-) year

    or

    1.92 per 10 (person-)years.

  • 8/14/2019 Rates 2007

    9/389

    Population-Time Without Individual Data

    E.g., population-based registries.

    Person-years computed using the mid-year population.

    For rare events, periods of several years may be used.

    Ideally, one would like to use mid-year populations for each year.

    Alternatively, one can use information for several time-points, or themid-period population (these are less accurate solutions).

    One may face the problem of removing those not at risk (e.g., women forprostate cancer incidence).

  • 8/14/2019 Rates 2007

    10/3810

    Population-Time Without IndividualData: Example

  • 8/14/2019 Rates 2007

    11/3811

    Incidence Density: Remarks

    It is an estimate of an average rate.

    So we will sometimes refer to it as an incidence rate.

    Any fluctuations in the instantaneous rate are obscured andcan lead to misleading conclusions. E.g.,

    1000 persons followed for 1 year

    100 persons followed by 10 years

    produce the same number of person-years.

    If the average time to disease onset is 5 years, ID in the firstcohort will be lower.

  • 8/14/2019 Rates 2007

    12/3812

    Incidence Density: Remarks

    If applied to the whole cohort/population, sometimes calledcrude rate.

    However, sex, age, race etc. can have substantial influenceon the incidence of disease.

    Comparing crude rates for two populations, which differ

    w.r.t., e.g., age, can be misleading (confounding!).

    Therefore, usually standardized rates are compared.

    E.g., for cancer, age- and sex-standardized rates are used.

    They will be discussed later.

  • 8/14/2019 Rates 2007

    13/3813

    Confidence Interval for IncidenceDensity

    By using a Poisson model, standard error ofID=I / PTcanbe estimated by:

    ( ) 2)SE(

    PT

    IID =

    Thus, an approximate 95% CI forID is given by:

    ID 1.96SE(ID).

    99% CI: ID 2.58SE(ID) .

  • 8/14/2019 Rates 2007

    14/3814

    Estimating Incidence Densities: Example

    Postmenopausal Hormone and Coronary Heart Disease Cohort Study: Stampferet al., NEJM(1985). Involving female nurses:

    Hormone use

    105786.251477.554308.7Person-years

    906030CHDTotalNoYes

    ID1 = 30/54308.7 = 0.00055; SE(ID1) = (30/54308.72)1/2 = 0.00010

    95% CI for ID1= 0.00055 1.960.0001 = (0.00035, 0.00075)

    ID0= 60/51477.5 = 0.00116; SE(ID0) = (60/51477.52)1/2 = 0.00015

    95% CI for ID0= 0.00116 1.960.00015 = (0.00086, 0.00145)

  • 8/14/2019 Rates 2007

    15/3815

    Comparing Two Incidence Densities

    Assume data from acohort study:PTPT0PT1Pop.-time

    II0I1Cases

    TotalUnexposedExposed

    We get two estimates for non- and exposed subjects:ID0=I0/PT0 and ID1=I1/PT1.

    To compare them, we can look at Incidence rate difference: IRD = ID1- ID0.

    Incidence rate ratio: IRR= ID1/ID0.

  • 8/14/2019 Rates 2007

    16/3816

    Comparing Two Incidence Densities:Example

    Postmenopausal Hormone and Coronary Heart Disease Cohort Study: Stampferet al., NEJM(1985). Involving female nurses:

    Hormone use

    105786.251477.554308.7Person-years

    906030CHD

    TotalNoYes

    ID1 = 30/54308.7 = 0.00055; ID0= 60/51477.5 = 0.00116

    IRD = ID1- ID0= -0.00061

    IRR= ID1/ID0= 0.474

  • 8/14/2019 Rates 2007

    17/3817

    Comparing Two Incidence Densities:Poisson Model Method

    PTPT0PT1Pop.-time

    II0I1Cases

    TotalUnexposedExposed By using a Poisson model,standard error ofIRD can beestimated by:

    ( ) ( ) 21

    1

    2

    0

    0

    )SE( PT

    I

    PT

    IIRD +=

    10

    11)SE(ln

    IIIRR +=

    Standard error of ln IRRcan beestimated by:

    Thus, an approximate 95% CIforIRD is given by:

    IRD 1.96SE(IRD). 99% CI: IRD 2.58SE(IRD)

    Thus, an approximate 95% CI forIRRis given by:

    exp{ ln IRR 1.96SE(ln IRR) }

    99% CI: exp{ ln IRR 2.58SE(ln IRR) }

  • 8/14/2019 Rates 2007

    18/3818

    Comparing Two Incidence Densities:Example

    00018.054308.7

    30

    51477.5

    60)SE(

    22=+=IRD

    Hormone use

    105786.251477.554308.7Person-years

    906030CHD

    TotalNoYes

    95% CI for:

    IRD: -0.00061 1.960.00018 = (-0.00096, -0.00025)

    ln IRR: ln(0.474)1.960.22 = (-1.178, -0.315)IRR: (e-1.178, e-0.315) = (0.308, 0.729)

    Both CIs allow to reject the null hypothesis of no difference.

    ( )224.0

    30

    1

    60

    1lnSE =+=IRR

  • 8/14/2019 Rates 2007

    19/3819

    Comparing Two Incidence Densities:Test-Based Method

    PTPT0PT1Pop.-time

    II0I1Cases

    TotalUnexposedExposed 95% test-based CI forIRD canbe computed as

    IRD 1.96 SE(IRD),

    where SE(IRD)= IRD / and

    Similarly, SE(ln IRR)= (ln IRR) /

    95% test-based CI for ln IRRis

    ln IRR 1.96 (ln IRR)/

    Can be written as

    ( 1 1.96 / ) ln IRR

    95% CI forIRRis thus

    exp{ ( 1 1.96 / ) ln IRR}

    2

    10

    1

    1

    PT

    PTPTI

    PT

    PTII

    =

    Can be re-expressed as(1 1.96 / ) IRD

    99% CI: (1 2.58 / ) IRD

  • 8/14/2019 Rates 2007

    20/3820

    Comparing Two Incidence Densities:Example

    Hormone use

    105786.251477.554308.7Person-years

    906030CHDTotalNoYes

    PTPT0PT1Pop.-time

    II0I1Cases

    TotalUnexposedExposed

    2

    10

    11

    PT

    PTPTI

    PTPTII

    =

    95% test-based CI forIRD: (1 1.96/3.41) (-0.00061) = (-0.001, -0.0002)

    Close to the one based on the Poisson approximation (not in general).

    ln IRR: (1 1.96/3.41) ln(0.474) = (-1.176, -0.317)

    IRR: (e-1.176, e-0.317) = (0.309, 0.728)

    41.3

    2.105786

    5.514777.5430890

    2.1057867.543089030

    2

    =

    =

  • 8/14/2019 Rates 2007

    21/3821

    Exact Confidence Interval forIRR

    The presented CIs for ln IRR(and IRD) assume that theestimates of ln IRRvary according to the normal distribution.

    Hence their form, e.g., ln(IRR) 1.96 SE(ln IRR).

    The use of the normal distribution is an approximation. Can be problematic, especially in small samples.

    It is possible to construct a CI for ln IRRusing the exactdistribution (i.e., without approximating it by the normal).

    The CI is valid in all samples; in large samples, it is close to theapproximate CIs.

    Computation is a bit more difficult (but easily handled by computers).

  • 8/14/2019 Rates 2007

    22/38

    22

    Standardized Rates

    We will introduce the standardization w.r.t. age.

    We will assume that our population is stratifiedby age (i.e.,subdivided into age-groups).

    One needs to define age-groups (e.g., 0-4, 5-9,).

    One needs to compute age-specific rates (ID).

    Population-time and no. of cases for each age-group are required.

    There are two methods of standardization:

    Direct;

    Indirect.

  • 8/14/2019 Rates 2007

    23/38

    23

    Standardization

    Direct method Age-specific rates of the study population are applied to the age-

    distribution of the standard population

    (rates study age standard)

    Theoretical rate that would have occurred if the ratesobserved in the study population applied to the standard

    population.

    Indirect method

    Age-specific rates from the standard population are applied to theage-distribution of the study population.

    (rates standard age study)

  • 8/14/2019 Rates 2007

    24/38

    24

    Direct Standardization

    Crude Rate in study population = It/ PTt .

    Directly Standardized Rate (DSR):DSR= { (I

    1/PT

    1)N

    1+ (I

    2/PT

    2)N

    2+ (I

    3/PT

    3)N

    3} /Nt= (I

    1/PT

    1)(N

    1/N

    t) + (I

    2/PT

    2)(N

    2/N

    t) + (I

    3/PT

    3)(N

    3/N

    t).

    Make sure units are consistent!!!

    Study Population Standard Population(e.g., USA 1990)

    AgeGroup

    Observed Person-years Rate Observed Population Rate

  • 8/14/2019 Rates 2007

    25/38

    25

    Direct Standardization

    If there is no confounding, crude rate is adequate.

    DSRby itself is not meaningful it makes sense only whencomparing two or more populations. If possible, compare age-specific rates.

    The rates should exhibit more or less similar trends (also in thestandard).

    DSRdepends on the choice of the standard population. The age-distribution of the latter should not be radically different from

    the compared populations.

    There are several standard populations (e.g., for the world,

    continents etc.).

  • 8/14/2019 Rates 2007

    26/38

    26

    Indirect Standardization

    Direct standardization requires age-specific rates for allcompared populations.

    If these are not available, or they are imprecise, theindirect method is preferred.

    Both should lead to similar conclusions; if they do not, thereason should be investigated.

  • 8/14/2019 Rates 2007

    27/38

    27

    Indirect Standardization

    Standardized (Incidence or Mortality) Ratio (SIRorSMR):

    SIR = It / Ej = Observed/ Expected.

    Take Indirectly Standardized Rate (ISR) as:

    ISR = SIR (crude rate for the standard population).

    Make sure units are consistent!!!

    Study Population Standard Population

    (e.g., USA 1990)

    Age

    GroupObs Person-

    yearsRate Obs Population Rate

    Expected

  • 8/14/2019 Rates 2007

    28/38

    28

    Standardization of Rates: Example

    Infant deaths (for children less than 1 year of age) in Colorado andLouisiana in 1987.

    Colorado: 527 deaths out of 53808 life births; crude rate = 9.8 per 1000.

    Louisiana: 872 deaths out of 73967 life births; crude rate = 11.8 per 1000.

    Crude infant mortality rate for Colorado is lower than for Louisiana. In the US, infant mortality depends on race.

    USA, 1987Race LifeBirths

    %LifeBirths

    InfantDeaths

    Rate(x1000)

    Black 641567 16.8 11461 17.9

    White 2992488 78.6 25810 8.6

    Other 175339 4.6 1137 6.5

    Total 3809394 100 38408 10.1

  • 8/14/2019 Rates 2007

    29/38

    29

    Standardization of Rates: Example

    The distribution of race of new-born children is different in thetwo states.

    Infant mortality rates depend onrace.

    Race is a confounder.

    Compare race-specific infantmortality rates.

    Unclear (differences in variousdirections).

    Colorado LouisianaRace

    Life Birth % Life Births %

    Black 3166 5.9 29670 40.1

    White 48805 90.7 42749 57.8

    Other1837 3.4 1548 2.1

    Total 53808 100 73967 100

    Colorado LouisianaRace

    Rate(x1000) Rate(x1000)

    Black 16.4 17.7White 9.6 8.0Other 3.3 1.9

    Total 9.8 11.8

  • 8/14/2019 Rates 2007

    30/38

    30

    Standardization of Rates: Example

    Direct standardization: apply state- and race-specific rates to thestandard race distribution (US, 1987).

    DSRfor Colorado: 10.45 (per 1000 life births; crude: 9.8).

    DSRfor Louisiana: 9.35 (per 1000 life births; crude: 11.8).

    9.3535628.710.4539828.213809394Total

    0.09333.11.90.15578.63.30.046175339Other

    6.2823939.98.07.5428727.99.60.7862992488White

    2.9811355.717.72.7610521.716.40.168641567Black

    Rate*Ni/N

    t

    (x1000)

    Rate*Ni

    Rate

    (x1000)

    Rate*Ni/N

    t

    (x 1000)

    Rate*Ni

    Rate

    (x1000)

    Ni/ N

    tN

    i

    (Births)

    LouisianaColoradoUS, 1987Race

  • 8/14/2019 Rates 2007

    31/38

    31

    Standardization of Rates: Example

    Indirect standardization: apply race-specific rates of a standardpopulation (US, 1987) to the race-distribution of the states.

    US Colorado LouisianaRace

    Rate(x1000)

    Life Births(PTi)

    Deaths(Obs.)

    Rate*PTi(Exp. Deaths)

    Life Births(PTi)

    Deaths(Obs.)

    Rate*PTi(Exp. Deaths)

    Black 17.9 3166 52 56.7 29670 525 531.1

    White 8.6 48805 469 419.7 42749 344 367.6Other 6.5 1837 6 11.9 1548 3 10.1

    Total 10.1 53808 527 488.3 73967 872 908.8

    SMRfor Colorado: 527/488.3 = 1.08 (8% higher than the US). ISR= SMRx 10.1 = 10.9 (race-adjusted infant mortality-rate).

    SMRfor Louisiana: 872/908.8 = 0.96 (4% lower than the US).

    ISR= SMRx 10.1 = 9.7 (race-adjusted infant mortality-rate).

  • 8/14/2019 Rates 2007

    32/38

    32

    Standardization of Rates: Example

    Is it reasonable to use theadjusted rates?

    The plot of race-specific ratesshows similar trend(black>white>other).

    The distribution of race in the USis similar to the two states(white>black>other).

    Results for both standardizationmethods are similar.

  • 8/14/2019 Rates 2007

    33/38

    33

    Comparison of Directly StandardizedRates

    If we have two standardized rates, we may want to compare them.

    For the direct method, assume we have DSR1 and DSR2.

    95% CI can then be obtained using the normal approximation:

    (DSR1 - DSR2) 1.96 SE(DSR1 - DSR2) .

    99% CI: (DSR1 - DSR2) 2.58 SE(DSR1 - DSR2) .

    The standard error is given by

    ( )

    =

    2

    21 )SE(SE kt

    k IRDN

    NDSRDSR

    where IRDk is the stratum-specific intensity rate difference.

  • 8/14/2019 Rates 2007

    34/38

    34

    Comparison of Directly StandardizedRates

    Alternatively, we might look at the standardized rate ratio:

    SRR=DSR1/DSR2.

    95% CI forSRRcan be written as: SRR1 (1.96 / Z), where

    )SE( 21

    21

    DSRDSR

    DSRDSRZ

    =

    99% CI can be written as: SRR1 (2.58 / Z ).

  • 8/14/2019 Rates 2007

    35/38

    35

    Comparison of Directly StandardizedRates: Example

    US Colorado LouisianaRace

    %Births(Ni/Nt) Births(PTi) Deaths(Ii) Rate(IDi) Births(PTi) Deaths(Ii) Rate(IDi) IRDi(x1000) SE

    i(x1000)

    Black 16.8 3166 52 16.4 29670 525 17.7 -1.3 2.4White 78.6 48805 469 9.6 42749 344 8.0 1.6 0.6Other 4.6 1837 6 3.3 1548 3 1.9 1.4 1.7

    Total 100 53808 527 9.8 73967 872 11.8

    DSR1 (Colorado): 0.01045 (10.45 per 1000 life births).

    DSR2 (Louisiana): 0.00935 (9.35 per 1000 life births).

    ( ) ( ) ( ) ( ) 0006.00017.0046.00006.0786.00024.0168.0SE 22221 =++=DSRDSR

  • 8/14/2019 Rates 2007

    36/38

    36

    Comparison of Directly StandardizedRates: Example

    DSR1 = 0.01045; DSR2= 0.00935.

    DSR1 - DSR2= 0.0011.

    95% CI: 0.0011 1.960.0006 = (-0.0002, 0.002).

    CI includes 0 - we cannot reject H0 of no difference.

    SRR= DSR1 / DSR2= 1.12.

    Z= (DSR1 - DSR2) / SE = 1.83.

    95% CI: 1.121 (1.96 / 1.83)

    = (0.99, 1.26). -

    ( ) ( ) ( ) ( ) 0006.00017.0046.00006.0786.00024.0168.0SE 22221 =++=DSRDSR

  • 8/14/2019 Rates 2007

    37/38

    37

    Comparison of Indirectly StandardizedRates

    In directly standardized rates, stratum specific-rates for different studypopulations are combined using the same weights (relative stratum-sizes in the standard population).

    In indirectly standardized rates, the weights (PTi / expected Ii) differ.

    Thus, technically speaking, ISRs (SIRs) should not be compared.

    On the other hand, it is valid to ask whether SIR(orSMR) is different

    from 1.

    To do that, one can construct a 95% CI, e.g., as follows:

    SIR 1.96(observed events)/(expected events).

  • 8/14/2019 Rates 2007

    38/38

    Standardization of Rates

    Standardization is a simple way to remove effect ofconfounding.

    It can be extended to more than one confounder.

    Similar techniques can be used for differences or ratios ofrates.

    An alternative is a stratifiedanalysis (later).