21
Nonresponse Bias Correction in Telephone Surveys Using Census Geocoding: An Evaluation of Error Properties Paul Biemer RTI International and University of North Carolina Andy Peytchev RTI International

Paul Biemer RTI International and University of North Carolina Andy Peytchev RTI International

  • Upload
    lola

  • View
    45

  • Download
    0

Embed Size (px)

DESCRIPTION

Nonresponse Bias Correction in Telephone Surveys Using Census Geocoding: An Evaluation of Error Properties. Paul Biemer RTI International and University of North Carolina Andy Peytchev RTI International. Estimating the Population Mean in an RDD Survey. TOTAL SAMPLE. Respondents { R }. - PowerPoint PPT Presentation

Citation preview

Page 1: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

Nonresponse Bias Correction in Telephone Surveys Using Census Geocoding:

An Evaluation of Error PropertiesPaul Biemer

RTI International and University of North Carolina

Andy PeytchevRTI International

Page 2: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

2

Estimating the Population Mean in an RDD Survey

Respondents {R} Nonrespondents

TOTAL SAMPLE

Nonrespondents {NR}

{ } { } { } { }R R NR NRy p y p y

mean of NRs is unknown

Page 3: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

3

Methods for Adjusting and Evaluating for RDD Nonresponse

Very limited information on NRs in RDD surveys Post-stratification adjustments are the norm

Effectiveness at reducing bias is questionable at best Bias is sometimes evaluated using

Maximum followup effort approaches Only evaluates reduction in bias due to slight elevations in response rates

Comparison to external gold standard estimates Limited in scope to a few characteristics

Census block group geocoding Error properties are largely unknown

The focus of this research

Page 4: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

4

Census Geocoding (CG) Method

Obtain the addresses for nonrespondents (50-60% “success” rate)

Geocode addresses Link to census aggregate data

Address matched: link unit to census block group (CBG) via geocode

Exchange matched: link to census tract (CT) via telephone number

Substitute the corresponding CBG or CT mean for the nonrespondent’s characteristic

Page 5: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

5

Estimating the Population Mean in an RDD Survey

Respondents {R} Nonrespondents

TOTAL SAMPLE

Nonrespondents {NR}

{ } { } { } { }R R NR NRy p y p y

mean of NRs is unknown

Page 6: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

6

Impute Nonrespondent Characteristics from Census Aggregate Data

Respondents {R} Nonrespondents

TOTAL SAMPLE

Nonrespondents {NR}

{ } { } { } { }NR RR NRp y py y

Obtained from NRs CBG or CT

Page 7: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

7

Questions Addressed by this Research

What is the bias in ?

Is a valid estimate of the bias in the

unadjusted (or post-stratified) estimator of the mean?

Does the CG method provide useful data for modeling

response propensity?

The first two questions will be addressed in today’s

presentation.

y

{ }ˆ

RB y y

y

{ }ˆ

RB y y

Page 8: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

8

Decomposition of the Bias in

Respondents Nonrespondents

Correctly matched

addresses

Incorrectly matched

addresses

Correctly matched

exchanges

Incorrectly matched

exchanges

TOTAL SAMPLE

{ }CA { }IA { }CE { }IE

{ }CA { }IA { }CE { }IE

y

Size

Expected Difference

Page 9: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

9

{ } { } { }

{ } { } { }

{ } { } { }

{ } { } { }

CA CA CA

IA IA IA

CE CE CE

IE IE IE

y y

y y

y y

y y

{ } { }

{ } { }

{ } { }

{ } { }

( )

( )

( )

( )

CA CA

IA IA

CE CE

IE IE

E p

E p

E p

E p

Page 10: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

10

Components of the Bias in

{ } { } { } { } { } { }

{ } { } { } { }

NR NR CA CA IA IA

CE CE IE IE

E y y

{ } { } { }

Bias( ) ( )

NR NR NR

y E y y

E y y

y

where

Page 11: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

11

Components of the Bias in

{ } { } { } { } { } { }

{ } { } { } { }

NR NR CA CA IA IA

CE CE IE IE

E y y

{ } { } { }

Bias( ) ( )

NR NR NR

y E y y

E y y

y

wherecorrect CBG match incorrect CBG match

correct CT match incorrect CT match

Page 12: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

12

Estimation of the Bias Components

National Comorbidity Survey Replication (NCS-R) National probability sample of 18+ in households Face to face survey with 71% response rate All addresses were geocoded

CG was applied to 8,178 responding hh’s that provided a telephone number (88% of NCS sample)

CG bias components estimated based on 41% response rate (response after 3 callbacks)

Sensitivity analysis based on three response rates: 2 callbacks 26% response rate 3 callbacks 40% response rate 5 callbacks 60% response rate

Page 13: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

13

Why is it reasonable to use a face to face survey to evaluate the CG bias in an RDD survey?

The nonresponse mechanism is not a critical factor in the assessment of the CG bias.

A survey with a relatively high response rate is needed to evaluate the bias.

Addresses are known for all sample members and can therefore be geocoded to their correct CGs.

Sensitivity analysis can be performed to assess the effect on CG bias of increasing response rates.

Page 14: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

14

Weighted Respondent Mean, True Mean, and CG Imputed Mean for Available Characteristics

Whit

eBlac

kAsia

nOthe

rHisp Male

Female

1 pers

HH

2+ pe

rs HH

< 18 H

H

Other H

H

Age 18

–24

Age 25

–34

Age 35

–49

Age 50

–59

Age 60

+

< $15,0

00

$15K

- $3

0K

$30K

–$50

K

$50K

–$75

K

≥ $7

5K

0

10

20

30

40

50

60

70

80

Respondent Mean True Mean Imputed Mean

Page 15: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

15

Weighted Respondent Mean, True Mean, and CG Imputed Mean for Available Characteristics

Whit

eBlac

kAsia

nOthe

rHisp Male

Female

1 pers

HH

2+ pe

rs HH

< 18 H

H

Other H

H

Age 18

–24

Age 25

–34

Age 35

–49

Age 50

–59

Age 60

+

< $15,0

00

$15K

- $3

0K

$30K

–$50

K

$50K

–$75

K

≥ $7

5K

0

10

20

30

40

50

60

70

80

Respondent Mean True Mean Imputed Mean

Page 16: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

16

White

Black

Asian

Other

Hisp Male

Female 1 H

H1+

HH

1+ ch

ild

No chil

d18

-2425

-3435

-4950

-59 60+

<15k

15K-30

K

30k-5

0k

50k-7

5k75

k+

-50.00%

-40.00%

-30.00%

-20.00%

-10.00%

0.00%

10.00%

20.00%

26% RR41% RR60% RR

Demographic Characteristics

by Response RateRelbias( | ) Relbias( | )R Ry y y y

Page 17: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

RTI International

17

Average Estimates of for {s} = {CA}, {IA}, {CE}, and {IE}

{ } { }s s

{ } { }s s

Bias Component

{CA} {IA} {CE} {IE}0.00%

0.10%

0.20%

0.30%

0.40%

0.50%

0.60%

0.70%

0.80%

0.90%

1.00%

26% RR41% RR60% RR

(Percentage points)

Page 18: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

RTI International

18

Average Relative Size of the Bias Components

{ }s { }| |s26% RR 41% RR 60% RR 26% RR 41% RR 60% RR

0.00%

5.00%

10.00%

15.00%

20.00%

25.00%

30.00%

35.00%

40.00%

45.00%

50.00%

{CA} {IA} {CE}

{IE}

Page 19: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

19

Conclusions

Bias in the CG estimates of NR bias is unacceptably large race, age, and income were the most biased

Major source of bias {IE} followed by {CA} (surprisingly) Approximately 75% of the cases fall into these subsets

Correctly matching to CBGs reduces the bias, but minimally Biases tend to build across components rather than netting

out. Increasing the survey response rate reduces bias in the CG

approach; relative importance of each component is stable

Page 20: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

20

Next Steps

Further characterize the CG bias by its components Consider the use of CBG and CT information obtain from

the CG method for: modeling of response propensities adjusting for nonresponse bias

Page 21: Paul Biemer RTI International and  University of North Carolina Andy Peytchev RTI International

21

EMAIL ME TO REQUEST FULL [email protected]