Economics 105: Statistics

Economics 105: Statistics• Any questions?• No GH due Friday. For next couple classes, please read first 4 sections of Chapter 13 and Freakonomics, Chapter 5 (copy is in P:\economics\Eco 105 (Statistics) Foley\freakonomics Ch_5.pdf)

Brief Introduction to Research Design

Design Notation

Internal Validity

Experimental Design

Design Notation• Observations or measures are indicated with an “O”• Treatments or programs with an “X”• Groups are shown by the number of rows• Assignment to group is by “R,N,C”

– Random assignment to groups– Nonequivalent assignment to groups– Cutoff assignment to groups

• Time

Design Notation Example

R O1,2 X O1,2

R O1,2 O1,2

Os indicate differentwaves of

measurement.

Vertical alignmentof Os shows that

pretest and posttestare measured at same time.

X is the treatment.There are twolines, one foreach group.

R indicates the groups

are randomly assigned.

Subscriptsindicate

subsets ofmeasures.

Types of DesignsRandom assignment?

Control group or multiple measures?

No

Yes

Yes

Randomized(true experiment)

Quasi-experiment

No

Nonexperiment

Non-Experimental Designs

X O

O X O

X O O

Post-test only (case study)

Single-group, pre-test, post-test

Two-group, post-test only(static group comparison)

Experimental DesignsR O1 X O1,2

R O1 O1,2

R X O1,2

R O1,2

• Pretest-Posttest Randomized Experiment Design

• If continuous measures, use t-test

• If categorical outcome, use chi-squared test

• Posttest only Randomized Experiment Design

• Less common due to lack of pretest

• Probabilistic equivalence between groups

Experimental DesignsR O X OR O OR X OR O

• Advantages• Information is available on the effect of treatment

(independent variable), the effect of pretesting alone, possible interaction of pretesting & treatment, and the effectiveness of randomization

• Disadvantages• Costly and more complex to implement

Solomon Four-Group Design

Establishing Cause and Effect

Single-Group Threats

Multiple-Group Threats

“Social” Interaction Threats

• Internal validity is the approximate truth about inferences regarding cause-effect relationships.

• “Internal” means internal to the study, not “external”, that is, not talking about generalizing the results yet.

Internal Validity

Threats to Internal ValidityR X

OR

OHistory

MaturationTesting

InstrumentationMortality

Regression to the meanSelection

Selection-historySelection- maturation

Selection- testingSelection- instrumentation

Selection- mortality*Selection- regressionDiffusion or imitation*

Compensatory equalization*Compensatory rivalry*

Resentful demoralization*

Single-Group

Multiple-Group

Social Interaction

Single-Group Threatsto Internal Validity

Administerprogram

Measureoutcomes

X O

Two designs:

Administerprogram

Measureoutcomes

X O

Measurebaseline

O

Post-test only a single group

What is a “single-group” threat?

• Diabetes educational program for newly diagnosed adolescents in a clinic

• Pre-post, single group design• Measures (O) are paper-pencil, standardized

tests of diabetes knowledge (e.g. disease characteristics, management strategies)

Example

• Any other event that occurs between pretest and posttest

• For example, adolescents learn about diabetes by watching The Health Channel

Program Posttest

X O

Pretest

O

History Threat

• Normal growth between pretest and posttest.• They would have learned these concepts anyway,

even without program.

Program Posttest

X O

Pretest

O

Maturation Threat

• The effect on the posttest of taking the pretest• May have “primed” the kids or they may have

learned from the test, not the program• Can only occur in a pre-post design

Program Posttest

X O

Pretest

O

Testing Threat

• Any change in the test from pretest and posttest• So outcome changes could be due to different

forms of the test, not due to program• May do this to control for “testing” threat, but

may introduce “instrumentation” threat

Program Posttest

X O

Pretest

O

Instrumentation Threat

• Nonrandom dropout between pretest and posttest• For example, kids “challenged” out of program by

parents or clinicians• Attrition

Program Posttest

X O

Pretest

O

Mortality Threat

• Group is a nonrandom subgroup of population.• For example, mostly low literacy kids will appear

to improve because of regression to the mean.• Example: height

Program Posttest

X O

Pretest

O

Regression Threat

When you select a sample from

the low end of a distribution ...

the group will do better on a

subsequent measure.

The group mean on the first measure

appears to “regress toward the mean” of

the population.

Selectedgroup’smean

Overallmean

Regression to the mean

Overallmean

Regression to the Meanpre-test scores ~ N

post-test scores ~ N & assuming no effect of treatment pgm

Regression to the Mean

Regression to the Mean• How to Reduce the effects of RTM

(Barnett, et al., International Journal of Epidemiology, 2005)

1. When designing the study, randomly assign subjects to treatment and control (placebo) groups. Then effects of RTM on responses should be same across groups.

2. Select subjects based on multiple measurements

• RTM increases with larger variance (see graphs) so subjects can be selected using the average of 2 or more baseline measurements.

Multiple-Group Threats to Internal Validity

• When you move from single to multiple group research the big concern is whether the groups are comparable.

• Usually this has to do with how you assign units (e.g., persons) to the groups (or select them into groups).

• We call this issue selection or selection bias.

The Central Issue

Administerprogram

Measureoutcomes

Measurebaseline

Alternativeexplanations

Alternativeexplanations

X OOOO

Do not administerprogram

Measureoutcomes

Measurebaseline

The Multiple Group Case

• Diabetes education for adolescents

• Pre-post comparison group design

• Measures (O) are standardized tests of diabetes knowledge

Example

• Any other event that occurs between pretest and posttest that the groups experience differently.

• For example, kids in one group pick up more diabetes concepts because they watch a special show on Oprah related to diabetes.

X OOOO

Selection-History Threat

• Differential rates of normal growth between pretest and posttest for the groups.

• They are learning at different rates, even without program.

X OOOO

Selection-Maturation Threat

• Differential effect on the posttest of taking the pretest.

• The test may have “primed” the kids differently in each group or they may have learned differentially from the test, not the program.

X OOOO

Selection-Testing Threat

• Any differential change in the test used for each group from pretest and posttest

• For example, change due to different forms of test being given differentially to each group, not due to program

X OOOO

Selection-Instrumentation Threat

• Differential nonrandom dropout between pretest and posttest.

• For example, kids drop out of the study at different rates for each group.

• Differential attrition

X OOOO

Selection-Mortality Threat

• Different rates of regression to the mean because groups differ in extremity.

• For example, program kids are disproportionately lower scorers and consequently have greater regression to the mean.

X OOOO

Selection-Regression Threat

“Social Interaction” Threats to Internal Validity

• All are related to social pressures in the research context, which can lead to posttest differences that are not directly caused by the treatment itself.

• Most of these can be minimized by isolating the two groups from each other, but this leads to other problems (for example, hard to randomly assign and then isolate, or may reduce generalizability).

What Are “Social” Threats?

• Controls might learn about the treatment from treated people (for example, kids in the diabetes educational group and control group share the same hospital cafeteria and talk with one another).

Diffusion or Imitation of Treatment

• Administrators give a compensating treatment to controls.

• Researchers feel badly and give control group kids a video to watch pertaining to diabetes. Contaminates the study!

=

Compensatory Equalization of Treatment

• Controls compete to keep up with treatment group.

Compensatory Rivalry

• Controls "give up" or get discouraged

• Likely to exaggerate the posttest differences, making your program look more effective than it really is

Resentful Demoralization

What is a Clinical Trial?• “A prospective study comparing the effect and

value of intervention(s) against a control in human beings.”

• Prospective means “over time”; vs. retrospective• It is attempting to change the natural course of a

disease• It is NOT a study of people who are on drug X

versus people who are not

• http://www.clinicaltrials.gov/info/resources

http://www.clinicaltrials.gov/info/resources

Model of Two-Group Randomized Clinical Trial

What are the characteristics of a Clinical Trial?• Begins with a primary research question, and the trial

design flows from this question (constrained by practicalities)

• Everything must be exhaustively defined in advance (to prevent accusations of fishing for a positive finding)

• The hypothesis (“-es”)• Population to be studied• inclusion criteria• exclusion criteria• contraindications to therapy• indications to therapy• Treatment strategy (treatment, exact dosage, dosage

schedule, etc)• The outcome(s)

Beta-Blocker Heart Attack Trial (BHAT)• Published in Journal of the American Medical AssociationJAMA 1982; 247: 1701 - 1714JAMA 1983; 250: 2814 – 2819• Up until about 25 years ago, the treatment of myocardial

infarction consisted of bed rest, alleviation of symptomatic pain, possible administration of early antiarrhythmics

• But a third of people who have a heart attack die from it ‘suddenly’

• In 1976, NIH sponsored a conference to discuss potential agents to be used in either a primary or secondary prevention setting to reduce sudden death, for which there was no treatment.

• The conference made an official recommendation to do a clinical trial.

Beta-Blocker Heart Attack Trial (BHAT)

• Primary Research Question• To test in a multicenter, randomized, double-blind,

placebo, controlled trial, whether the daily administration of propranolol to patients who had had at least one documented MI would results in a significant reduction in all-cause mortality during 2 to 4 years of follow-up (expected mean follow-up = 36 months).

Beta-Blocker Heart Attack Trial (BHAT)• Inclusion criteria

• Men/Women• Aged 30 to 69 yrs• Documented (defined) MI within 5 to 21 days of

randomization• Exclusion criteria

• Contraindication to propranolol (e.g., asthma, severe bradycardia)

• Likely to be prescribed propranolol (e.g., for severe angina)

• Unlikely to be a compliant participant• Likely to die of noncardiac cause (e.g., cancer)

• What do these do to generalizability?

http://www.medterms.com/script/main/art.asp?articlekey=17824

BHAT Design and Conduct

1916 Patients - Propranolol

Randomized 3,837 Participants

Screened 16,400 Patients

Time

Treat and Collect Follow-up Data

1921 Patients - Placebo

Follow-up Time

Mean 2 yrs (trial stopped early)

138 Deaths

188 Deaths

Beta-Blocker Heart Attack Trial (BHAT)• Results

• BHAT (and similar trials) demonstrated great benefitin reducing all-cause mortality and cardiac-specific mortality (including sudden death)in three-quarters of Post-MI Patients (1/4 had contraindication to propranolol)

• Relevance today?• Beta-blockers still should be given post-MI• What happened after BHAT is illustrative of what often

happens a clinical trial result is published • Results reported in 1981 (short report in JAMA)

In 1987, only 36% of post-MI patients on a beta-blocker In 1989, 40% In 1992, 63% In 1993, only 33% of post-MI women

Example: Job Corps• What is Job Corps? http://jobcorps.doleta.gov/

• January 5, 2006 Thursday Late Edition – Final

SECTION: Section C; Column 1; Business/Financial Desk; ECONOMIC SCENE; Pg. 3

HEADLINE: New (and Sometimes Conflicting) Data on the Value to Society of the Job Corps

BYLINE: By Alan B. Krueger.

Alan B. Krueger is the Bendheim professor of economics and public affairs at Princeton University. His Web site is www.krueger.princeton.edu.

He delivered the 2005 Cornelson Lecture in the Department of Economics here at Davidson (that’s the big econ lecture each year).

http://jobcorps.doleta.gov/

Example: Job Corps• Quotations from “New (and Sometimes Conflicting) Data on the Value

to Society of the Job Corps” by Alan B. Krueger.

• Since 1993, Mathematica Policy Research Inc. has evaluated the performance of the Job Corps for the Department of Labor.

• Its evaluation is based on one of the most rigorous research designs ever used for a government program. From late 1994 to December 1995, some 9,409 applicants to the Job Corps were randomly selected to be admitted to the program and another 6,000 were randomly selected for a control group that was excluded from the Job Corps.

• Those admitted to the program had a lower crime rate, higher literacy scores and higher earnings than the control group.

RCT for Credit Card Offers

Source: Agarwal, et al. (2010), Journal of Money, Credit & Banking, 42 (4)

RCT for Education in India

Source: Banerjee, et al. (2007), Quarterly Journal of Economics

RCT for Education in India

Source: Agarwal, et al. (2010), Journal of Money, Credit & Banking, 42 (4)

Recommended Reading

Amazon link Amazon link

http://www.amazon.com/Poor-Economics-Radical-Rethinking-Poverty/dp/1586487981/ref=sr_1_1?ie=UTF8&qid=1331056268&sr=8-1

http://www.amazon.com/How-Doctors-Think-Jerome-Groopman/dp/0547053649/ref

Documents

Economics 105: Statistics