15
1/27/15 1 ALTERNATIVE RESEARCH DESIGNS “Quasi” Experiments Like experiments, but do not possessExperimental control Participants may experience differences other than in the IV Randomization Participants usually all assigned to same condition Biggest differenceQuasi-experiments don’t contain a control group “Quasi” Experiments Sometimes real experiments aren’t feasible Examining effects that rely on person-factors Cannot randomly assign people to be “extraverted,” “depressed,” “adopted,” “a pack-a-day smoker,” etc. Any experiment examining person-factors as IVs is not a true experiment Because such factors cannot be randomly assigned Having a control group may be unethical E.g., giving ill people a placebo medication

ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

1  

ALTERNATIVE RESEARCH DESIGNS

“Quasi” Experiments • Like experiments, but do not possess… • Experimental control

•  Participants may experience differences other than in the IV

• Randomization •  Participants usually all assigned to

same condition

• Biggest difference… • Quasi-experiments don’t contain

a control group

“Quasi” Experiments • Sometimes real experiments aren’t feasible

• Examining effects that rely on person-factors •  Cannot randomly assign people to be “extraverted,”

“depressed,” “adopted,” “a pack-a-day smoker,” etc.

•  Any experiment examining person-factors as IVs is not a true experiment • Because such factors cannot be randomly assigned

• Having a control group may be unethical •  E.g., giving ill people a placebo

medication

Page 2: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

2  

Types of Quasi-experiments • Single-case Design

• Unlike case studies… •  Case studies just describe behavior, life events, etc.

•  Single-case designs measure the effect of specific IVs on specific DVs (like an experiment)

•  “Single-case” doesn’t mean examining only one person •  The book discusses this as being synonymous with case-

studies; however, most single-case research examines multiple individuals

•  “Single-case” means participants are analyzed individually, instead of only paying attention to group averages

Single-case Design • Most often involves measuring DV before and after the experimental manipulation • Before manipulation = Baseline period

• After manipulation = Treatment period

• E.g., Mayberg et al., 2005

•  Treated 6 severely depressed patients with “deep brain stimulation”

•  Took multiple measures of depres-sion before and after treatment

Single-case Design • E.g., Mayberg et al., 2005

Dep

ress

ion

Sco

re

1

10

Baseline Treatment

5

Benefitted

No benefit

Individual cases

Page 3: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

3  

Single-case Design • E.g., Mayberg et al., 2005

• Quasi-experiment because…

•  Participants weren’t randomly assigned to conditions

• No control group

• Possible confounds?

•  Placebo effect

•  Just “being treated” lessened symptoms

Possible Confounds • E.g., Mayberg et al., 2005

• Possible confounds? •  Maybe 4 out of 6 patients got better on their own

• Typically 60% of people overcome mental illness without any kind of treatment • However, this usually takes longer than a few weeks

• Also, all patients had severe and previously untreatable depression

•  Baseline measures can affect patients’ depression

• Having to report all of their symptoms may motivate participants to overcome their depression

Dealing with Confounds • One way to rule out some of these confounds…

• Reversal Design – after treatment is shown to have an effect, remove treatment to see if participants return back to baseline •  Known as “ABA Design” (A = Baseline, B = Treatment)

• E.g., Mayberg et al., 2005

•  Took one patient and, without his knowledge, deactivated deep brain stimulation

•  Patient returned to baseline within 2 weeks

A B

A

Page 4: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

4  

Dealing with Confounds • Reversible Designs

• Not always ethical, especially when treatment is beneficial

• So, most treatment programs employ ABAB Design

• E.g., Mayberg et al., 2005

•  After patient returned to baseline, deep brain stimulation was re-started (again without his know-ledge)

•  He quickly returned back to treatment levels

A B

A B

Single-case Designs • Characteristics:

• Examines participants individually •  As opposed to group averages

• Most employ baseline-treatment designs •  ABA or ABAB designs

•  Lack crucial characteristics of true experiments •  Experimental control, random assignment, control group

• Most often employed to test effectiveness of treatments

Types of Quasi-experiments • Single-group Design

• Similar to single-case designs, but group averages are analyzed

• E.g., Effect of speed limits on traffic safety

•  Natural experiment: Because of oil crisis, U.S. government mandated 55mph speed limit on all highways in 1974

•  This was fully reversed in 1995 (return to baseline)

Page 5: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

5  

Single-group Design • E.g., National change in speed limit

• Does raising the national speed limit increase fatalities?

40000

40500

41000

41500

42000

42500

1994 1995 1996

Repeal

Fata

litie

s in

Driv

ing

U.S

.

Single-group Design • E.g., National change in speed limit

• Possible confounds? •  Historical changes

• E.g., Cars got bigger, more cars on the road

•  Changes in population

• E.g., different driving population, more drivers on the road

•  Random fluctuation in number of fatalities

36000 37000 38000 39000 40000 41000 42000 43000 44000 45000 46000

1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000

Single-group Design • E.g., National change in speed limit (a wider look)

• Did speed limit change really increase fatalities? Repeal

Fata

litie

s in

Driv

ing

U.S

.

Page 6: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

6  

Possible Confounds • No true control group

• Comparison groups (55mph vs. 65mpg) are one year separated, so many things could have changed

• All single-case/ group studies involve examining the same population across time • Results can be confounded by changes in sample over

time

…versus…

1994 1995

Time as a Confound • Fatalities across time:

34000

36000

38000

40000

42000

44000

46000

48000

1982

19

83

1984

19

85

1986

19

87

1988

19

89

1990

19

91

1992

19

93

1994

19

95

1996

19

97

1998

19

99

2000

20

01

2002

20

03

2004

What happened in late ‘80s / early ‘90s?

Inclusion of air bags in new cars

Time as a Confound • Fatalities across time

34000

36000

38000

40000

42000

44000

46000

48000

1982

19

83

1984

19

85

1986

19

87

1988

19

89

1990

19

91

1992

19

93

1994

19

95

1996

19

97

1998

19

99

2000

20

01

2002

20

03

2004

If speed limit change happened in 1988, researchers may have thought it decreased fatalities

Page 7: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

7  

Time as a Confound • Historical factors – changes in culture across time can affect DV, especially when study takes place over years • E.g., much changed on the roads throughout the 1990s

•  Bigger cars, more traffic/ people, less maintained roads

• Maturation factors – changes in participants across time can affect DV • E.g., different demographic of drivers on the road in

early vs. mid ’90s

Other Confounds • Testing factors – simply being exposed to baseline measures can affect the DV • E.g., exposure to depression scale can motivate

participants to get better

• E.g., study examining effect of SAT-training on SAT scores can be confounded by practice effects

… …

Other Confounds • Demand factors – simply being in a study can affect the DV • E.g., depressed patients may get better simply

because…

• They know they are being treated (placebo effect)

• Doctors believe they will get better (experimenter effect)

• They are being observed (Hawthorne effect)

• Biggest problem: There is no control group to compare them too

Page 8: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

8  

A Common Case… • DARE (Drug Abuse Resistance Education)

• Single-group design tested in various schools in 1980s

• Baseline measure of drug-use was assessed

• DARE was implemented

• Treatment measure of drug-use was assessed

•  Initial studies of DARE showed positive results

• DARE went on to be implemented in 75% of U.S. schools and 42 other countries

• Possible confounds?

Possible Confounds • Historical factors – perhaps drug-use lessened on its own in the 1980s

Possible Confounds • Historical factors – perhaps drug-use lessened on its own in the 1980s

• Maturation factors – perhaps the demographics of students in schools began to change

• Testing factors – perhaps mere exposure to baseline measures affected drug-use

• Demand factors – perhaps knowledge that school was taking part in a drug-program affected student/ teacher/ parent behavior

Page 9: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

9  

Possible Confounds • Another possible confound:

• DARE was implemented in schools that had the worst drug-record in 1980

• Why is this a problem? •  These schools were statistically most likely to improve on

their own in the following year • There is a lot of random fluctuation in school drug-use

from year to year • Schools that are the worst one year will not likely be the

worst again next year (like lightening striking twice) • Known as “regression to the mean”

Regression to the Mean • Major confound in studies where cases/ groups are selected because they are “extreme”

• Most behaviors vary randomly over time •  E.g., Rate of smoking in one individual

# of

Cig

aret

tes

Average # of cigarettes smoked per day Jan. 2010 May 2010 Sept. 2010 Dec. 2010

< 2

> 20

5

10

15 Average 10-cig. per day smoker

Regression to the Mean • Study examines effects of smoking treatment on “heavy smokers” (20 cigarettes per day)

•  If study was conducted in October, this guy could be included

# of

Cig

aret

tes

Average # of cigarettes smoked per day

Jan. 2010 May 2010 Sept. 2010 Dec. 2010

< 2

> 20

5

10

15 Average 10-cig p/day smoker

• However, he will decrease his smoking regardless of treatment

• Causes treat-ment to seem effective

Page 10: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

10  

Regression to the Mean • Not every “heavy smoker” will be a product of chance fluctuations • But if even a few of them are, their

“treatment scores” will bias the sample

• Whenever cases/ groups are selected because they are at the “extreme”… •  It’s likely some are actually average,

but happen to appear extreme at the time of the study

Example in the Classroom • Scores on an exam are a product of…

• Actual knowledge

•  Luck

• Most students have an average amount of actual knowledge •  So, those who score at the extremes likely had some amount

of luck (good or bad) playing into their score

• This luck isn’t likely to strike these same students twice, so their future scores will return to the mean

Example in the Classroom • E.g.,

• Students in PSY1 who had highest score on Exam #1

Exa

m G

rade

s

Exam #1 (Baseline)

Exam #2 Exam #3 Exam #4 < 30%

>90%

40%

50%

60%

80%

70%

Page 11: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

11  

Example in the Classroom • E.g.,

• Students in PSY1 who had highest score on Exam #1

Exa

m G

rade

s

Exam #1 (Baseline)

Exam #2 Exam #3 Exam #4 < 30%

>90%

40%

50%

60%

80%

70% Average regression

toward the mean

Example in the Classroom • E.g.,

• Students in PSY1 who had lowest score on Exam #1

Exa

m G

rade

s

Exam #1 (Baseline)

Exam #2 Exam #3 Exam #4 < 30%

>90%

40%

50%

60%

80%

70%

Example in the Classroom • E.g.,

• Students in PSY1 who had lowest score on Exam #1

Exa

m G

rade

s

Exam #1 (Baseline)

Exam #2 Exam #3 Exam #4 < 30%

>90%

40%

50%

60%

80%

70% Average regression

toward the mean

Page 12: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

12  

Example in the Classroom • Regression to the mean can cause problems

• Can create a false sense of confidence in students who score really high due to luck

• Can create a false sense of dread in students who scores really low due to luck

Another Example • Regression to the mean can cause problems

• Not commonly taken into account in our expectations

•  E.g., people who are highly intelligent owe that partly to their genes and upbringing, but also partly to luck

•  When two such people have a child, they are usually surprised to find their child is of average intelligence

• Same with attractiveness, creativity, drive, etc.

Back to DARE • So, even if there weren’t any historical, maturation, testing, or demand factors confounding the study… • Schools were likely to improve without DARE simply

because of regression to the mean

• Experimental evidence… • Researchers haven’t been able to replicate the positive

effects of DARE in controlled experimental trails

• Evaluations of long-term effects of DARE have shown it has no effect on keeping kids off drugs or out of jail (Rosenbaum & Hanson, 1998; Thombs, 2000)

Page 13: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

13  

Single Case / Group Designs • There are many similar stories of programs being widely implemented because of positive results in quasi experiments • Still preferred method of program evaluation in schools,

social/ welfare programs, and most businesses

• Advantages • Allows testing of effects when a true experiment cannot

be conducted •  E.g., all schools with drug problems desperately wanted to

be in the DARE program

Single Case / Group Designs • There are many similar stories of programs being widely implemented because of positive results in quasi experiments • Still preferred method of program evaluation in schools,

social/ welfare programs, and most businesses

• Disadvantages • Without a control group or random assignment to

conditions, there can be any number of confounds actually driving effects • Historical, maturation, testing, demand factors

• Regression to the mean

Developmental Research • Studying changes in individuals over time

• Similar to single-case/ group designs •  IVs (life events, diseases, family upbringing, etc.) cannot be

assigned by researcher (i.e., no random assignment)

•  No way to make all participants have the same experience except for variation in IV (i.e., no experimental control)

• Types: • Cross-sectional Method – participants of different age

groups are studied simultaneously •  Longitudinal Method – one group of participants are

observed as they age

Page 14: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

14  

Cross-sectional vs. Longitudinal • Advantage of Cross-sectional Designs • Much cheaper

and quicker than conducting a longitudinal study •  Studying effects

of childhood factors on adult behavior can take 20 years!

Development of participants:

Development of researcher:

Cross-sectional vs. Longitudinal • Disadvantage of Cross-sectional Designs

• Can be biased by “cohort effect” • Cohort – group of people born around the same time

• Share a common culture/ history

• Cohorts can greatly differ from one another • How is a cohort born in the ‘60s likely to differ from

one born in the ‘90s? • E.g., cross-sectional study finds that as people

become older, they show significant decreases in ability to learn new skills • Test of skills all conducted on computers

Possible Confounds • All developmental designs suffer from…

• Historical confounds – major events in history may change participants in unknown ways

• E.g., effects of stress on life outcomes may be very different depending on when in history the data was collected

Stressors of the 1940s

Stressors of the 2000s

Page 15: ALTERNATIVE RESEARCH DESIGNS · • ABA or ABAB designs • Lack crucial characteristics of true experiments • Experimental control, random assignment, control group • Most often

1/27/15  

15  

Possible Confounds • All developmental designs suffer from…

•  Test confounds – longitudinal studies usually involve participants taking the same tests over and over again

• E.g., study examining changes in IQ over time may be confounded by practice effects

• Many tests cannot be performed on children, so studies must compare childhood tests and adult tests that might not actually be measuring the same thing • Measuring Openness in children: reaction to new toys

• Measuring Openness in adults: reaction to new works of art