6
Political Methodology: Applied Statistics in Political Science Kosuke Imai Department of Politics Statistics & Machine Learning @ Princeton Symposium February 18, 2011 Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 1 / 12 Political Methodology Applied statistics in political science Relatively young but fast growing field: The 1st annual summer meeting in 1984 The 28th annual summer meeting at Princeton this summer The 1st issue of Political Analysis published in 1989 The most cited journal among over 100 political science journals Influence from many other fields Examples: Econometrics: instrumental variables methods Psychometrics: item response theory Biostatistics: survival analysis Computer science: analysis of text and speech Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 2 / 12

Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

Political Methodology:Applied Statistics in Political Science

Kosuke Imai

Department of Politics

Statistics & Machine Learning @ Princeton Symposium

February 18, 2011

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 1 / 12

Political Methodology

Applied statistics in political science

Relatively young but fast growing field:The 1st annual summer meeting in 1984The 28th annual summer meeting at Princeton this summerThe 1st issue of Political Analysis published in 1989The most cited journal among over 100 political science journals

Influence from many other fieldsExamples:

Econometrics: instrumental variables methodsPsychometrics: item response theoryBiostatistics: survival analysisComputer science: analysis of text and speech

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 2 / 12

Page 2: Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

Current Research Projects

1 Program EvaluationMexican universal health care program (a.k.a. Seguro Popular)Nigerian conditional oil-revenue transfer program

2 Statistical Analysis of Causal MechanismsHow, not just whether, does treatment causally affects outcome?Causal mediation analysis, natural direct and indirect effectsIdentification, inference, sensitivity analysis, experimental designs

3 Estimation of Treatment Effect HeterogeneityWhich treatment (combination of treatments) works best for whom?Qualitative treatment-covariate/treatment-treatment interactionsUse of machine learning methods

4 Survey Methodology for Asking Sensitive QuestionsHow to elicit truthful answers to sensitive survey questions?Item count technique (list experiments), endorsement experimentsMeasuring support for militant groups in Afghanistan and Pakistan

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 3 / 12

Estimation of Treatment Effect Heterogeneity

Motivating Application: Optimal Get-out-the-vote CampaignsNon-partisan: maximize turnoutPartisan: maximize probability of winning

Numerous GOTV field experiments with various mobilizationstrategies

Modes: phone, personal visit, postcard, text message, etc.Messages: civic duty, close election, social pressure, etc.

Question: Which mobilization strategy (combination of strategies)is effective for which voter?

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 4 / 12

Page 3: Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

Initial Results based on Classification Trees

Maximum Proportion of Voters Contacted

Ove

rall

Tur

nout

Incr

ease

0% 20% 40% 60% 80% 100%

0.00

0.01

0.02

0.03

ATE Strategy

Optimal Strategy

● ●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

Maximum Proportion of Voters Contacted

Pro

babi

lity

of R

epub

lican

Win

ning

Optimal Strategy

ATE Strategy

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

Maximum Proportion of Voters Contacted

Act

ual P

ropo

rtio

n of

Vot

ers

Con

tact

ed

OptimalStrategy

ATE Strategy

Challenge: Treatment-covariate interactions tend to beoverwhelemed by covariate main effects

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 5 / 12

Development of Alternative Methodology

Basic problems:1 Variable selection: finding qualitative interactions2 Subset selection: finding “marginal” voters

Support Vector Machine with two separate LASSO constraints:

yi = X>i︸︷︷︸

other effects

β + Z>i︸︷︷︸

interactions

γ

with the following loss function

1n

n∑i=1

|1− yi yi |+︸ ︷︷ ︸subset selection

+λx

k∑j=1

|βj |+ λz

m∑j=1

|γj |︸ ︷︷ ︸variable selection

where yi ∈ {−1,1}

Development of optimization algorithmComparison with Classification Trees, BART, and Boosting

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 6 / 12

Page 4: Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

Survey Methodology for Sensitive Questions

Political scientists use surveys to study sensitive issues such asracial prejudice and corruptionDirect questioning =⇒ social desirability bias and nonresponse

Application in progress: Measuring citizens’ support for foreignforces and Taliban in AfghanistanDirect questioning =⇒ you will get lies, nonresponse, and killed

Violence levels

[0,50](50,100](100,200](200,300](300,500](500,4e+03]

● ●●

● ●

●●

●●

●●●

●●●●

●●

●●

● ●

●●

● ●

● ●

●●

●●

●●

●●●●●

●●

●●

●●

●●

●●

●●

●●

●●

Herat

Mazar−e Sharif

Kandahar

Kabul

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 7 / 12

Item Count Technique

Use aggregation to protect privacyRandomize the sample into the “treatment” and “control” groupsThe script for the control group:

Now I’m going to read you three things that sometimesmake people angry or upset. After I read all three,just tell me HOW MANY of them upset you. (I don’twant to know which ones, just how many.)

(1) the federal government increasing the tax ongasoline;(2) professional athletes getting million-dollar-plussalaries;(3) large corporations polluting the environment.

How many, if any, of these things upset you?

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 8 / 12

Page 5: Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

Item Count Technique

Use aggregation to protect privacyRandomize the sample into the “treatment” and “control” groupsThe script for the treatment group:

Now I’m going to read you three things that sometimesmake people angry or upset. After I read all three,just tell me HOW MANY of them upset you. (I don’twant to know which ones, just how many.)

(1) the federal government increasing the tax ongasoline;(2) professional athletes getting million-dollar-plussalaries;(3) large corporations polluting the environment.(4) a black family moving next door to you.

How many, if any, of these things upset you?

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 9 / 12

Comparison of Direct and Indirect Quetioning

"black leaders asking the government for affirmativeaction"

Est

imat

ed P

ropo

rtio

n / D

iffer

ence

in P

ropo

rtio

ns

−0.2

−0.1

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Democrats Republicans Independents

List

Direct

DifferenceList − Direct

List

Direct

DifferenceList − Direct

ListDirect

DifferenceList − Direct

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 10 / 12

Page 6: Political Methodology: Applied Statistics in Political Science · 2018-08-15 · Political Methodology Applied statistics in political science Relatively young but fast growing field:

Methodological Development and Future Agenda

Assumptions:1 No Design Effect: Addition of sensitive item does not change

responses to control items2 No Liar: Respondents provide truthful response to sensitive item

What we have developed so far:1 multivariate regression analysis methods2 statistical tests to detect violations of the assumptions3 statistical methods to model deviations from the assumptions4 R package that implements these methods

Next steps:1 extension to a hierarchical model2 spatial pattern of support for Taliban and foreign forces

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 11 / 12

About Us

Most political scientists analyze data but few focuses onmethodological research

Marc Ratkovic:Visiting Ph.D. student from Wisconsin finishing up Ph.D.Soon to be a postdoctoral fellow at PrincetonResearch interests: high-dimensional problems in political science

Teppei Yamamoto:5th year graduate student finishing up Ph.D.Soon to be an assistant professor at MITResearch interests: causal inference, modeling of election data

Where we are: the ground floor of CorwinWeekly political methodology seminar: Friday noon in Corwin 127

Kosuke Imai (Department of Politics) Political Methodology SML @ Princeton 12 / 12