58
Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013 William Greene Department of Economics Stern School of Business

William Greene Department of Economics Stern School of Business

  • Upload
    hagen

  • View
    52

  • Download
    0

Embed Size (px)

DESCRIPTION

Empirical Methods for Microeconomic Applications University of Lugano, Switzerland May 27-31, 2013. William Greene Department of Economics Stern School of Business. 2A. Models for Count Data, Inflation Models. Agenda for 2A. Count Data Models Poisson Regression Overdispersion and NB Model - PowerPoint PPT Presentation

Citation preview

Page 1: William Greene Department of Economics Stern School of Business

Empirical Methods for Microeconomic Applications

University of Lugano, SwitzerlandMay 27-31, 2013

William GreeneDepartment of EconomicsStern School of Business

Page 2: William Greene Department of Economics Stern School of Business

2A. Models for Count Data, Inflation Models

Page 3: William Greene Department of Economics Stern School of Business

Agenda for 2A• Count Data Models• Poisson Regression• Overdispersion and NB

Model• Zero Inflation• Hurdle Models• Panel Data

Page 4: William Greene Department of Economics Stern School of Business
Page 5: William Greene Department of Economics Stern School of Business

Doctor Visits

Page 6: William Greene Department of Economics Stern School of Business

Basic Model for Counts of Events• E.g., Visits to site, number of

purchases, number of doctor visits• Regression approach

• Quantitative outcome measured• Discrete variable, model probabilities• Nonnegative random variable

• Poisson probabilities – “loglinear model”

Page 7: William Greene Department of Economics Stern School of Business

2

1

1

| ]

Moment Equations :

Inefficient but robust if nonPoisson

Ni ii

Ni i i ii

y

y

Estimati

Nonlinear Least Squares:

Maximum Likelihoo

on:

Min

x

d

ji i

i

i i i

exp(-λ )λProb[Y = j | ] =j!

λ = exp( ) = E[y

i

i

x

β'x x

1

1

log log( !)

Moment Equations :

Efficient, also robust to some kinds of NonPoissonness

Ni i i ii

Ni i ii

y y

y

Max

x

:

Page 8: William Greene Department of Economics Stern School of Business

Poisson Model for Doctor Visits

Page 9: William Greene Department of Economics Stern School of Business

Alternative Covariance Matrices

Page 10: William Greene Department of Economics Stern School of Business

Partial Effects

iE[y | ]= λi

ii

x βx

Page 11: William Greene Department of Economics Stern School of Business

Poisson Model Specification Issues• Equi-Dispersion: Var[yi|xi] = E[yi|xi].• Overdispersion: If i = exp[’xi + εi],

• E[yi|xi] = γexp[’xi]• Var[yi] > E[yi] (overdispersed)• εi ~ log-Gamma Negative binomial model• εi ~ Normal[0,2] Normal-mixture model• εi is viewed as unobserved heterogeneity (“frailty”).

Normal model may be more natural. Estimation is a bit more complicated.

Page 12: William Greene Department of Economics Stern School of Business

Overdispersion• In the Poisson model, Var[y|x]=E[y|x]• Equidispersion is a strong assumption• Negbin II: Var[y|x]=E[y|x] + 2E[y|x]2

• How does overdispersion arise:• NonPoissonness• Omitted Heterogeneity

j

u1

exp( )Prob[y=j|x,u]= , exp(x u)j!Prob[y=j|x]= Prob[y=j|x,u]f(u)du

exp( u)uIf f(exp(u))= (Gamma with mean 1)( )Then Prob[y=j|x] is negative binomial.

Page 13: William Greene Department of Economics Stern School of Business

Negative Binomial Regression

iyi ii i i i i

1 i

i i

i i i

i i i i i

( y )P(y | x ) r (1 r ) , r

(y 1) ( ) exp( )E[y | x ] Same as PoissonVar[y | x ] [1 (1/ ) ]; =1/ = Var[exp(u )]

x

Page 14: William Greene Department of Economics Stern School of Business

NegBin Model for Doctor Visits

Page 15: William Greene Department of Economics Stern School of Business
Page 16: William Greene Department of Economics Stern School of Business
Page 17: William Greene Department of Economics Stern School of Business

Negative Binomial Specification• Prob(Yi=j|xi) has greater mass to the right and left

of the mean• Conditional mean function is the same as the

Poisson: E[yi|xi] = λi=Exp(’xi), so marginal effects have the same form.

• Variance is Var[yi|xi] = λi(1 + α λi), α is the overdispersion parameter; α = 0 reverts to the Poisson.

• Poisson is consistent when NegBin is appropriate. Therefore, this is a case for the ROBUST covariance matrix estimator. (Neglected heterogeneity that is uncorrelated with xi.)

Page 18: William Greene Department of Economics Stern School of Business

Testing for OverdispersionRegression based test: Regress (y-mean)2 on mean: Slope should = 1.

Page 19: William Greene Department of Economics Stern School of Business

Wald Test for Overdispersion

Page 20: William Greene Department of Economics Stern School of Business

Partial Effects Should Be the Same

Page 21: William Greene Department of Economics Stern School of Business
Page 22: William Greene Department of Economics Stern School of Business
Page 23: William Greene Department of Economics Stern School of Business

Model Formulations for Negative BinomialPoisson

exp( )Prob[ | ] ,

(1 )exp( ), 0,1,..., 1,...,

[ | ] [ | ]

i ii i

i

i i i

i i i

iyY y

yy i N

E y Var y

x

xx x

E[yi |xi ]=λi

Page 24: William Greene Department of Economics Stern School of Business

NegBin-1 Model

Page 25: William Greene Department of Economics Stern School of Business

NegBin-P Model

NB-2 NB-1 Poisson

Page 26: William Greene Department of Economics Stern School of Business

Zero Inflation?

Page 27: William Greene Department of Economics Stern School of Business

Zero Inflation – ZIP Models• Two regimes: (Recreation site visits)

• Zero (with probability 1). (Never visit site)• Poisson with Pr(0) = exp[- ’xi]. (Number of visits,

including zero visits this season.)• Unconditional:

• Pr[0] = P(regime 0) + P(regime 1)*Pr[0|regime 1]• Pr[j | j >0] = P(regime 1)*Pr[j|regime 1]

• This is a “latent class model”

Page 28: William Greene Department of Economics Stern School of Business

Two Forms of Zero Inflation Models

ji i

i i i i

i

ji i

i i i i

i

ZIP - tau = ZIP(τ)

exp(-λ )λProb(y = j | x ) = , λ = exp( )

j!Prob(0 regime) = F( )

Zero Inflation = ZIP

exp(-λ )λProb(y = j | x ) = , λ = exp( )

j!Prob(0 regime) = F( )

β x

β x

β x

γ z

Page 29: William Greene Department of Economics Stern School of Business

An Unidentified ZINB Model

Page 30: William Greene Department of Economics Stern School of Business
Page 31: William Greene Department of Economics Stern School of Business

Notes on Zero Inflation Models• Poisson is not nested in ZIP. tau = 0 in ZIP(tau)

or γ = 0 in ZIP does not produce Poisson; it produces ZIP with P(regime 0) = ½.• Standard tests are not appropriate• Use Vuong statistic. ZIP model almost always wins.

• Zero Inflation models extend to NB models – ZINB(tau) and ZINB are standard models• Creates two sources of overdispersion• Generally difficult to estimate• Tau form is not a good model – not generally used

Page 32: William Greene Department of Economics Stern School of Business
Page 33: William Greene Department of Economics Stern School of Business

Partial Effects for Different Models

Page 34: William Greene Department of Economics Stern School of Business

The Vuong Statistic for Nonnested Modelsi,0 0 i i 0 i,0

i,1 1 i i 1 i,1

Model 0: logL = logf (y | x , ) = m Model 0 is the Zero Inflation ModelModel 1: logL = logf (y | x , ) = m Model 1 is the Poisson model(Not nested. =0 implies the splitting p

0 i i 0i i,0 i,1

1 i i 1

n 0 i i 0i 1

1 i i 12

a n 0 i i 0 0 i i 0i 1

1 i i 1 1 i i 1

robability is 1/2, not 1)f (y | x , )Define a m m log f (y | x , )

f (y | x , )1n logn f (y | x , )[a]Vs / n f (y | x , ) f (y | x , )1 log logn 1 f (y | x , ) f (y | x , )

Limiting distribution is standard normal. Large + favors model0, large - favors model 1, -1.96 < V < 1.96 is inconclusive.

Page 35: William Greene Department of Economics Stern School of Business
Page 36: William Greene Department of Economics Stern School of Business

A Hurdle Model• Two part model:

• Model 1: Probability model for more than zero occurrences

• Model 2: Model for number of occurrences given that the number is greater than zero.

• Applications common in health economics• Usage of health care facilities• Use of drugs, alcohol, etc.

Page 37: William Greene Department of Economics Stern School of Business
Page 38: William Greene Department of Economics Stern School of Business

Hurdle Model

Prob[y > 0] = F( )Prob[y=j] Prob[y=j] Prob[y = j | y > 0] = = Prob[y>0] 1 Prob[y 0| x]

exp( ) Prob[y>0]=1+exp( )exp(- Prob[y=j|y>0,x]=

Two Part Modelγ'x

A Poisson Hurdle Model with Logit Hurdleγ'xγ'x

j) , =exp( )j![1 exp(- )]F( )exp( ) E[y|x] =0 Prob[y=0]+Prob[y>0] E[y|y>0] = 1-exp[-exp( )]

β'x

γ'x β'xβ'x

Marginal effects involve both parts of the model.

Page 39: William Greene Department of Economics Stern School of Business

Hurdle Model for Doctor Visits

Page 40: William Greene Department of Economics Stern School of Business

Partial Effects

Page 41: William Greene Department of Economics Stern School of Business
Page 42: William Greene Department of Economics Stern School of Business
Page 43: William Greene Department of Economics Stern School of Business
Page 44: William Greene Department of Economics Stern School of Business

Application of Several of the Models Discussed in this Section

Page 45: William Greene Department of Economics Stern School of Business

Winkelmann finds that there is no correlation between the decisions… A significant correlation is expected … [T]he correlation comes from the way the relation between the decisions is modeled.

Page 46: William Greene Department of Economics Stern School of Business

Probit Participation Equation

Poisson-Normal Intensity Equation

Page 47: William Greene Department of Economics Stern School of Business

Bivariate-Normal Heterogeneity in Participation and Intensity Equations

Gaussian Copula for Participation and Intensity Equations

Page 48: William Greene Department of Economics Stern School of Business

Correlation between Heterogeneity Terms

Correlation between Counte

Page 49: William Greene Department of Economics Stern School of Business

Panel Data Models Heterogeneity; λit = exp(β’xit + ci)

• Fixed Effects Poisson: Standard, no incidental parameters issue NB

Hausman, Hall, Griliches (1984) put FE in variance, not the mean Use “brute force” to get a conventional FE model

• Random Effects Poisson

Log-gamma heterogeneity becomes an NB model Contemporary treatments are using normal heterogeneity with

simulation or quadrature based estimators NB with random effects is equivalent to two “effects” one time

varying one time invariant. The model is probably overspecified

Random Parameters: Mixed models, latent class models, hiererchical – all extended to Poisson and NB

Page 50: William Greene Department of Economics Stern School of Business

Random Effects

Page 51: William Greene Department of Economics Stern School of Business

A Peculiarity of the FENB Model• ‘True’ FE model has λi=exp(αi+xit’β). Cannot

be fit if there are time invariant variables.• Hausman, Hall and Griliches (Econometrica,

1984) has αi appearing in θ.• Produces different results• Implies that the FEM can contain time invariant

variables.

Page 52: William Greene Department of Economics Stern School of Business
Page 53: William Greene Department of Economics Stern School of Business

See: Allison and Waterman (2002),Guimaraes (2007)

Greene, Econometric Analysis (2011)

Page 54: William Greene Department of Economics Stern School of Business
Page 55: William Greene Department of Economics Stern School of Business

Censoring and Truncation in Count Models

• Observations > 10 seem to come from a different process. What to do with them?

• Censored Poisson: Treat any observation > 10 as 10.

• Truncated Poisson: Examine the distribution only with observations less than or equal to 10.• Intensity equation in hurdle

models• On site counts for recreation

usage.

Censoring and truncation both change the model. Adjust the distribution (log likelihood) to account for the censoring or truncation.

Page 56: William Greene Department of Economics Stern School of Business

Bivariate Random Effects

Page 57: William Greene Department of Economics Stern School of Business
Page 58: William Greene Department of Economics Stern School of Business