prev

next

out of 143

View

244Category

## Documents

Embed Size (px)

E62: Stochastic Frontier Models and Efficiency Analysis E-1

E62: Stochastic Frontier Models and Efficiency Analysis

E62.1 Introduction

Chapters E62-E65 present LIMDEPs programs for two types of efficiency analysis, stochastic frontier analysis (SFA) and data envelopment analysis (DEA). To a large extent, these are

competing methodologies. No formulation has yet been devised that unifies the two in a single

analytical framework. Arguably, the former is a fully parameterized model whereas the latter is

nonparametric, albeit also atheoretical in nature. The stochastic frontier model is used in a large literature of studies of production, cost,

revenue, profit and other models of goal attainment. The model as it appears in the current literature

was originally developed by Aigner, Lovell, and Schmidt (1977). The canonical formulation that

serves as the foundation for other variations is their model,

y = x + v - u,

where y is the observed outcome (goal attainment), x + v is the optimal, frontier goal (e.g.,

maximal production output or minimum cost) pursued by the individual, x is the deterministic part

of the frontier and v ~ N[0,v2] is the stochastic part. The two parts together constitute the

stochastic frontier. The amount by which the observed individual fails to reach the optimum (the frontier) is u, where

u = |U| and U ~ N[0,u2]

(change to v + u for a stochastic cost frontier or any setting in which the optimum is a minimum). In

this context, u is the inefficiency. This is the normal-half normal model which forms the basic form of the stochastic frontier model.

Many varieties of the stochastic frontier model have appeared in the literature. A major

survey that presents an extensive catalog of these formulations is Kumbhakar and Lovell (2000).

(See, as well, Bauer (1990), Greene (2008) and several other surveys, many of which are cited in

Kumbhakar and Lovell and in Greene.) The estimator in LIMDEP computes parameter estimates for

most single equation cross section and panel data variants of the stochastic frontier model.

A large number of variants of the stochastic frontier model based on different assumptions

about the distribution of the inefficiency term, u have been proposed in the received literature. Most of these are available in LIMDEP, as suggested in the list below. The bulk of the received

technology centers on cross section style modeling. However, recent advances include many

extensions that take advantage of the features of panel data. A large array of panel data estimators

are also supported by LIMDEP as well.

E62: Stochastic Frontier Models and Efficiency Analysis E-2

The conventional approach to deterministic frontier estimation is currently data envelopment

analysis. This is usually handled with linear programming techniques. The analysis assumes that

there is a frontier technology (in the same spirit as the stochastic frontier production model) that can

be described by a piecewise linear hull that envelopes the observed outcomes. Some (efficient)

observations will be on the frontier while other (inefficient) individuals will be inside. The

technique produces a deterministic frontier that is generated by the observed data, so by construction,

some individuals are efficient. This is one of the fundamental differences between DEA and SFA. Data envelopment analysis is documented in Chapter E65.

The analysis of production, cost, etc. in the stochastic frontier framework involves two steps.

In the first, the frontier model is estimated, usually by maximum likelihood. In the second, the

estimated model is used to construct measures of inefficiency or efficiency. Individual specific

estimates are computed that provide the basis of comparison of firms either to absolute standards or

to each other. The sections of this chapter develop several model forms used in the first step.

Efficiency estimation, the second step, appears formally in Section E62.8. The general methodology

is then used in the already developed specifications and with several proposed in the sections that

follow, as well as in Chapters E63 and E64.

E62.2 Stochastic Frontier Model Specifications

The stochastic frontier model is

y = x + v-u, u =|U|.

In this area of study, unlike most others, estimation of the model parameters is usually not the

primary objective. Estimation and analysis of the inefficiency of individuals in the sample and of the

aggregated sample are usually of greater interest. This part of the development will present tools for

estimation of inefficiency.

Typically, the production or cost model is based on a Cobb-Douglas, translog, or other form

of logarithmic model, so that the essential form is

log y = x + v - u

where the components of x are generally logs of inputs for a production model or logs of output and

input prices for a cost model, or their squares and/or cross products. In this form, then, at least for

relatively small variation, u represents the proportion by which y falls short of the goal, and has a

natural interpretation as proportional or percentage inefficiency. The numerous examples below will

demonstrate. Users are also referred to the various survey sources listed earlier.

The results one obtains are, of course, critically dependent on the model assumed. Thus,

specification and estimation of model parameters, while perhaps of secondary interest, are

nonetheless a major first step in the model building process. In nearly all received formulations, the

random component, v, is assumed to be normally distributed with zero mean. In some models, v may

be heteroscedastic. But, in either form, the large majority of the different frontier models that have

been proposed result from variations on the distribution of the inefficiency term, u. The range of

specifications examined in this chapter includes the following:

Distributional assumptions: half normal, exponential, gamma

Partially nonparametric frontier function

Sample selection model

E62: Stochastic Frontier Models and Efficiency Analysis E-3

The following extensions are presented in Chapter E63:

Truncated normal with nonzero, heterogeneous mean in the underlying U

Heteroscedasticity in v and/or u

Heterogeneity in the parameter of the exponential or gamma distribution

Amsler et al.s scaling model

Alvarez et al.s model of fixed, latent management

A number of treatments for panel data are presented in Chapter E64.

E62.3 Basic Commands for Stochastic Frontier Models

The command for all specifications of the stochastic frontier model is

FRONTIER ; Lhs = y ; Rhs = one, ... ; other specifications $

NOTE: One must be the first variable in the Rhs list in all model specifications.

The default specification is Aigner, Lovell and Schmidts canonical normal-half normal model. The default form is a production frontier model,

y = x + v - u, u = |U|.

That is, the right hand side of the equation specifies the maximum goal attainable. To specify a cost

frontier model or other model in which the frontier represents a minimum, so that

y = x + v + u, u = |U|, use

; Cost

This specification is used in all forms of the stochastic frontier model. As noted below, one

additional specification you may find useful is

; Start = values for , , .

(The meanings of the parameters are developed below.) ALS also developed the normal-exponential

model, in which u has an exponential distribution rather than a half normal distribution. To request

the exponential model, use

; Model = Exponential (or ; Model = E )

in the FRONTIER command. For this model, the parameters are (,,v). Further details appear below. There are also several model forms, and numerous modifications such as heteroscedasticity

that are developed below.

E62: Stochastic Frontier Models and Efficiency Analysis E-4

This is the full list of general specifications that are applicable to this model estimator.

Controlling Output from Model Commands

; Par keeps ancillary parameters , , etc. with main parameter vector in b. ; OLS displays least squares starting values when (and if) they are computed.

; Table = name saves model results to be combined later in output tables.

Robust Asymptotic Covariance Matrices

; Covariance Matrix displays estimated asymptotic covariance matrix (normally not shown),

same as ; Printvc.

; Choice uses choice based sampling (sandwich with weighting) estimated matrix.

; Cluster = spec requests computation of the cluster form of corrected covariance estimator.

Optimization Controls for Nonlinear Optimization

; Start = list gives starting values for a nonlinear model.

; Tlg [ = value] sets convergence value for gradient.

; Tlf [ = value] sets convergence value for function.

; Tlb [ = value] sets convergence value for parameters.

; Alg = name requests a particular algorithm, Newton, DFP, BFGS, etc.

; Maxit = n sets the maximum iterations.

; Output = n requests technical output during iterations; the level n is 1, 2, 3 or 4. ; Set keeps current setting of optimization parameters as permanent.

Predictions and