Upload
mcu-in
View
0
Download
0
Embed Size (px)
Citation preview
Spatial Competition with Endogenous LocationChoices: An Application to Discount Retailing1
Ting Zhu Vishal Singh
April 2007
1The authors are Assistant Professor of Marketing at the Graduate School of Business,University of Chicago and Assistant Professor of Marketing respectively at Tepper School ofBusiness, Carnegie Mellon University. The paper is based on Ting Zhu’s thesis at CarnegieMellon University. The authors can be reached at [email protected] (Singh) [email protected] (Zhu) for comments and suggestions.
Abstract
This paper examines the importance of geographical differentiation in store location deci-sions of firms in the retail discount industry. Using a novel data set that includes the storelocations and accompanying market conditions for all stores belonging to the Wal-Mart,Kmart, and Target chains, we study the factors that influence the entry and location de-cisions of those firms. The model involves an incomplete information game between thethree players where each firm has private information about its own profitability. A keyfeature of our modeling approach is that it permits asymmetries across firms in the im-pact of exogenous market characteristics and competitive interaction effects. In addition,the variations in the exogenous firm specific characteristics, such as the distances fromthe market to firms’ headquarters and the distances to the nearest distribution centersprovide the source for model identification. Results show the importance of accountingfor firm asymmetries in their response to market conditions and competition interactions.We use the parameter estimates of the payoff functions to predict the equilibrium marketstructure under a variety of market conditions that provide additional insights into thecompetitive structure of the industry.
Keywords: Entry, discrete games, location choice, retail competition, discount stores
1 Introduction
Firms employ a variety of strategies to differentiate their product offerings from compe-
tition. In the retailing sector, these strategies include the breadth and depth of product
assortment, price, promotions, merchandise quality, customer service, and so forth. How-
ever, as a well known aphorism states, the three most important decisions in retailing
are location, location, and location. Indeed, store location has long been regarded as
a primary driver of retail competition and one of the most important factor in a con-
sumer’s store choice decision (Huff 1964, Arnold, Oum, and Tigert 1983, Brown 1989,
Craig et. al. 1989). Store location decisions are particularly important since they are less
adjustable in the short-run without incurring significant costs. On the other hand, other
marketing mix elements such as price and product assortment can readily be modified in
response to competition or other changes in the market. When deciding whether to enter
a particular local market and where to locate within that market, firms consider a variety
of factors including demand and cost conditions, logistical issues, and proximity to and
size of their consumer base. In addition, firms must account for the actual and potential
presence of competitors. While favorable market conditions such as large population base
and low cost labor encourage entry, such markets are also likely to attract competitors.
In the long-run, the interaction of these different factors determines the observed entry
patterns by firms.
In this paper, we study the strategic entry and location choice decisions in the retail
discount industry. For all practical purposes, the industry involves an oligopoly comprised
of three dominant firms: Wal-Mart, Kmart, and Target. Using a novel data set that
includes store locations and associated market conditions for all stores belonging to these
three chains, we study the factors that determine firm’s entry and location choices. We
partition the United States into a series of small markets and further divide each market
into a finite number of spatially distinct retail locations that are heterogeneous in demand
and cost conditions. We model the entry decisions as the outcome of an incomplete
information game between the three players where we assume that each firm has private
information about its own profitability, but only observes the distribution of its rivals’
profitability. In this game of location choices, distance to rivals serves as a form of
product differentiation. The empirical analysis then relates the predictions of the model
to observed behavior of the three chains. The resulting estimates provide insights into
the importance of numerous factors for store profitability. Most important, the analysis
provides insights into a fundamental tension faced by these firms, namely the desire to be
1
in attractive locations while also obtaining insulation from competition through spatial
differentiation.
Our modeling approach builds on the literature on static game theoretic models where
agents make discrete choices such as entry and exit decisions1. This literature endogenizes
the competitive structure in a market such as number of firms by implicitly analyzing
the first stage of a two-stage game in which firms first decide whether or not to enter
followed by price or quantity competition. The general premise is that the observed entry
decisions of firms and the associated industry structure in a market reveal information
about the underlying economic profits. The key insight that the analyst can infer features
of these profits even without information of price, quantity, or margins, since firms will
enter a market if and only if they expect positive profits. Unlike the structural models
of supply and demand that provide marginal conditions for optimal actions, the discrete
nature of entry and exit decisions implies threshold conditions for players’ unobserved
profits in order for a particular outcome to obtain. As such, the empirical approach to
analyze discrete games is intuitively analogous to that of single-agent discrete decision
processes in which estimation of underlying payoffs is based on inequality restrictions
that are required for a certain choice to be optimal. For example, in the marketing
literature, researchers have inferred parameters of utility functions through application
of this revealed preference argument to data on discrete choices by consumers. In both
the single-agent setting and the multi-agent game, the underlying objective is the same,
namely the estimation of unobserved latent payoffs using observed behavior. However,
whereas the single-agent problem involves an arguably independent decision, the inter-
dependencies in payoffs of agents in a game theoretic setting necessitates the use of
some equilibrium concept, most commonly Nash equilibrium or some refinement of Nash
equilibrium.
Bresnahan and Reiss (1987, 1990, 1991) and Berry (1992) provide early salvo into this
literature. They describe the general structure of such games and discuss some issues
that arise in empirical analysis of the model. One common problem endemic to models of
discrete strategic interaction is the presence of multiple equilibria in which certain values
of the underlying latent payoffs could simultaneously be consistent with more than one
1Obviously a more comprehensive model would explicitly consider dynamic entry patterns of firms.However, as noted by Reiss and Wolak (2006) in their recent review of the literature, “while someprogress has been made in formulating and solving such (dynamic) games, to date their computationaldemands have largely made them impractical for empirical work. As a consequence, almost all structuralmarket structure models are static”. In this paper we follow the literature on static entry games but (asnoted below) explicitly allow for firm heterogeneity.
2
equilibrium outcome. The presence of multiple equilibria complicates estimation since
the model does not provide exact probabilities for each outcome as would be required in
maximum likelihood estimation.2 This difficulty contrasts with the single-agent setting in
which the inequalities of the model provide a direct mapping from each possible value of
the latent payoffs to one, and only one, outcome. Two common solutions to the multiple
equilibria problem are to focus only outcomes that are common across all equilibria or
to change the structure of the game. An example of the first approach would involve
examining the total number of firms rather than their identities while the latter approach
could involve a sequential entry structure rather than a simultaneous move game. Mazzeo
(2002) extends this literature by incorporating product differentiation where a firm’s
payoff asymmetrically depends on whether a competitor is more or less similar to the
firm. However, in Mazzeo’s framework firms are restricted to choose from a few possible
product “types” as a larger choice set becomes computationally cumbersome due to the
large number of profit constraints that need to be satisfied in equilibrium. Seim (2001,
2005) solves this dimensionality problem by allowing firms to possess private information
about their own profitability. Her incomplete information approach circumvents the need
for checking a large number of entry configurations to find the equilibria and allows for a
considerably larger dimensional set of product types than is feasible in games of complete
information.
In this paper, we follow Seim (2001, 2005) by setting up the model as an incomplete
information game where players privately observe their own location-specific potential
profits, but observe only the distribution of their rivals’ profitability. Unlike previous
applications in the literature, we explicitly allow for asymmetries related to the identity
of the firms. The previous literature has assumed that firms are symmetric in terms of
their sensitivity to exogenous market conditions and in their impact on other competitors.
In other words, a firm’s profits depend on the total number of other firms in the market
rather than their actual identities beyond simple type differences as in Mazzeo (2002).
In reality, firms care about the actual identities of their competitors rather than simply
the total number of competitors3. For instance, entry by firms such as Exxon-Mobil,
Microsoft, Home Depot, or Citigroup into a geographical or product market may be
viewed by competing firms as a greater threat to profits than entry by smaller independent
2An additional complication is the complex inequalities that the model yields for a certain outcometo be an equilibrium. In terms of the stochastic structure of the model, these inequalities translate intocomplex regions of integration for the unobservables of the model. Berry (1992) introduced simulationtechniques in the context of entry games to deal with these issues.
3Kyle (2005) makes similar arguments in the context of entry in pharmaceutical drugs.
3
firms. In the current context of discount stores, such asymmetries may be crucial. For
instance, Kmart’s profit may be different when it faces a Wal-Mart rather than a Target
or when it faces a Wal-Mart supercenter rather than a Wal-Mart discount store. Similarly,
the impact of market demand and cost conditions may vary across firms as well as store
formats. As such, it is paramount to allow such asymmetries in an analysis of this
industry.
Our modeling approach allows for both competitive effects and preferences over ex-
ogenous market conditions to be firm-specific. A firm’s payoff from entering a market
and choosing a particular location depends on its expectation of the optimal location
choice of competitors, exogenous market characteristics that are common to all firms,
and other observed factors that are specific to each firm. Examples of common factors
include demand and cost conditions in the market such as population and retail wages,
while firm-specific factors could include variables related to logistical issues such as the
distance from a firm’s distribution center. In addition to the observed variables, all
firms’ profits also depend on market components that are unobserved by the analyst, but
are observed by all firms. For example, positive or negative marketwide shocks such as
availability of commercial land or restrictiveness of zoning laws are difficult to measure
and quantify, but are likely to play an important role in the entry decisions of all firms.
Finally, a firm’s payoff in each location includes a component specific to that firm and
location which could represent idiosyncratic operational costs or managerial talent that
are unobserved by both the analyst and other players.
The incomplete information model provides a tractable framework for studying the
complex interactions of optimal site selection in which a trade-off exists between prox-
imity to competitors and the desirability of certain location characteristics. The model
translates the discrete actions of competitors into smooth location choice probabilities
that represent the likelihood of competitor entry into a location rather than actual real-
izations, thus allowing for a larger set of locations (or product types) than is generally
feasible in games of complete information. The resulting Bayesian Nash equilibrium
(BNE) is derived using a nested fixed-point algorithm which is then embedded in a sim-
ulated maximum likelihood procedure to estimate the model parameters.
To implement the model, we assembled an original data set that includes all store
locations for the three largest players in the discount store industry: Wal-Mart, Kmart,
and Target. Interestingly, all three chains started in 1962 when Wal-Mart opened its first
store in Rogers, Arkansas, Kmart began operations in Garden City, Michigan, and Target
opened a store in Roseville, Minnesota. Over the next few decades, these firms expanded
4
their operations both domestically and internationally to reach dominant positions in
the retailing world. In the process, they have also experimented with several other retail
formats, most prominently the supercenter which involves a combination of a full-service
grocery store and a general merchandize outlet. The data includes the exact location
(latitude/longitude) of all stores in operation as of 2003 across the United States. We
partition the United States into a series of smaller markets and further divide each market
into smaller neighborhoods. Each store can then be mapped to one of these neighborhood
locations. Besides location, we observe certain other store characteristics such as format
and store size. In addition, we collected information on the location of both the firm’s
headquarters and its distribution centers. Distances from the distribution centers to
each market location are used as proxies to capture logistical issues in firm’s location
choice decisions. Finally, we supplement the data with detailed information collected
from the US Census Bureau and the Bureau of Labor Statistics to capture demand and
cost conditions at every geographical location.
In the empirical application, we create discrete distance bands around each location
to capture the effects of competitors and market size. The results show that population
in closer distance bands to a location contribute more to profitability than population
in distant bands. Interestingly, the parameters for distant population are significantly
larger for Wal-Mart, especially for its supercenter format, suggesting that Wal-Mart’s
trading area is larger than those of its rivals. All firms are found to prefer markets
with lower retail wages, markets where a larger proportion of families have children and
vehicles, and markets that are closer to their respective headquarters. We also find
significant differences across firms in certain demand characteristics. For instance, Wal-
Mart Supercenters are found to prefer markets with lower income levels, while Target
prefers markets with higher income and education levels. These results seem consistent
with common perceptions of these stores. For example, Target has been known to position
itself as an upscale discounter that caters to a higher income and better educated clientele
than its competitors. Similarly, lower income levels in Wal-Mart locations are consistent
with Wal-Mart’s everyday low pricing and value positioning. We also find that Wal-Mart,
particularly for its supercenter format, gives more weight to locations that are closer to
its distribution center, reflecting its logistical efficiencies.
In terms of competitive interactions, we find that proximity to competitors is an im-
portant determinant of profitability. All firms are found to exert a strong negative impact
on competitors when they are in close proximity, but the effect decreases with distance to
rivals suggesting strong returns to spatial differentiation in this industry. Looking at the
5
competitive effects across firms, we find strong asymmetries in the competitive interaction
parameters. For example, while a nearby Kmart strongly impacts Wal-Mart, that effect
is significantly lower than the corresponding impact of a Wal-Mart on Kmart’s payoffs.
Target stores are found to fare well under competition from other discount stores except
when these competitors are in particularly close proximity. Wal-Mart’s supercenter for-
mat is found to be the most formidable player as it substantially impacts competitors
even at a large distance.
We use the parameter estimates from the latent payoff functions to predict equilib-
rium market structures under a variety of market conditions. These simulations yield a
number of insights into the competitive structure of the industry. For example, as market
population and income vary with all other variables held constant, we find that Wal-Mart
is able to operate as a monopolist under a wide range of market conditions. Target is able
to enter as monopolist in certain extreme conditions, such as a relatively small population
and very high income, that generally are not attractive to its rivals. Conversely, Kmart is
rarely able to find locations that are not attractive to at least one other player. Similarly,
we find that when entry involves only discount store formats, multiple firms are able to
enter the market under moderately good demand conditions by choosing different loca-
tions within a market. On the other hand, Wal-Mart supercenter tends to deter entry by
competitors altogether unless a market becomes very attractive. This effect arises since
the supercenter format hurts even those competitors that are located far away implying
that the opportunities to mute competition through geographical differentiation are lim-
ited. However, as a market becomes more and more attractive, good demand conditions
overcome the competition effect.
The rest of the paper is organized as follows. In the next section, we describe our
modeling approach. The data used in the study are described in Section 3. Section 4
outlines the estimation strategy and presents the results. Section 5 concludes with a
discussion of limitations of the current study and directions for future research.
2 Model
In the past few years, there has been an increasing emphasis in the literature on struc-
tural empirical modeling that is industry-specific. These studies incorporate industry-
and firm-specific details to provide a deeper understanding of underlying economic prim-
6
itives comprised of demand and cost factors and competitive effects.4 In the marketing
literature, researchers have examined issues such as brand value creation and competi-
tive advantage (Besanko et al. 1998), product line competition (Kadiyali et al. 1999,
Draganska and Jain 2004), channel power (Kadiyali et al. 2000), retailer pricing (Sudhir
2001, Chintagunta 2002, Villas-Boas et al. 2005 ), and price discrimination (Besanko et
al. 2003, Chintagunta et al. 2003, Khan and Jain 2004).
A common feature of these papers is that they treat the market structure in an in-
dustry as exogenous.5 Starting with the pioneering work of Bresnahan and Reiss (1987,
1990, 1991) and Berry (1992), researchers have considered empirical analysis of game
theoretic models that endogenize the competitive structure of the industry such as num-
ber of firms and market concentration.6 Below we describe a model of the store location
choices of firms in the retail discount industry. The model is structured as an incomplete
information game following the framework originally developed in Seim (2005). How-
ever, as noted above, we will explicitly account for the identities of the firms as well as
allowing for asymmetries across firms. To illustrate, consider the two markets in Figure
1 each with a Wal-Mart supercenter and Kmart. In the approach of Bresnahan and
Reiss (1991) and Berry (1992), both markets would have the same market structure as
their model considers only the total number of undifferentiated competitors (besides the
non-strategic fixed costs in Berry’s model.) Seim’s framework would categorize these two
market structures differently due to the presence of geographic differentiation. However,
if a Wal-Mart discount store or a Target store replaced the Wal-Mart supercenter, the
market structure would not change as Seim (2005) does not consider identities of the
players. In the model presented below, we explictly account for firm identities and allow
for firm-specific effects that are related to competitive interactions and the impact of
exogenous market characteristics.
4Recent surveys include Kadiyali et al. (2001), Dube et al. (2004), Reiss and Wolak (2004), andChintagunta et al. (2005).
5In a recent paper, Chan et al. (2005) study geographic locations and price competition of gasolineretailers in Singapore. However, the location decisions are based on the assumption of social welfaremaximization by a policy planner (government) rather than strategic actions of firms.
6Applications in this literature cover a wide spectrum including airlines (Berry 1992, Ciliberto andTamer 2004), motels (Mazzeo 2002), movie rental stores (Seim 2005), eyeglass retailers (Watson 2004),fast food outlets (Toivanen and Waterson 2005), movie release timing (Einav 2004), and supermarkets(Orhun 2005).
7
2.1 Model Specification
The general model involves a game played by N players, each with a finite set of actions
Ai, where i is an index of firm, i ∈ 1, ..., N. Define the Cartesian product A =∏
i Ai
and let a = (a1, ..., aN) represent an element of A. Firms’ payoffs are a function of their
moves and market characteristics:
πi (x, ai, a−i, ε) = fi (x, a) + εi (ai) (1)
In equation (1), firm i’s payoff depends on three components: market characteristics, x,
that are publicly observed by all firms; actions of the players, a = (ai, a−i) where a−i
represents the actions of all players other than i; and an idiosyncratic shock, εi (ai), that
influences i’s payoff from action ai. The term fi (x, a) captures the way in which firm i’s
payoff depends on the behavior of its competitors.
2.1.1 Location Choice
In the empirical analysis, we wish to analyze the factors underlying entry and location
decisions in the retail discount industry. As described in the next section, we partition
the United States into a series of M small markets, and each market m into Lm small
neighborhoods or locations that vary in demand and cost conditions. A firm’s entry and
location decision depends on observed and unobserved exogenous market factors as well
as the endogenous location choices of other firms. We assume that firm i’s payoff function
at location l in market m is a linear function of market and location characteristics and
competition effects:
πiml = ximlβi −∑i′ 6=i
Lm∑k=1
δii′klai′mk + ξm + εiml (2)
i, i′ = 1, 2, ...N (firms),
m = 1, 2, ...M (markets),
l, k = 1, 2, ..., Lm (locations),
where ximl are the observed exogenous variables that could vary across different locations
and markets. In addition, these exogenous characteristics could also include firm-specific
variables such as distance to the firm’s headquarters or nearest distribution center. The
indicator variables am = (aim, a−im)N×Lmdenote firms’ location choices such that aiml = 1
when firm i chooses location l and aiml = 0 otherwise. The dimension of am is N × Lm,
8
where N is the number of players, and Lm is the number of possible locations in market
m. Besides the observed variables, profits also depend on unobserved factors. Of these,
ξm are market characteristics that are unobserved by the analyst, but are observed by
all firms. We assume that ξm is normally distributed with mean 0 and variance σ2. For
example, market-specific shocks related to the overall availability of commercial land or
the stringency of zoning restrictions are likely to play an important role in the entry
decisions of all firms, but are difficult to measure and quantify. Finally, εiml are firm-
specific unobservables representing the private information of firm i’s profitability in
location l. These factors could include idiosyncratic operational costs or managerial talent
that are unobserved by both the analyst and other players. If firm i does not enter the
market so that aiml = 0 for all l = 1, ..., Lm, we normalize the payoff as
πim0 = εim0. (3)
The unknown parameters are (βi, δii′kl, σ) where βi represents firm ı’s preference for
various market characteristics and δii′kl reflect the competitive effect that player i′ exerts
on player i when they respectively operate stores in locations k and l. As discussed
earlier, we allow all the model parameters to be firm-specific. Thus, the model permits
different firms (i) to impact competitors differently and (ii) to put different weight on
market conditions.7
Since εiml is private information, a firm cannot predict its competitors’ responses
(a−im) with certainty. Instead, a firm’s actions are based on its conjectures or expectation
of competitors’ responses. Thus, firm i’s expected payoff from choosing location l in
market m is
πiml = ximlβi −∑i′ 6=i
Lm∑k=1
δii′klE (ai′mk) + ξm + εiml (4)
where E (ai′mk) denotes the expectation of firm i′ choosing location k. Firm i chooses
the location that maximizes its expected payoff:
aiml =
1 if πiml ≥ πimk, ∀ k = 0, 1, ..., Lm
0 otherwise
If a firm’s private information εim (aim) is independently and identically distributed across
firms and locations with a type 1 extreme value distribution, the equilibrium probability
of firm i choosing location l in market m conditional on its beliefs of other firms’ behavior
7As discussed below in the context of format choices, it is also possible for the parameters to becontingent on the actions taken by firms.
9
is given by
Piml =exp
(ximlβi −
∑i′ 6=i
∑Lm
k=1 δii′klPi′mk + ξm
)1 +
∑Lm
l′=1 exp(ximl′βi −
∑i′ 6=i
∑Lm
k=1 δii′kl′Pi′mk + ξm
) . (5)
The probability of no entry in the market is then Pim0 = 1 −∑Lm
l=1 Piml. Other firms
similarly construct their beliefs and choose their best responses based on the expected
payoffs. Since firms are asymmetric in terms of their preferences of market conditions and
competition, each firm may have different probability of choosing a given location. Define
Pm = (Pim, P−im)(Lm×N) as a vector with Lm×N elements that describes the conjectures
of each location being chosen by each individual firm. In the ensuing Bayesian Nash
equilibrium, firms’ choices are consistent with their conjectures yielding a fixed-point
mapping
Pm = F (Xm, ξm, Pm; δ, β) (6)
from the probability space of firms’ beliefs to their own strategies. The equilibrium
location conjectures are summarized by a vector with Lm × N probabilities resulting
from the system of Lm ×N equations defined in equation(5).
2.1.2 Location and Format Choice
As discussed in the data section below, the players in this industry operate two store
formats: discount stores and supercenters. The supercenter format combines the general
merchandise aspect of the discount stores with a full-service grocery store. The model
described above can be extended to allow the firms to choose a store format. With Lm
available locations in a market and a choice of store format, the total number of options
available to firms are 2Lm +1,where there are Lm possible locations for the discount store
format, Lm possible locations for a supercenter, and one “no entry” option. To capture
differences in the impact of competition and exogenous characteristics across formats, the
latent payoff of player i with store format f ∈ Discount store, Supercenter at location
l in market m can be reformulated as
πimlf = ximlβif −∑f ′
∑i′ 6=i
Lm∑k=1
δii′kf ′ lf ai′mkf ′ + ξm + εimlf , (7)
As before, firms choose action that maximizes their expected latent payoff where they
now choose a format in addition to a location. In the current application, since Kmart
and Target have significantly fewer supercenters than discount stores, we restrict the
format choice to Wal-Mart.
10
2.2 Identification
Bajari et. al. (2005) discuss identification in models of incomplete information game.
They establish the sufficient conditions for model identification when the unobservables
in firms’ payoff functions are distributed i.i.d. across actions ai and agent i. Similar to
the “outside good” assumption in a single agent model, where the mean utility from a
particular choice is set equal to zero, we normalize the mean of payoff of “no entry” choice
to zero for identification. Additionally characteristics of rival firms constitute exclusion
restrictions for the source of identification. Suppose that there is some variable xi that
enters the firm i’s payoff but does not enter the payoff of other firms directly. The
exogenous variation in xi can pin down the form of strategic interaction. For instance,
distances from the local market to firm i’s nearest distribution center and headquarter are
productivity shocks that only affect firm i’s entry decisions directly but don’t affect other
firms’ payoffs. For example, two market A and B have the same market characteristics,
but A is closer to Wal-Mart’s distribution center than B is. Hence the probability of
having a store in A is larger due to the logistics efficiency. If we observe that fewer
Kmart stores in market A, then we can infer the intensity of competition by comparing
the difference of Kmart’s entry decisions in market A and B. The intuition is the difference
of Kmart’s decisions between the two markets is only driven by its expectation on Wal-
Mart’s choices rather than any other market characteristics.
2.3 Illustration of Asymmetric Competition Effects
The model presented above is quite flexible in the asymmetries that it allows in the model
parameters. In order to understand how the primitives of the model affect firms’ choices,
we conducted a series of simulations to explore how the probability of market entry and
location choice depends on the strength of competition among players. For this exercise,
we consider a simplified version the model (equation (4)) discussed in the previous section
and assume that there are two players – firm A and firm B− and two possible locations
– location 1 and location 2. We further assume that the characteristics of locations xl
is unidimensional and βi = 1 so that firms are not heterogeneous in their sensitivity
to the exogenous variable.8 In the exercise below, we normalized x1 = 0 for location 1
and change market characteristics of location 2, x2, to study how firms’ location choices
change as the attractiveness of location 2 changes.
8The marketwide random component is ignored as it does not provide any additional insight into theimportance of asymmetric effects.
11
The focus of the simulation exercise is to investigate how competition between firms
influences their decisions and how firms’ locations choices differ when they have differ-
ent competitive positions in the market (i.e., strong players vs. weak players). Define
the competition parameters δilk as the impact on firm i if it chooses location l and the
competitor chooses location k. We consider four cases that involve different competitive
interactions.9 In Case 1, there is no competition between players. In Case 2, firms are
symmetric in their competition effects. Further, the competition effect is stronger when
players are in the same location implying returns to differentiation. This case is similar
to the models considered in Seim (2005) and Watson (2004). The next two cases incorpo-
rate asymmetry in terms of competition where Firm B is a stronger player in the market
as it exerts a larger negative impact on Firm A. The difference between the two scenarios
is that in Case 3, Firm B can significantly impact A even at a distance whereas spatial
differentiation can shield A from B in Case 4.
Parametersx1 = 0, βA = βB = 1
Case 1 δill = 0, δi
lk = 0Case 2 δi
ll = −2, δilk = −1
Case 3 δAll = −3.5, δA
lk = −2, δBll = −1.5, δB
lk = 0Case 4 δA
ll = −3.5, δAlk = 0, δB
ll = −1.5, δBlk = 0
i = A, B, l, k = 1, 2 and l 6= k
In figure 2, we plot the probability of location choice and market entry as the at-
tractiveness of location 2 varies. The left panel depicts the choice probabilities of each
firm with respect to the two locations. The diamond line and the star line represent the
probability that firm A chooses location 1 and location 2, respectively. The probability
of location 1 and 2 being chosen by firm B are plotted as dashed lines and dotted lines,
respectively. The right panels summarize the overall probability of entry for two firms
with the dashed line representing firm A’s entry probability and dotted lines firm B’s.
Case 1: With no competition between firms, the model degenerates to a standard single
agent model, and both firms act identically. The choice probabilities and entry
probabilities are monotonic functions of location 2’s attractiveness.
Case 2: When firms are symmetric and the strength of competition decreases with dis-
tance, both firms have the same probability of choosing each of the two locations.
9The scenarios and parameter values are partly motivated by our empirical results.
12
The choice probability of location 2 is again a monotonic function of its attrac-
tiveness as is the overall probability of entry. However, if we contrast this case to
Case 1, we observe that entry probabilities and the probability of choosing a more
attractive location are lower in this case where competitive effects exist. This ob-
servation indicates that each player makes a trade-off between the attractiveness of
location and the intensified competition. Competition makes the “good” locations
less attractive due to concerns about entry by rivals.
Case 3: When firm B is a stronger player and can impact firm A significantly at different
distances, firm A is much less likely to enter the market relative to firm B and
is correspondingly less likely to enter either of the two individual locations. In
this scenario, firm B deters firm A from entering at all unless location 2 becomes
sufficiently attractive at which point A’s desire to avail itself to the attractive
location overcomes its wariness about competing with B.
Case 4: When firm B is a stronger player, but competitive effects are local, the choice
patterns over the two locations differ dramatically across the two firms. The
stronger player (firm B) continues to be more inclined to choose location 2 as
location 2 becomes more attractive. The behavior of the weaker player (firm A)
is more interesting. Unlike in all other cases, its choice probability of location 1 is
not monotonically decreasing in the attractiveness of location 2. The bell-shaped
function reveals that the weak firm will sacrifice a good location in order to avoid
competition when location 2 is moderately attractive. In this situation, firm A
believes that firm B is sufficiently likely to enter the relatively attractive location 2
that A is willing to enter the less attractive location in order to insulate itself from
competition with B. However, as location 2 becomes more attractive, the attractive
market conditions eventually dominate the desire to differentiate implying that firm
A will also be more likely to choose the good location. Comparing cases 3 and 4, lo-
cation differentiation will be more attractive when the relative competitive position
changes with distance. In case 3, firm A cannot shield itself from competition by
differentiation and thus is unlikely to enter at all for moderately attractive markets.
This case would thus be associated with a dominant firm monopoly in moderately
attractive markets and agglomeration to good locations in very attractive markets.
Conversely, case 4 would be more likely to yield a spatially differentiated duopoly
in moderately attractive markets with agglomeration again occurring when one
location becomes sufficiently attractive.
13
These simulations indicate the potential impact of asymmetries in competitive effects
across locations as well as heterogeneity across firms. Most importantly, they indicate the
way in which the parameters of the payoff functions are empirically identified. As market
characteristics vary, different scenarios will yield different entry patterns. Intuitively,
the empirical analysis will then attempt to identify the relevant “case” and associated
parameter values that make observed entry patterns as close as possible to those predicted
by the model.
3 Data Description
We apply the model to the discount store industry. Discount stores originated in the late
1950s to offer general merchandise items at a substantial discount relative to conventional
department stores. For the most part, the industry is currently an oligopoly with three
dominant firms: Wal-Mart, Kmart, and Target. Interestingly, all three chains started in
1962 with Wal-Mart opening its first store in Rogers, Arkansas, Kmart in Garden City,
Michigan, and Target in Roseville, Minnesota. Over the past few decades, these firms
have obtained dominant positions in the retail world while expanding their operations
both domestically and internationally. They have also experimented with several other
retail formats, most prominently the supercenter which combines a full-service grocery
store and a general merchandise outlet. In addition, these stores include several ancillary
services such as pharmacies, dry cleaning, vision centers, automotive care centers, hair
salons, and income tax preparation resources (in season) to provide consumers with a
true “one-stop” shopping environment.
Besides physical store locations considered in this study, the firms have also looked
to differentiate themselves in other dimensions. For instance, with its EDLP strategy
and the “Always Low Prices, Always” slogan, Wal-Mart has tried to establish itself as
the low price leader in the industry. It has also emphasized the supercenter format
more than Target and Kmart with approximately half of its 3,000 stores selling grocery
products. Similarly, Target has looked to place itself in a niche as an “upscale” discounter
with higher-end products. It has partnered with several designers to offer contemporary
merchandise, runs its own Target Guest credit card, and heavily promotes its nationwide
bridal and baby shower gift registries. Target’s panache is reflected in the pop-culture
use of a French-influenced pronunciation for its name (Tar-jay).10
10For a detailed background on these firms, interested readers are referred to Zhu, Singh, and Manuszak(2005) and three recent books on each firm: The Wal-Mart Decade: How a New Generation of LeadersTurned Sam Walton’s Legacy into the World’s #1 Company (Robert Slater) Portfolio Hardcover (2003);
14
In this study, we use a novel dataset that includes locations (street addresses and
zip codes) of all stores in operation as of 2003 for Wal-Mart, Kmart, and Target. In
addition, we observe store format information, discount store versus supercenter. The
total number of stores belonging to each chain is presented in Table 1. We do not consider
Sam’s Club as its main competitors are other membership clubs such as Costco and BJ’s.
Wal-Mart is the largest player with almost 3000 total stores and, as mentioned above, has
stores equally split between the two store formats. By contrast, Kmart and Target are
relatively minor players in the supercenter format. In Table 2, we show the distribution
of stores for these three chains across the country. Not surprisingly, the highest number
of stores are found in the two largest states, California and Texas. The three chains
have the highest share (in terms of number of stores) in their states of origin: Kmart in
Michigan, Wal-Mart in Arkansas, and Target in Minnesota. The chains also tend to have
a disproportionate share of stores in the states neighboring their headquarters.
Our main objective in this paper is to analyze the factors underlying the entry and
location choices of these firms and to infer competitive interaction parameters based on
their revealed preferences. For the purpose, we need to partition the United States into
small markets and each market into smaller neighborhoods. While various market def-
initions are possible, a practical consideration is the availability of data that capture
demand and cost conditions at each location. Our approach relies on census-delineated
lines to define the individual markets as well as the locations within markets. In par-
ticular, we use counties as our definition of markets and census tracts within counties
as locations.11 Census tracts are small, relatively permanent geographic entities within
counties (or the statistical equivalents of counties) delineated by a committee of local
data users and tend to be fairly homogeneous with respect to population characteristics,
economic status, and living conditions. In the empirical estimation, neighboring loca-
tions will be defined to be all locations within a given distance range which we specify
below.12
On Target: How the World’s Hottest Retailer Hit a Bullseye ( Laura Rowley) Wiley (2003); Kmart’sTen Deadly Sins: How Incompetence Tainted an American Icon, (Marcia Layton Turner) John Wiley &Sons (July 18, 2003)
11For very large urban counties (for example LA county), we use the first three digits of the tractnumber to create sub-markets. The complete census tract ID is defined as follows: SS-CCC-TTTTTT,where the first two digits “SS” represents State FIPS code, the next three digits “CCC” representsCounty FIPS Code, and the last six digits “TTTTTT” is the census tract number. This definitiondefines a market as the tracts within a county with the same first three digits in their tract numbersand all six digit codes as locations.
12Two recent applications (Seim 2005, Watson 2004) also use census tract as locations although theirdefinition of market is mid-size cities. We could alternatively define markets based on 4-digit zip codesand locations based on 5-digit zip codes. However, this approach would be problematic since zip code
15
In Table 3, we present the competitive configurations across US counties along with
some characteristics of those markets. The letters W, K, and T represent the presence
of Wal-Mart, Kmart, and Target firms in the market where, for the sake of brevity, we
have omitted information associated with store format. For instance, (0,0,0) represents
markets with no firms; (W,0,0) are Wal-Mart monopoly counties; (W,K,0) are duopoly
counties of Wal-Mart and Kmart, and so on. A few patterns are apparent from the
numbers presented in Table 3. First, markets with no firms have significantly smaller
populations compared to markets with at least one firm. Second, Wal-Mart operates as
a monopolist in a larger number of counties than its rivals. As the associated market
populations suggest, this reflects its ability to operate in smaller markets. Third, Tar-
get stores generally operate in markets with larger population and significantly higher
incomes compared to Wal-Mart and Kmart which is consistent with the anecdotal ev-
idence discussed above. Finally, the population numbers generally indicate that larger
markets are required for additional entry.
Table 4 lists the full set of variables used in the estimation where we separate the
variables that are observed at the market, location, and firm level. We also present the
overall averages and means by MSA and rural Non-MSA markets. The average market
size in our data is approximately 700 square miles although the rural markets tend to
be larger in size than MSA markets. In terms of population we find the opposite as
MSA markets have double the number of people reflecting that the urban areas are quite
densely populated compared to rural markets. We can also see that MSA markets tend
to have higher retail wages and more retail establishments. The next row shows the
number of census tracts within a market. The average number of locations in a market
ranges from approximately six in the rural markets to over nine in MSA markets.
The next panel of the table describes the location-specific variables. The first two
rows are the populations within a two and a two-to-ten mile radius of the location. We
find a similar pattern to the market-level characteristics as MSA markets have signifi-
cantly larger population numbers than the non-MSA markets. The next three variables
%Novehicle, %Child, and HHsize are the percent of population with no vehicles, per-
cent of population with children in the family, and household size respectively. These
three variables do not differ substantially across MSA and rural markets. However, MSA
locations do tend to have higher levels of income and house values as well as a higher
proportion of younger and educated population.
The last set of variables are firm-level variables that are specific to each location. In
classification yields markets that are not contiguous.
16
particular, the data include distances from locations to the headquarters and distribution
centers of firms. These variables were created to proxy for logistical issues in a firm’s
location decisions and, as we shall see below, tend to be quite important.
4 Empirical Application
4.1 Estimation Issues
The model described in (4) is very flexible as it allows for competition effects to be
asymmetric across firms and for the intensity of competition to depend on proximity.
However, the general model has a large number of competitive parameters (the δ’s) to be
estimated. Without any constraints the number of competitive interaction parameters is
Lm × Lm × N × (N − 1) which grows quickly even with a small number of players and
locations. For instance, with three players and ten locations, the number of competition
parameters is 600. Clearly, some restrictions on the number of parameters are required.
To deal with this difficulty, we create discrete distance bands and assume that the com-
petitive intensity between players is the same as long as they are within the same band.13
In the estimation, we categorize the distance between locations into three bands: less
than 2 miles, 2 to 10 miles, and more than 10 miles. Note that this approach implies
that if δii′0 is the competitive impact of firm i at location k1 that is within 2 miles from
location l chosen by player i′, then the competitive effect is still δii′0 were firm i to choose
another location k2 that is also within 2 miles of l. In other words, in this framework
the intensity of competition is determined by the distance between locations and not
the identities of those locations. The advantage of the approach is that the number of
parameters reduces significantly. For instance, using the three distance bands described
above, the total number of parameters is reduced to 3× 3× 2 = 18.
The model can be summarized as follows:
πiml = ximlβi −∑i′ 6=i
B−1∑b=0
Lm∑k=1
δii′bai′mk1 (k ∈ bl) + ξm + εiml (8)
13We assume that firms are located at the centroid of the location, i.e., census tract. We calculate thedistance between locations using the Haversine Formula. Based on latitude-longitude coordinate data,the distance between two points, a and b, is given by
da,b = 2R arcsin[min
((sin (0.5 (latb − lata)))2 + cos (lata) cos (latb) (sin (0.5 (lonb − lona)))2
)0.5
, 1]
where R = 3961 miles denotes the radius of the earth. A similar approach of creating competitive effectsusing distance bands is used by Seim (2005) and Watson (2004).
17
where B is the total number of bands and 1 (k ∈ bl) is an indicator function with
1 (k ∈ bl) =
1 if db ≤ dlk < db+1
0 otherwise, (9)
where db and db+1 denote cut-offs that define a distance band. In other words, 1 (k ∈ bl) =
1 whenever location k is in distance band b relative to location l.
The parameters to be estimated are θ = β, δ, σ where β captures firms’ preferences
over the exogenous market conditions, δ reflects competitive effects across firms and
locations, and σ measures the importance of the market level random component. To
estimate the model, we nest the fixed point algorithm in (5) into a maximum likelihood
routine. For a given set of values of the parameters and the market, location, and
firm specific exogenous variables, the system of equations in (5) is solved numerically
for its fixed point for each market. The fixed point is a vector with∑
i Lim elements
which represent the probability of each location being chosen by each firm. Given those
probabilities, P (ym) =∏
i P (yim) is the probability of the equilibrium realization that
is consistent with the observed market outcome.The likelihood function is given by
L (y, X, ξ; θ) =M∏
m=1
P (ym|Xm, ξm; β, δ) . (10)
The unconditional likelihood then involves integrating over the distribution G (.) of the
unobserved market component ξm, where ξm is assumed normally distributed with vari-
ance σ in our application.
L (y, X; θ) =M∏
m=1
∫P (ym|Xm, ξm; β, δ) dG (ξm|σ) , (11)
where ym = (yWm, yKm, yTm) are observed actions taken by Wal-Mart, Kmart and Target.
Computationally, the model is quite complicated. To simplify the estimation, we em-
ploy a simulated maximum likelihood routine as follows.14 Given a set of parameter val-
ues and a vector of simulated draws with standard normal distribution ς =(ς1, ς2, ..., ςR
)14One problem of the likelihood approach is that there might be multiple equilibria in the incomplete
information game. However, as long as the strength of competition effect is moderate, the equilibriumcan be solved uniquely. We show analytical and simulation results in the appendix. An alternativeestimation strategy is to use a two-stage pseudo MLE procedure described in Bajari, Benkard and Levin(2005), which does not require computation of equilibrium outcomes. The approach involves two stepswhere the first step is to recover the agents’ belief on their competitors behavior, which in practice,involves regression of observed actions on state variables, such as demand and cost shifters.Suppose θ1
are the parameters we obtained from the first stage regression and θ2 are the primitive parameters infirms’ payoff functions. The first-stage parameters can be estimated either nonparametrically or someparametric model (for example multinomial logit) when the dimension of the problem is large. Given the
18
for the unobserved market characteristic, we begin with an initial guess of the choice
probability for each firm P 0m ∈ [0, 1]
∑i Lim in each market. We then calculate each firm’s
expected payoff for each location conditional on its beliefs about its competitors’ location
choice probability P 0−im. For each random draw ςr, we solve the fixed point problem (5)
by successive approximation to obtain the Bayesian Nash equilibrium probabilities P rm.
We then repeat the procedure R times to find the Bayesian Nash equilibrium for each
random draw. The predicted probability of the observed outcome is approximated by∫P (ym|Xm, ξm; β, δ) dG (ξm|σ) =
1
R
R∑r=1
P rm (ym) . (12)
We then use the simulated probabilities in a log-likelihood function:
θ = arg maxθ
LL (y, X; θ) =M∑
m=1
ln
[1
R
R∑r=1
P rm (ym)
]. (13)
4.2 Results
We include a variety of variables that capture demand, cost, and logistical factors that
could be important in a firm’s location decision. Since these factors might operate dif-
ferently in large MSA markets compared to rural areas, we divide the sample into MSA
and non-MSA markets and estimate separate models for each. We also allow Wal-Mart
to choose between its discount store and supercenter formats. As noted in the data sec-
tion above, the other two players mainly operate discount stores so we ignore the format
choice decisions for these players.
In Table 5, we report the parameter estimates for MSA markets.15 We will focus on
those estimates since similar results obtain for non-MSA markets as displayed in Table
predicted probabilities from the first stage, Pi
(ai|x, θ1
), the structural parameters θ2 can be recovered
by maximizing the following likelihood.
L(θ1, θ2
)=
N∑m=1
I∑i=1
log
exp(∑
a−imfim (aim, a−im, xm; θ2) P−im
(a−im|xm, θ1
))1 +
∑a′im
exp(∑
a−imfim (a′im, a−im, xm; θ2) P−im
(a−im|xm, θ1
))
The estimation procedure has computational advantage over the full MLE in that it does not requireto solve for fixed point problem to get the equilibrium outcomes. The first stage estimation recoversequilibrium beliefs from the data directly. However, the cost of the two-stage approach is that it does notuse all the information available from the model, especially in the first stage estimation. In addition itassumes that all the relevant state variables are observed during the first stage estimation. In practice,there are a number of factors that are observed by firms, but not by analysts. As noted below, theunobserved market characteristics, ξm are found to be quite significant in the empirical application.
15As a caveat, it should be noted that the parameters should be interpreted with care due to both
19
6. Not surprisingly, we find that closer population contributes more to profitability that
more distant population as reflected in the lower value of the coefficients for population
in more distant bands. However, the parameters for distant population are significantly
larger for Wal-Mart, particularly for its supercenter format. This finding suggests that
the trading area for Wal-Mart is larger than that of its rivals. The estimated effect
of population density is negative for all players except Kmart. While this may seem
counterintuitive, note that population density is measured at the specific location cell
level, while the population measures include populations from the immediate as well as
neighboring locations that fall within the respective distance band. Since the discount
stores tend be large in size, it is not necessarily surprising that population density in
the immediate location is low. In addition, a location with high population density may
have high real estate costs implying that the chains would prefer locations with lower
immediate population density, but higher overall surrounding population.
The next two variables, %Child and %Novehicle, represent the percent of population
with children under the age of 18 and percent of population with no vehicles. The
parameters for %Child are positive for Wal-Mart and Target, but are insignificant for
Kmart. Thus, both Wal-Mart and Target stores prefer markets where a larger proportion
of families have kids. In relative terms, the impact is larger for Wal-Mart, particularly
for its supercenter format. The parameter for %Novehicle is significantly negative for all
stores which is intuitive as these stores are built with large parking lots that are generally
inaccessible without vehicles. The next two demographic variables are median income
and education measured as the percentage of population with a college degree. These
two variables reveal some interesting differences across firms. The income parameter is
negative for both Wal-Mart formats, insignificant for Kmart, and positive for Target.
These results suggest that, all else equal, Wal-Mart enters markets with lower incomes
while Target stores prefer high income areas. In addition, Target stores tend to have a
more educated population base. These results are consistent with common perceptions
of the positioning and target audiences of these stores. For example, as discussed above,
Target has historically attempted to position itself as an upscale discounter that caters
to a higher income and more educated clientele than its competitors. Similarly, lower
income levels in Wal-Mart locations are consistent with Wal-Mart’s every day low pricing
and value positioning.
the reduced form nature of the payoff function and the fact that, as with any discrete choice model, theparameters are only estimated up to an unknown positive scale factor. While the latter point implies thatthe absolute magnitudes of the estimates are difficult to interpret, their signs and relative magnitudesare directly interpretable.
20
The next variable HouseValue is the median house value which ideally will capture
the impact of real estate prices. The coefficient is negative for Wal-Mart discount store,
positive for its supercenter format, and insignificant for Kmart and Target. However, the
instability of these results appear to indicate that residential house value is in fact an
inaccurate proxy for commercial land value. The coefficients for retail wages are negative
and significant indicating that all firms strongly prefer markets with lower wages. This
is not surprising since labor tends to account for a significant proportion of operational
costs in retailing. The variable “num of retailers” is the total number of other retailers
and captures the general business density at a location. Note that these retail stores
are considered to be non-strategic players in our model even though some of them may
actually compete with the three discount stores. The coefficient is negative for the two
Wal-Mart stores and positive for Kmart and Target. This suggests that Wal-Mart tends
to prefer stand alone locations while its competitors are more likely to be surrounded by
other businesses.
The last two variables Dist HQ and Dist DC are the distance of different market
locations from the headquarters and nearest distribution center of a firm. Unlike all
the variables discussed above, these distance measures are firm-specific. The negative
coefficients for distance to headquarters suggests that, all else equal, firms prefer markets
that are closer to their respective headquarters. This is consistent with the distribution
of stores across states reported in Table 2. Looking at the distance to distribution center,
we find that Wal-Mart, particularly for its supercenter format, prefers locations that are
closer to its distribution center. This is not surprising as Wal-Mart is well known for its
efficiency in distribution, and a large part of its success as a retailer has been attributed
to these logistical efficiencies.16
Looking at the competitive interaction parameters, some interesting patterns emerge.
As expected, all parameters are negative indicating that presence of rivals reduce a firm’s
profits. However, proximity to competitors is an important determinant of profitability
as the competition effect diminishes with distance. For instance, the point estimates
suggest that the impact of Kmart on a Wal-Mart’s discount store is approximately 45%
lower when it is located more than ten miles away instead of being located in the imme-
diate neighborhood. This decreasing relationship between distance and competition is
consistent across all firms and formats which suggests strong returns to spatial differen-
16According to an independent study by McKinsey &Co., Wal-Mart’s efficiency gains were the sourceof 25% of the entire U.S. economy’s productivity improvement from 1995 to 1999. “Can Wal-Mart getany bigger” Time Magazine (Vol. 161, issue 2, 2003)
21
tiation in this industry as firms can shield their profits by choosing sufficiently different
locations. Looking at the competitive effects across firms, we find strong asymmetries
in the competitive interaction parameters. For example, looking at Kmart and the two
Wal-Mart formats, we find that while Kmart substantially impacts Wal-Mart when it is
in close proximity, the effect is significantly lower than the effect of Wal-Mart on Kmart.
Similarly, when Kmart is located beyond ten miles, its effect on Wal-Mart’s discount
store and supercenter are approximately one-fourth of the impact that the Wal-Mart
stores have on Kmart at a similar distance. Target stores fare well in the face of com-
petition from other discount stores except when the competitors are in close proximity.
However, the chain appears vulnerable to Wal-Mart’s supercenter format. Perhaps most
interesting, the impact of Wal-Mart supercenter appears to decline less rapidly with dis-
tance when compared to the effect of any of the other firms. This finding suggests that
Wal-Mart’s supercenter format yields a substantially larger market area relative to other
formats as even competitors which are relatively far from a supercenter are not insulated
from competition with it.
To better interpret the competition effects, one can calculate measures on the demand
reduction in terms of population when a competitor moves into a location, holding all
else constant. For example, moving a new Wal-Mart discount store within 2 miles of
Kmart is equivalent to reduction of 33,600 in population for Kmart in the immediate
neighborhood. The effect of a new Kmart on a Wal-Mart discount store is only 15,800
which is less than half the impact that Wal-Mart has on Kmart. If a new Wal-Mart
supercenter moves in, even in a far away location (beyond 10 miles), it is equivalent to a
reduction in population of 24,000 for Kmart and 18,900 for Target. The corresponding
numbers for a Wal-Mart discount store are 20,900 for Kmart and 2,000 for Target. Thus,
Kmart seems vulnerable to both formats of Wal-Mart even in far away locations, while
Target fares well when faced with a distant Wal-Mart discount store, but is still affected
strongly by a Wal-Mart supercenter. In fact, the impact of Target is stronger on Kmart
and Wal-Mart’s discount store when located more than 10 miles away rather than the
other way around.
4.3 Varying Market Conditions and Equilibrium Outcomes
The model parameters can be used to predict the equilibrium market structures for differ-
ent sets of market conditions and provide insights on how these firms shield themselves
from competition by spatial differentiation. For instance, consider a market with two
possible locations that are 10 miles apart, and three potential competitors: Wal-Mart
22
supercenter, Kmart, and Target. Further, suppose the population in location 1 is 30,000
and income is at the mean level in the population which is normalized to zero. In ad-
dition, all other variables are set at their population mean levels. Figure 3 displays the
predicted market structures as population and income change in location 2. Note that
the parameter estimates of the latent payoff functions suggest that the impact of the two
variables may work differently for different firms. In particular, while all firms prefer
increase in population especially in the immediate location, Target is the only firm that
prefers higher income levels. The darker shade represents choice of location 1, lighter
shade the choice of location 2 (Red and Blue if viewed in color), and non-shaded ar-
eas represent situations where firms stay out of the market. The left panel shows the
predicted outcomes without considering competition, while the right panel shows the
equilibrium structures after taking competition into account. Figure 3-1a shows that
when firms do not compete, no entry and monopoly outcomes are rare as all three firms
are willing to enter the market with moderate increases in population and income levels.
Figure 3-1b shows that the region of no-entry and monopoly increase dramatically once
we account for competition. Moreover, the occurrence of oligopoly is much less prevalent
and only arises at significantly high population and income levels.
In Figures 3-2a to 3-4b, we show how the predicted entry and location choices vary
by individual firms. Comparing individual firms under competition and no-competition,
some interesting patterns emerge. Wal-Mart’s no-entry region, while larger under compe-
tition compared to the no-competition case, is relatively small compared to its rivals. The
impact of competition is highest on Kmart as it gives up the majority of the space that it
would have occupied if competitive concerns were irrelevant. Looking at Wal-Mart’s lo-
cation choices, we observe that as the income in location 2 increase above a certain point,
Wal-Mart is willing to enter the market by switching to location 1. On the other hand,
both Target and Kmart prefer not to enter the market altogether with moderate market
conditions and choose location 2 as it becomes sufficiently attractive. These results are
primarily driven by Wal-Mart supercenter’s ability to enter markets with lower popu-
lation levels, attract population from further away locations, and its ability to impact
competitors even when it is located far away. Comparing the monopoly region in Figure
3-1b with those of three firms under competition (Figures 3-2b, 3-3b, 3-4b), we can see
that majority of the monopoly region is occupied by Wal-Mart with a small region (those
with moderate population and very high income) being Target’s monopoly. Kmart on
the other hand is rarely able to operate as a monopolist. Of course these predictions
would change at different values of the other market variables. For example, Kmart may
23
have monopoly markets in regions close enough to its headquarters.
Overall, the results show that all firms exert a strong negative impact on its competi-
tors when they are in close proximity and that this effect decreases with distance to rivals.
Wal-Mart’s supercenter format is found to be the most formidable player as it strongly
impacts competitors even from a distant location. Not surprisingly, in our empirical data
we observe a large number of monopoly markets for Wal-Mart supercenter. We also find
asymmetries across players in terms of exogenous market characteristics. In Table 7, we
show the parameter estimates from a model that ignores player identities. With minor
differences, this is the model used by Seim (2005) and Watson (2004) in their applica-
tions to video rental stores and eyewear retailers respectively. Several similarities exist
in the parameter estimates with the full model presented above. For instance, payoffs
are higher with population closer the firm’s location. Similarly, the competitive effect
decreases with distance to rivals suggesting returns to geographical differentiation. How-
ever, this model can not discern the differences across firms with regards to preference for
exogenous variables as well as competition effects. While this model may be reasonable
in industries with small local players who are reasonably homogeneous in many respects,
in our application the impact of income, education, business density, distance to distri-
bution center, and competition is found to be quite different based on the identities of
individual players.
5 Conclusion
In this paper we study the strategic entry and location choice decisions in the retail
discount industry. Using a novel data set that includes the store locations and the ac-
companying market conditions for all stores belonging to Wal-Mart, Kmart, and Target
chains, we study the factors determining firm’s entry and location choices. The model
is set up as an incomplete information game between the three players where firms are
assumed to have private information on their own profitability but observe only the dis-
tribution of their rivals’ profitability. The model incorporates asymmetries related to
the identity of the firms by allowing the competitive effects and preference for exoge-
nous market conditions to be firm-specific. The incomplete information setup provides
a tractable model for studying the complex interactions of optimal site selection as a
trade-off between proximity to good markets and increased competition with rivals in
these prime locations.
Results show that population in closer distance bands to firm’s location contribute
24
more to profitability than population in distant locations. All firms are found to prefer
markets with lower retail wages, markets where larger proportion of families have kids
and vehicles, and markets that are closer to their respective headquarters. We also find
significant differences across firms in certain demand characteristics. For instance, Wal-
Mart and Kmart stores are found to prefer markets with lower income levels, while Target
prefers markets with higher income and education levels. These results seem consistent
with common perceptions of these stores. We also find that Wal-Mart, particularly for
its supercenter format, gives strong weight to locations that are closer to its distribution
center, reflecting its logistical efficiencies. In terms of the competitive interaction param-
eters, we find that proximity to competitors an important determinant of profitability.
All firms are found to exert a strong negative impact on their competitors when they
are in close proximity, but the effect goes down with distance to rivals suggesting strong
returns to spatial differentiation in this industry. However, looking at the competitive
effects across firms, we find strong asymmetries in the competitive interaction parame-
ters. For example, while Kmart is able to impact Wal-Mart strongly when it is in close
proximity, its effect is significantly lower than what Wal-Mart exerts on Kmart’s payoffs.
Target stores are found to fare well under competition from other discount stores except
when these competitors are in close proximity. Wal-Mart’s supercenter format is found
to be the most formidable player as it is able to impact the competitors strongly even
when it is located far away.
There are of course several caveats to this study and directions for future research.
First, the incomplete information assumption in a static game implies that players are
allowed to have ex post regret about their decisions, which naturally leads to dynamic
considerations (Aguirregabiria and Mira 2004, Pakes et al. 2004). While applications
of multi-agent dynamic games pose significant computational challenges, they may pro-
vide insights into the rich dynamics inherent in strategic decisions like market entry and
exit that are lost in a static setting. Second, to reduce the number of parameters we
resorted to three distance bands that could have been made finer at the cost of addi-
tional parameters. Finally, in our application the only unobservables that are correlated
across players are market specific that do not differentiate between locations within a
market. Location specific unobservables can obviously be incorporated in the model
at the expense of additional computational burden. Although addressing many of these
shortcomings pose significant challenges in terms of data requirements and computations,
with advancements in computing power these hurdles can be overcome in the future.
25
Appendix: Equilibrium Properties of Incomplete In-
formation Games
A well-known problem with multi-agent discrete games is that they have non-existence ofpure strategy equilibria and multiple equilibria in which certain values of the underlyinglatent payoffs could simultaneously be consistent with more than one equilibrium out-come. For existance, researchers generally assume that the data is generated by a purestrategy equilibrium and that the model satisfies restrictions sufficient to guarantee theexistence of at least one such equilibrium. In the current application, since equation(6) isa continuous mapping from a probability space to itself, the existence of a solution of (6)can be proved from Brouwer’s Fixed Point Theorem. However, the uniqueness of such asolution is not guaranteed. The game will have multiple equilibria if, for given parameterθ, more than one value of P satisfies (6) in which case the likelihood function is not welldefined. Aradillas-Lopez (2005) discusses the sufficient conditions for the uniqueness ofequilibrium of incomplete information game and points out that the conditions for exis-tence of a well-defined likelihood function are generally weaker in incomplete informationgame than in the complete information version of the game.
Heckman (1978) studies the general class of simultaneous equation qualitative re-sponse models and shows that if the distribution of the unobservables is continuous withunbounded support, a necessary and sufficient condition for the model to have a uniqueequilibrium for any values of exogenous variables is that the model is recursive. Thoughuseful in some applications, this condition implies setting some strategic interactionsparameters (which are of main interest) to zero. Instead, researchers have generallyadopted one of the two approaches to deal with multiple equilibria problem. First, mod-els sometimes yield a prediction for some outcome that is unique across all equilibria. Forinstance, BR (1990, 1991) and Berry (1992) ignore the identities of the players that entera market and only consider the total number of entrants, an outcome for which theirmodels yield a unique prediction. Alternatively, some papers impose additional struc-ture on the model that guarantees equilibrium uniqueness. For example, Zhu, Singh, andManuszak (2005) consider sequential move games that yields additional conditions for anoutcome to be a subgame perfect Nash equilibrium.
In this appendix, we study the equilibrium properties for the model proposed inSection 2. We begin with a proof on the sufficient conditions of uniqueness for the twoplayer - two action game and then rely on simulation techniques for a more complexgame, i.e., when the action space or the number of players becomes larger.
Consider a simple 2 × 2 game with two players and two possible actions: in or out.Firm i’s payoff of entry is
πi = xiβi − δip−i + εi, i = 1, 2, (A1)
where p−i is the entry probability of i’s competitor, and εi is the private information heldby firm i and is type 1 extreme value distributed. Similar to the discussion on equation(7), the BNE is obtained by solving the following two simultaneous equations:
p1 =exp (x′1β1 − δ1p2)
1 + exp (x′1β1 − δ1p2),
and
p2 =exp (x′2β2 − δ2p1)
1 + exp (x′2β2 − δ2p1).
26
Define f =
(f1
f2
), where
f1 = p1 −exp (x′1β1 − δ1p2)
1 + exp (x′1β1 − δ1p2),
f2 = p2 −exp (x′2β2 − δ2p1)
1 + exp (x′2β2 − δ2p1),
then taking the derivates of f1 with respect to p2 yields
∂f1
∂p2
= −δ1 exp (x′1β1 − δ1p2) [1 + exp (x′1β1 − δ1p2)]− δ1 exp (x′1β1 − δ1p2) exp (x′1β1 − δ1p2)
[1 + exp (x′1β1 − δ1p2)]2
=−δ1 exp (x′1β − δ1p2)
[1 + exp (x′1β1 − δ1p2)]2 ,
and similarly∂f2
∂p1
=δ2 exp (x′2β2 − δ2p1)
[1 + exp (x′2β2 − δ2p1)]2 .
Hence the Jacobian matrix f ′ is 1δ1 exp(x′
1β1−δ1p2)[1+exp(x′
1β1−δ1p2)]2
δ2 exp(x′2β2−δ2p1)
[1+exp(x′2β2−δ2p1)]
2 1
.
From the Gale-Nikaido Theorem: Suppose f : Rn → Rn is continuously differentiablefunction, and define Ω as the rectangle Ω = x ∈ Rn, a ≤ x ≤ b , where a and b are givenvectors in Rn. Then f is one-to-one in Ω if for all x ∈ Ω, Jacobian matrix f ′ (x)has onlystrictly positive principal minors. The sufficient condition that (1) has unique solutionfor (p1, p2) is
1−[
δ1 exp (x′1β1 − δ1p2)
[1 + exp (x′1β1 − δ1p2)]2
] [δ2 exp (x′2β2 − δ2p1)
[1 + exp (x′2β2 − δ2p1)]2
]> 0,
or equivalently,
δ1δ2 <[1 + exp (x′1β1 − δ1p2)]
2 [1 + exp (x′2β2 − δ2p1)]2
exp (x′1β1 − δ1p2) exp (x′2β2 − δ2p1)
= 4 +2
exp (x′1β1 − δ1p2)+ 2 exp (x′1β1 − δ1p2) +
2
exp (x′2β2 − δ2p1)+ 2 exp (x′2β2 − δ2p1)
+exp (x′2β2 − δ2p1)
exp (x′1β1 − δ1p2)+
exp (x′1β1 − δ1p2)
exp (x′2β2 − δ2p1)
+1
exp (x′1β1 − δ1p2 + x′2β2 − δ2p1)+ exp (x′1β1 − δ1p2 + x′2β2 − δ2p1)
≥ 16
27
Hence as long as the interaction term of the competition intensity δ1δ2 is below 16, theuniqueness of the Bayesian Nash equilibrium is ensured in this 2× 2 game.
The message from the simple example above is that as long as the strength of strategicinteraction is within a certain range, equilibrium is unique. As the game becomes morecomplex, i.e., the action space or the number of players gets larger, the matrix algebrabecomes cumbersome. We rely on simulation technique to investigate the uniquenessproperty of incomplete information games. In the simulation, We are interested in howthe equilibrium uniqueness property is related to (1) the variation of market character-istics across locations (choice alternatives), (2) the heterogeneity across players, and (3)the magnitude of competition effect. This simulation exercise is motivated by Bayer andTimmins (2004). We assume
• There are two locations in the market, location 1 and location 2.
• There are two players A and B who decide whether to enter the market or not andwhich location to choose.
• The competition between players at the same location is stronger than the compe-tition between players from different locations.
Players’ payoff functions are assumed to be:
πil = xlβi −2∑
k=1
δilkPi′l + εil, i, i
′ ∈ A, B and i 6= i′, l, k = 1, 2
where δilk denotes the how firm i’s payoff at location l is affected by its competitor’s entry
of location k and δill ≥ δi
ll′ , where l′ 6= l. εil are iid type one extreme value distributed.We normalize x1 = 1, βA = 1, and change x2 from 1 to 10 and βB from 1 to 2. The
ratio x2/x1 captures the variation of market conditions across locations. The larger theratio, the more different the two locations are. Similarly, the ratio βB/βA representsthe level of firm heterogeneity. Given the same market conditions, firms’ profitabilitywill be different due to their preferences of those market factors. Given x1, x2, β1, β2
and δilk, the fixed points of the probability vector (P1A, P2A,P1B,P2B) are solved from
different starting values: (0 0 0 0)′, (0 0 1 0)′, (0 0 0 1)′, (1 0 0 0)′, (1 0 1 0)′, (1 0 0 1)′,(0 1 0 0)′, (0 1 1 0)′, (0 1 0 1)′. The discussion on why one should choose those points asstarting points to test equilibrium multiplicity can be found in Bayer and Timmins(2004).As we change δA
11 from 0 to 100, and find δill, such that as δi
ll > δill, we start to observe
multiple fixed points for a particular combination of x1, x2, β1, β2. In figure 4 and figure5 we plot how δi
ll changes as x2/x1 and β2/β1 changes under two scenarios:Case 1: Symmetric competition , δA
lk = δBlk, δi
ll′ = 12δill
Case 2: Asymmetric competition: δAlk = 2δB
ik, δill′ = 1
2δill
In the first case, firms are symmetric in terms of competition, while in the secondscenario Player B is stronger than player A in the sense that it exerts stronger negativeimpact on its competitor. The competition intensity is weaker if two players choosedifferent locations. Comparing the plot for case 2 and that for case 1, we can findthat the threshold for case 2 is larger. The reason is that the uniqueness conditions aredetermined by the interaction terms of the competition intensity. Hence the range forthe competition intensity of δ1
11 is larger if δAlk > δB
lk.The results from simulation can be summarized as follows. First, as long as the
strength of competition effect is moderate, there is a unique equilibrium in the incom-plete information game. Second, there exists a thereshold δA
lk such that when δAlk > δA
lk,
28
one starts to observe multiple equilibrium given the location characteristics and firms’preference of those market characteristics. We find that as the variation of market char-acteristics increases, the threshold δA
lk increases. Put another way, the multiple equilibriais less likely to be observed when there is larger variation across locations. This find-ing is consistent with Seim(2001). Finally, as the asymmetry between players increasesboth in terms of their preference to market conditions (βB/βA) and the asymmetric ofcompetition effect, the threshold δA
lk becomes higher.
29
Reference
1. Aguirregabiria, V. and P. Mira (2004), “Sequential Estimation of Dynamic DiscreteGames,” Working Paper, Boston University.
2. Aradillas-Lopez, A. (2005), “Semiparametric Estimation of a Simultaneous Gamewith Incomplete Information.” Working Paper, Berkeley.
3. Arnold, S. J., T. H. Oum and D. J. Tigert (1983), “Determinant Attributes inRetail Patronage: Seasonal, Temporal, Regional and International Comparisons,”Journal of Marketing Research, 20 (May) 149-57.
4. Bajari P., L. Benkard, and J. Levin (2005), “Estimating Dynamic Models of Im-perfect Competition,” Working paper, Duke.
5. Bajari, P., H. Hong, J. Krainer, and D. Nekipelov (2005), ‘Estimating StaticModels of Strategic Interactions’, Working Paper, University of Michigan.
6. Bayer P. and C. Timmins (2004), “ On the Equilibrium Properties of LocationalSorting Models,” Working Paper, Yale University.
7. Berry, S. T. (1992), “Estimation of a Model of Entry in the Airline Industry,”Econometrica, 60 (4), 889-917.
8. Besanko, D., J. P. Dube, and S. Gupta (2003), “Competitive Price Discrimina-tion Strategies in a Vertical Channel Using Aggregate Retail Data,” ManagementScience, 49 (9), 1121-38.
9. Besanko, D., S. Gupta, and D. Jain (1998), “Logit Demand Estimation underCompetitive Pricing Behavior: An Equilibrium Framework,” Management Science,44 (11), 1533-47.
10. Bresnahan, T. F. and P. C. Reiss (1987), “Do Entry Conditions Vary across Mar-kets,” Brookings Papers on Economic Activity (3), 833-81.
11. — (1990), “Entry in Monopoly Markets,” Review of Economic Studies, 57 (4),531-53.
12. — (1991), “Entry and Competition in Concentrated Markets,” Journal of PoliticalEconomy, 99 (5), 977-1009.
13. — (1991), “Empirical-Models of Discrete Games,” Journal of Econometrics, 48(1-2), 57-81.
14. Brown, S. (1989), “Retail Location Theory: The Legacy of Herald Hotelling,”Journal of Retailing, 65, 450-70.
15. Chan, T., V. Padmanabhan, and P.B. Seetharaman (2005) “Modeling Locationsand Pricing Decisions in the Gasoline Market: A Structural Approach” WorkingPaper, Olin School of Management
16. Chintagunta, P. K. (2002), “Investigating Category Pricing Behavior at A RetailChain,” Journal of Marketing Research, 39 (2), 141-54.
30
17. Chintagunta, P., J. P. Dube, and V. Singh (2003), “Balancing Profitability andCustomer Welfare in a Supermarket Chain, Quantitative Marketing & Economics,1(1), 111-47.
18. Chintagunta P., T.Erdem, P. Rossi and M. Wedel “Structural Modeling in Market-ing: Review and Assessment,” Marketing Science, forthcoming.
19. Ciliberto, F. and E. Tamer (2004), “Market Structure and Multiple Equilibria inAirline Markets,” Working Paper, Princeton University.
20. Craig, S. C., A. Ghosh and Sara McLafferty (1984), “Models of Retail LocationProcess: A Review,” Journal of Retailing, 60, 5-36.
21. Draganska, M. and D. Jain (2004), “Consumer Preferences and Product-line PricingStrategies: An Empirical Analysis” Working Paper, Stanford University.
22. Dube, J.P., K. Sudhir, A. Ching, G.S. Crawford, M. Draganska, J.T. Fox, W. Hart-mann, G.J. Hitsch, V.B. Viard, N. Vilcassim and M. Villas-Boas (2004), “RecentAdvances in Structural Modeling: Dynamics, Product Positioning and Entry,” ForChoice Conference 2004 based Special Issue of Marketing Letters.
23. Einav, L. (2004), “Not All Rivals Look alike: Estimating an Equilibrium Model ofthe Release Date Timing Game.” Working Paper, Stanford University.
24. Heckman, J., (1978), “Dummy Endogenous Variables in a Simultaneous EquationSystem,” Econometrica, Econometric Society, 46(4), 931-59.
25. Huff D.(1964), ”Defining and Estimating a Trading Area,” Journal of Marketing,28, 34-38.
26. Kadiyali, V., P. Chintagunta, and N. Vilcassim (2000), “Manufacturer-retailerChannel Interactions and Implications for Channel Power: An Empirical Inves-tigation of Pricing in a Local Market,” Marketing Science, 19 (2), 127-48.
27. Kadiyali, V., K. Sudhir, and V. R. Rao (2001), “Structural Analysis of Compet-itive Behavior: New Empirical Industrial Organization Methods in Marketing,”International Journal of Research in Marketing, 18 (1-2), 161-86.
28. Kadiyali, V., N. Vilcassim, and P. Chintagunta (1999), “Product line extensions andcompetitive market interactions: An empirical analysis,” Journal of Econometrics,89 (1-2), 339-63.
29. Khan, R. J. and D. C. Jain (2005). ”An empirical analysis of price discriminationmechanisms and retailer profitability.” Journal of Marketing Research 42(4), 516-524.
30. Kyle, M. K. (2005), “The Role of Firm Characteristics in Pharmaceutical ProductLaunches,” Forthcoming, RAND Journal of Economics.
31. Mazzeo, M. J. (2002), “Product Choice and Oligopoly Market Structure,” RandJournal of Economics, 33 (2), 221-42.
32. — (2002), “Competitive Outcomes in Product-differentiated Oligopoly,” Review ofEconomics and Statistics, 84 (4), 716-28.
31
33. Orhun Y.(2005), “Spatial Differentiation in The Supermarket Industry,” WorkingPaper, Berkeley.
34. Pakes, A., M. Ostrovsky, and S. Berry (2004), “Simple Estimators for the Pa-rameters of Discrete Dynamic Games with Entry/Exit Examples,” Working Paper,Harvard University.
35. Reiss, P. C. and F. Wolak (2004), “Structural Econometric Modeling: Rationalesand Examples from Industrial Organization,” Working Paper, Stanford University.
36. Seim, K. (2005), “An Empirical Model of Firm Entry with Endogenous Product-Type Choices ,” Forthcoming, Rand Journal of Economics.
37. — (2001), “Spatial Differentiation and Market Structure: The Video Retail Indus-try,” Yale University Dissertation.
38. Sudhir, K. (2001), “Structural Analysis of Manufacturer Pricing in the Presence ofa Strategic Retailer,” Marketing Science, 20 (3), 244-64.
39. Tamer, E. (2003). ”Incomplete simultaneous discrete response: Model with multi-ple equilibria.” Review of Economic Studies 70(1), 147-165.
40. Toivanen, O. and M. Waterson, “Market structure and entry: where’s the beef?,”RAND Journal of Economics,Forthcoming.
41. Villas-Boas, J. M. and Y. Zhao (2005). ”Retailer, manufacturers, and individ-ual consumers: Modeling the supply side in the ketchup marketplace.” Journal ofMarketing Research 42(1): 83-95.
42. Watson, R. (2004), “Product Variety and Competition in the Retail Market forEyeglasses,” Working Paper, University of Texas at Austin.
43. Zhu, T., V. Singh and M. Manusazk (2005), “Market Structure and Competitionin the Retail Discount Industry,” Working Paper, Carnegie Mellon University.
32
Table 1: Number of Stores in US
Store Names Number of Stores
Kmart Discount Store 1425Super Kmart Center 59
Target Discount Store 1133SuperTarget Center 127
Wal Mart Store 1490Wal Mart Supercenter 1442
33
Table 2: Number of Stores Across States
State Kmart Target Wal-Mart Total State Kmart Target Walmart Total
AK 5 0 9 14 MT 10 7 11 28AL 29 10 87 126 NC 47 32 101 180AR 6 4 86 96 ND 7 4 8 19AZ 23 37 43 103 NE 8 11 22 41CA 130 186 142 458 NH 9 5 26 40CO 23 31 53 107 NJ 43 29 35 107CT 9 11 32 52 NM 15 8 26 49DE 6 2 8 16 NV 11 14 20 45FL 113 79 194 386 NY 62 37 78 177GA 40 40 105 185 OH 90 44 105 239HI 7 0 6 13 OK 10 10 94 114IA 25 22 53 100 OR 14 16 28 58ID 8 5 16 29 PA 106 31 104 241IL 70 64 116 250 RI 1 2 8 11IN 45 37 87 169 SC 29 15 60 104KS 13 14 55 82 SD 10 4 10 24KY 31 12 79 122 TN 39 23 96 158LA 12 10 82 104 TX 20 110 310 440MA 20 20 43 83 UT 18 9 27 54MD 24 24 39 87 VA 46 31 77 154ME 6 1 22 29 VT 3 0 4 7MI 93 52 68 213 WA 21 29 35 85
MN 33 66 57 156 WI 39 29 71 139MO 29 26 114 169 WV 17 3 29 49MS 9 2 66 77 WY 9 2 10 21
All 1493 1260 3057 5810∗
*The number of stores is larger than table 1 because some neighborhood stores areincluded in table 2.
34
Table 3: Market Configurations Defined by County
CountyMarket Structure N Population Income
(0,0,0) 1356 15,029 31,260(W,0,0) 913 36,507 33,630(0,K,0) 70 93,493 38,582(0,0,T) 9 341,596 46,940(W,K,0) 339 78,374 36,657(W,0,T) 106 249,937 45,213(0,K,T) 17 395,953 44,314(W,K,T) 410 395,574 43,836
35
Tab
le4:
Mar
ket
Char
acte
rist
ics
Var
iable
sD
escr
iption
All
Mar
kets
Non
-MSA
MSA
Mea
nStd
Dev
Mea
nStd
Dev
Mea
nStd
Dev
Mark
et-
Level
Are
aM
arke
tLan
dA
rea
ofa
Mar
ket
(squar
em
iles
)69
610
6095
711
7533
273
4P
OP
Mar
ket
Tot
alpop
ula
tion
ofa
Mar
ket
31,7
9327
,234
22,7
6319
,214
44,4
0531
,455
Wag
eA
vera
geR
etai
lW
age
19,1
264,
034
17,2
703,
695
21,7
552,
864
Num
ofR
etai
lers
Num
ber
ofR
etai
lE
stab
lish
men
ts39
546
913
014
077
451
3N
um
ofFood
Num
ber
ofFood
Ret
aile
r50
6117
1896
70N
um
ofLoca
tion
sN
um
ber
ofTra
cts
7.37
7.70
5.73
4.11
9.33
6.06
Loca
tion-L
evel
PO
P<
2Pop
ula
tion
wit
hin
2miles
ofa
loca
tion
5,99
15,
826
4,15
83,
637
8,55
17,
186
PO
P2
10Pop
ula
tion
bet
wee
n2
to10
miles
loca
tion
9,73
719
,361
5,78
38,
919
20,8
3621
,107
%N
oveh
icle
%H
ouse
hol
dw
ithou
tVeh
icle
0.07
0.03
0.07
0.03
0.06
0.03
%C
hild
%pop
ula
tion
wit
hC
hild
bel
ow18
0.23
0.03
0.23
0.03
0.24
0.03
HH
size
Ave
rage
HH
size
2.59
0.23
2.56
0.20
2.63
0.25
Inco
me
HH
Med
ian
Inco
me
38,0
8111
,622
32,4
406,
264
45,9
6012
,744
Old
rati
o%
pop
ula
tion
abov
e65
0.14
0.04
0.16
0.04
0.12
0.04
Educa
tion
%Pop
wit
hB
achel
orD
egre
e0.
120.
050.
100.
040.
150.
05H
Uva
lue
Med
ian
valu
eof
Hou
sing
Unit
129,
497
103,
123
77,4
0243
,128
143,
590.
109,
888
Fir
m-L
evel
Dis
tW
alH
QD
ista
nce
toW
al-M
art
Hea
dquar
ter
715
363
637
344
823
362
Dis
tK
mar
tH
QD
ista
nce
toK
mar
tH
eadquar
ter
745
475
804
454
664
492
Dis
tTar
etH
QD
ista
nce
toTar
get
Hea
dquar
ter
819
351
760
345
902
342
Dis
tW
alD
CD
ista
nce
toW
al-M
art
Dis
tC
tr10
982
125
9485
53D
ist
Km
art
DC
Dis
tance
toK
mar
tD
ist
Ctr
174
117
202
120
133
100
Dis
tTar
etD
CD
ista
nce
toTar
get
Dis
tC
tr15
499
172
108
129
77
36
Tab
le5:
Est
imat
ion
Res
ult
sfo
rM
SA
Mar
kets
Wal
-Mar
tSto
reW
al-M
art
Sup
Km
art
Tar
get
Par
amet
erStd
Err
Par
amet
erStd
Err
Par
amet
erStd
Err
Par
amet
erStd
Err
Inte
rcep
t-3
.727
10.
0440
-3.4
585
0.05
15-3
.940
20.
0706
-5.7
757
0.11
73P
OP
<2m
ile
0.83
510.
0298
1.09
680.
0262
1.17
950.
0257
1.29
010.
0217
PO
P2-
10m
ile
0.48
050.
0143
0.61
180.
0113
0.67
790.
0163
0.63
180.
0126
PO
P>
10m
ile
0.11
460.
0305
0.20
160.
0303
0.04
820.
0282
0.03
130.
0292
PO
PD
ensi
ty-0
.460
60.
0640
-0.2
519
0.06
230.
0629
0.03
03-0
.324
40.
0878
%C
hild
0.22
380.
0324
0.33
990.
0423
0.00
220.
0300
0.11
350.
0314
%N
oveh
icle
-0.0
699
0.04
53-0
.072
70.
0301
-0.1
399
0.04
33-0
.070
60.
0317
Inco
me
-0.0
466
0.06
26-0
.460
70.
0586
0.05
040.
0581
0.48
240.
0427
Educa
tion
-0.0
701
0.05
250.
1892
0.04
89-0
.774
50.
0456
0.29
290.
0759
HU
valu
e-0
.108
00.
0588
0.15
880.
0489
0.05
940.
0698
0.06
750.
0548
Ret
ailW
age
-0.4
923
0.05
22-0
.630
50.
0759
-0.4
038
0.08
36-0
.627
00.
0847
Num
ofR
etai
lers
-0.3
063
0.03
18-0
.183
60.
0287
0.13
010.
0400
0.03
290.
0277
Dis
tH
Q-0
.073
00.
0469
-0.4
659
0.04
52-0
.510
20.
0716
-0.5
285
0.03
60D
ist
DC
-0.1
766
0.09
58-0
.336
80.
0491
0.23
420.
0832
-0.1
554
0.04
27W
al-M
art
Sto
re<
2mile
-3.9
623
0.33
50-1
.496
30.
1848
Wal
-Mar
tSto
re2-
10m
ile
-2.6
429
0.44
72-0
.962
60.
4939
Wal
-Mar
tSto
re>
10m
ile
-2.4
647
1.78
36-0
.257
62.
3490
Wal
-Mar
tsu
p<
2mile
-4.7
761
0.36
46-3
.262
10.
3390
Wal
-Mar
tsu
p2-
10m
ile
-4.5
454
0.39
02-2
.487
40.
3516
Wal
-Mar
tsu
p>
10m
ile
-2.8
713
0.79
90-2
.435
00.
2693
Km
art<
2mile
-1.3
194
0.21
87-2
.513
50.
3820
-3.5
263
0.35
07K
mar
t2-
10m
ile
-0.6
351
0.32
45-0
.843
80.
1363
-0.9
987
0.10
87K
mar
t>
10m
ile
-0.5
706
0.41
38-0
.701
10.
1487
-0.9
432
2.24
39Tar
get<
2mile
-1.9
796
0.56
45-3
.999
80.
8493
-1.9
799
0.15
77Tar
get
2-10
mile
-1.4
787
0.52
63-1
.603
50.
2654
-1.8
209
0.29
31Tar
get>
10m
ile
-0.8
690
0.74
68-1
.567
90.
9425
-1.6
350
1.54
22
σ1.
2626
0.01
84
LL
-72
90.5
6
37
Tab
le6:
Est
imat
ion
Res
ult
sfo
rN
on-M
SA
Mar
kets
Wal
-Mar
tSto
reW
al-M
art
Sup
Km
art
Tar
get
Par
amet
erStd
Err
Par
amet
erStd
Err
Par
amet
erStd
Err
Par
amet
erStd
Err
Inte
rcep
t-3
.297
70.
2169
-3.4
794
0.16
11-3
.592
80.
2495
-6.4
059
0.43
48P
OP
<2m
ile
2.14
010.
1202
1.02
890.
1284
1.67
240.
1833
2.10
040.
1759
PO
P2-
10m
ile
0.89
250.
0811
0.59
540.
0856
0.93
750.
0864
0.80
840.
1729
PO
P>
10m
ile
0.04
850.
0967
0.20
690.
1021
0.04
210.
1107
0.07
360.
2234
PO
PD
ensi
ty-0
.348
50.
0937
-0.2
903
0.08
95-0
.172
50.
0998
-0.8
512
0.25
91%
Child
0.14
690.
0998
0.25
520.
1004
0.43
730.
1178
0.36
620.
2588
%N
oveh
icle
-0.2
216
0.10
70-0
.103
60.
1149
-0.2
755
0.15
150.
0413
0.48
17In
com
e0.
3044
0.23
21-0
.308
70.
2291
-0.5
667
0.27
060.
8065
0.54
57E
duca
tion
0.35
390.
1422
-0.0
277
0.15
460.
5190
0.15
230.
6773
0.28
23H
Uva
lue
-0.0
576
0.10
010.
1332
0.05
950.
2231
0.07
24-0
.024
30.
1907
Ret
ailW
age
0.62
320.
1362
-0.6
428
0.14
910.
4638
0.15
350.
2634
0.36
13N
um
ofR
etai
lers
0.19
950.
2729
-0.2
398
0.22
051.
7493
0.30
292.
4026
0.75
22D
ist
HQ
-0.4
644
0.09
49-0
.468
20.
1036
-0.6
043
0.10
61-0
.919
90.
2128
Dis
tD
C-0
.361
00.
1033
-0.3
535
0.11
56-0
.030
60.
0668
-0.1
745
0.22
73W
al-M
art
Sto
re<
2mile
-3.3
957
1.06
25-0
.802
41.
7254
Wal
-Mar
tSto
re2-
10m
ile
-2.9
987
0.95
24-0
.545
41.
9186
Wal
-Mar
tSto
re>
10m
ile
-1.9
669
1.21
11-0
.058
33.
8555
Wal
-Mar
tsu
p<
2mile
-4.5
955
0.99
96-3
.629
12.
2309
Wal
-Mar
tsu
p2-
10m
ile
-3.3
090
0.91
54-2
.765
00.
9655
Wal
-Mar
tsu
p>
10m
ile
-2.9
500
1.51
34-2
.566
32.
5404
Km
art<
2mile
-1.7
034
0.50
70-2
.508
10.
2337
-1.4
193
2.66
51K
mar
t2-
10m
ile
-1.3
578
0.76
39-1
.666
80.
6362
-1.3
402
1.25
59K
mar
t>
10m
ile
-0.4
597
1.67
63-1
.210
11.
7519
-1.0
194
4.09
83Tar
get<
2mile
-3.5
105
1.02
83-2
.902
00.
7482
-2.2
507
1.03
00Tar
get
2-10
mile
-2.6
986
1.16
95-1
.724
41.
0328
-1.8
001
0.67
65Tar
get>
10m
ile
-0.9
601
1.75
60-1
.641
31.
6655
-0.7
464
1.52
87
σ1.
1052
0.13
45
LL
-524
2.78
38
Table 7: Parameter Estimates without Firm Asymmetry
Estimates from Separate MarketsMSA Non-MSA
Parameter Std Err Parameter Std Err
Intercept -4.0350 0.0281 -4.5098 0.1554POP<2mile 0.8470 0.0005 1.9706 0.0383POP2-10mile 0.3934 0.0246 0.8453 0.0465POP>10mile 0.0175 0.0339 0.0668 0.0402POP Density -0.2196 0.0396 -0.4390 0.0534% Child -0.0069 0.0454 0.3462 0.0746% Novehicle -0.2002 0.0341 -0.3261 0.0891Income 0.1104 0.0429 0.2915 0.0426Education -0.0451 0.0608 0.1988 0.1042HUvalue -0.3561 0.0534 0.1389 0.1086Retail Wage -0.3056 0.0686 -0.8286 0.1826Num of Retailers -0.0536 0.0534 1.6674 0.2280Dist HQ -0.5396 0.0341 -0.5453 0.0383Dist DC -0.0555 0.0386 -0.1186 0.0416Competition<2mile -1.3118 0.0919 -1.9736 0.0655Competition2-10mile -0.8500 0.0965 -1.3723 0.0655Competition>10mile -0.5828 0.2376 -1.3522 0.1360
LL -5743.88 -3786.79
39
Wal Mart Store
Wal Mart Supercenter
Wal Mart SupercenterBig KmartBig Kmart
P u l a s k iP u l a s k i L a u r e lL a u r e l
K n o xK n o x
J a c k s o nJ a c k s o n
C l a yC l a y
L i n c o l nL i n c o l n
W h i t l e yW h i t l e yW a y n eW a y n e
R o c k c a s t l eR o c k c a s t l e
M c C r e a r yM c C r e a r y
C a s e yC a s e y
O w s l e yO w s l e y
L e eL e eG a r r a r dG a r r a r d
B e l lB e l l
M a d i s o nM a d i s o n
Figure 1: Different Market Structures
40
−5 0 5 100
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A
p2A
p1B
p2B
x
1a. Location Choice without Competition
−5 0 5 100.5
0.55
0.6
0.65
0.7
0.75
0.8
0.85
0.9
0.95
1
Pro
b
p1A+p
2A
p1B+p
2B
x
1b. Entry Probability without Competition
−5 0 5 100
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A
p2A
p1B
p2B
x
2a. Location Choice with Symmetric Competition
−5 0 5 10
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A+p
2A
p1B+p
2B
x
2b. Entry Probability with Symmetric Competition
−5 0 5 100
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A
p2A
p1B
p2B
x
3a. Location Choice with Asymmetric Competition (Strong Distant Competitor)
−5 0 5 100.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A+p
2A
p1B+p
2B
x
3b. Entry Probability with Asymmetric Competition (Strong Distant Competitor)
−5 0 5 100
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A
p2A
p1B
p2B
x
4a. Location Choice with Asymmetric Competition (Strong Close Competitor)
−5 0 5 100.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pro
b
p1A+p
2A
p1B+p
2B
x
4b. Entry Probability with Asymmetric Competition (Strong Close Competitor)
Figure 2: Competition Effects
41
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
No Entry
Monopoly
Duopoly
Oligopoly
1a. Market Structure without Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
No Entry
MonopolyDuopoly
Oligopoly
Duopoly
Oligopoly
1b. Market Structure with Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
Location 2
Location 1
2a. Wal-Mart Location Choice without Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2(S
td D
ev a
bove
mea
n)
Location 2
Location 1
2b. Wal-Mart Location Choice with Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
Location 2
Location 1
3a. Kmart Location Choice without Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
Location 2
3b. Kmart Location Choice with Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
Location 2
Location 1
4a. Target Location Choice without Competition
0 1 2 3 4 5 6 7 8−3
−2
−1
0
1
2
3
4
5
Population of Location 2 (in 10,000)
Inco
me
of L
ocat
ion
2 (S
td D
ev a
bove
mea
n)
Location 2
4b. Target Location Choice with Competition
Figure 3: Market Structure Predictions
42