Will a Fat Tax Work? Romana Khan , Kanishka Misra , Vishal ...faculty.london.edu/kmisra/assets/documents/Fat_Tax.pdf · 1 Will a Fat Tax Work? Romana Khan1, Kanishka Misra2, Vishal

1

Will a Fat Tax Work?

Romana Khan1, Kanishka Misra

2, Vishal Singh

34

Abstract:

Of the many proposals to counter the obesity epidemic, the most contentious is the use of

the so-called “fat tax” to discourage consumption of unhealthy products. Using milk sales

data from a quasi-experimental field setting, we conduct a large scale empirical

investigation of the potential for price incentives to alter consumption behavior. We find

that even small price differences are effective in inducing consumers to switch to low fat

milk, particularly amongst lower income groups who are most vulnerable to obesity. Our

results provide empirical evidence and guidelines on how price incentives, via taxes or

subsidies, can shift consumer's choices and serve as an effective mechanism against

obesity.

1 McCombs School of Business, University of Texas at Austin, Austin, TX. 2 London Business School, London, England. Misra thanks the Center for Marketing, at the London

Business School for providing research support for this project. 3 Stern School of Business, New York University, New York, NY. 4 To whom correspondence should be addressed: [email protected]

2

When next at the grocery store, linger a moment longer in front of the milk cooler to

compare the prices for milk across fat content. You may be surprised to learn that

relative prices vary depending on which store you happen to patronize. At some stores,

prices are uniform across all fat content, whilst in others, they are non-uniform -

generally decreasing with fat content. The key issue here is whether and how these price

differences impact people's choices. To put it simply, do people switch to lower fat milk

when it is the cheaper option? This question is of interest because milk is among the top

three leading sources of saturated fat in the American diet (1). More importantly, it

relates to the larger hotly debated issue of whether price incentives (via a tax or subsidy)

can influence people's choices and serve as an effective mechanism to curb obesity.

Obesity in the US has reached epidemic proportions, two-third of adults and one in three

children are overweight or obese (2) (3). Obesity has been linked with increased an risk

of conditions such as heart disease and diabetes (4), and is estimated to cause 112,000

deaths every year (5). Moreover, it imposes significant externalities through productivity

losses and healthcare costs. Recent estimates put obesity-related medical expenditures as

high as $168 billion per year, half of which are paid by taxpayers in the form of Medicare

and Medicaid (6).

Given the individual and societal costs associated with obesity, the issue has received

attention from healthcare professionals, social scientists, and public officials.

Recommendations range from modification of food labels to educational programs

promoting healthier lifestyles (4) (7) (8). Among these interventions, the most

contentious is the use of the so-called “fat tax” to discourage consumption of unhealthy

products (9) (10). Proponents of the measure point to successes achieved in combating

tobacco use, and the potential to use tax revenues to offset other obesity-related costs.

Ideological opposition has come on the grounds of personal responsibility and the role of

government (11), as well as the potentially regressive nature of the policy (12).

Policymakers and researchers are also skeptical about the effectiveness of taxes in

generating additional revenue and/or changing consumption behavior (13) (14).

A major obstacle to evaluating the potential efficacy of a tax policy is the lack of

sufficient evidence on how it may impact consumption behavior. Although lessons from

tobacco taxes can be helpful, the food industry is significantly more diverse and not

3

limited to a subset of the population. There have been two general approaches to provide

guidelines on the likely impact of a fat tax. The first involves manipulating prices in a

controlled experimental setting to create incentives to switch to healthier options. Results

from both lab (15) and field experiments (16) (17) show that relative price reductions on

healthier options are highly effective in shifting demand toward them. The second

approach involves using price elasticity estimates for a class of products (e.g. sugared

beverages) to simulate changes in demand under hypothetical taxes (18) (19)5. Given that

at the category level most food items tend to be relatively price inelastic (20), the general

conclusion from this approach is that low taxes (such as those in place on carbonated

drinks in several states) will not alter behavior (21). Not surprisingly, recent studies

linking state level soda taxes to health outcomes have found limited evidence of any

association (22) (23) (24). To understand the apparent discrepancy in findings between

the two approaches, two aspects of current tax policies on soft drinks and snacks merit

mention. First, they are levied on the entire product class rather than on specific items,

giving consumers limited incentives to substitute within the category (e.g. regular to diet

soda). Second, taxes are usually in the form of sales taxes, rather than reflected in shelf

prices. Evidence suggests that a majority of consumers do not take sales taxes into

account when making purchase decisions (25).

This article provides a large-scale empirical field study on the impact of price differences

across fat content on consumer demand for milk. We use scanner data from 2001 to 2006

on UPC-level sales and prices from a nationwide sample of 1,567 grocery stores,

provided by IRI (26). The demographic profiles of each store's customer base show

significant heterogeneity in characteristics such as age, income, household size, race and

employment status (SOM text, table S1).

The analysis is motivated by extensive variation across stores in market shares and

pricing across fat content (SOM text, figure S4, table S3). As noted above, retail prices of

milk in the US are either uniform or (weakly) decreasing with fat content. The price

structure at a particular retailer is determined by chain policy at the state level (SOM text,

table S2), rather than by local demand conditions or demographics. This exogenous

5 Note that conclusions from research with secondary field data may be highly dependent on the type of

econometric model used by the researcher.

4

variation in pricing structure provides a natural quasi-experiment where we can observe

differences in purchase patterns directly from the data as opposed to simulating the

impact via an econometric model. The most prevalent structure, in approximately one-

third of stores, is strictly uniform - where prices across all fat content are equal. The

remaining stores span an array of non-uniform structures which share two key features:

whole milk is the most or one of the most expensive types and prices decrease with fat

content. As a preliminary analysis, figure 1 plots average prices and market shares under

uniform and non-uniform prices. When prices are uniform, whole milk has the highest

market share at 36.4%. Under non-uniform prices, where 2% milk is on average 14 cents

cheaper than whole milk, the market share of whole milk falls to 29.7%. The majority of

movement in market share is to 2% milk, which has the highest market share under non-

uniform prices.

While aggregate differences in market share indicate a high level of responsiveness to

price, they do not control for factors such as demographics. Table 1 shows the results

from a regression of whole milk market share on the average price of whole milk and its

substitutes: 2%, 1% and skim milk, and demographic characteristics surrounding each

store (see table S4 for full results). The dependent variable is the logit transformation of

the average market share of whole milk in each store in each year (SOM text). As

expected, the price coefficients indicate that an increase in the price of whole milk

decreases the share of whole milk, while increases in the prices of its lower fat substitutes

result in higher shares for whole milk. Evaluated at the mean, the price elasticity for

whole milk is -2.73, indicating elastic demand. This is higher in magnitude compared to

category-level elasticity measures reported in the literature for milk (14) and food

products in general (20). Note however that we are estimating within category elasticity,

and choice elasticity (i.e. substitution of products within category) tends to be high for

food products (27). This is important because the conclusion that a fat tax will have

limited impact on shifting consumer demand is driven by low price elasticity estimates at

the category level. While overall category elasticity may be low, our results indicate that

within-category demand is highly elastic, and price incentives can be used to shift

demand toward healthier options within a category. The closest substitute for whole milk

5

is 2% milk, with a cross-price elasticity of 1.86, while 1% and skim milk are weaker

substitutes, with cross price elasticities of 0.74 and 0.70, respectively6.

While the price elasticity indicates that demand for whole milk is highly responsive to

price, there may be nonlinearities in this relationship. To address this issue, we estimate a

regression of whole milk market share on the price ratio of whole milk to 2% milk (since

the majority of movement in market share is between whole and 2%). To account for

potential nonlinearities, we discretize the price ratio with a sequence of dummy variables

defined relative to the base of uniform prices: whole milk prices higher than 2% by 1-5%,

5-10%, 10-15%, 15-20%, and greater than 20% (table S5). Figure 2 plots the regression-

based market shares of whole milk at different levels of the price premium. Under

uniform prices, the market share of whole milk is 35%. A 1-5% premium of whole milk

over its 2% substitute reduces whole milk share by 5%. When the premium increases to

5-10%, whole milk share falls an additional 6%. Further increases lead to additional

reductions in whole milk share, but the marginal impact is lower. The key finding here is

that influencing choice through price mechanisms can be achieved with relatively small

price differences, with the majority of shifts in demand achieved with premiums of just 5-

10%.

Our results also reveal differences in demand across demographic groups (SOM text).

Given the higher prevalence of obesity among lower income groups (28), it is important

to assess whether the price incentive is equally effective across income levels. Figure 3

shows the regression-based (table S6) market shares for the lowest and highest income

quartiles at different levels of the whole milk premium. Under uniform prices, the

discrepancy between income groups is large - whole milk share for lower income exceeds

the higher income group by 17%. As the whole milk premium increases, the share for

both income groups falls, but the response is stronger for lower income, driven by higher

price sensitivity for this group. At a premium of 5-10%, the market share for low income

falls from 43% to 29%, while for high income it falls from 26% to 18%. The discrepancy

between income groups continues to fall as the premium increases, and disappears with a

premium of 15-20%. These results provide strong evidence that policies based on price

6 Our results are also robust when we instrument for price (table S4) to alleviate concerns about price

endogeneity.

6

incentives can be particularly useful in shifting the purchases of lower income

consumers, who are most vulnerable to obesity.

In summary, we find strong field evidence that price incentives can serve as an effective

mechanism to shift people's choices toward healthier options. Our results suggest that

relatively small price differences (5-10% in the case of milk) can induce substantial shifts

in demand, particularly amongst lower income consumers. These results provide

important guidelines to the hotly debated issue of fat taxes, such as those in consideration

for sugared beverages in several states and cities. The beverage industry has spent

millions of dollars on lobbying and advertising against the proposed taxes (29). Our

results suggest that such taxes, if reflected in shelf prices and strategically implemented

to encourage substitution to healthier options within a category, can be quite effective in

altering behavior. Our findings also lend credence to the potential effectiveness of

Walmart's recent announcement to join Michelle Obama's anti-obesity program, by

making healthy choices more affordable and eliminating the price premium for 'better-

for-you' products (items containing less sodium, sugar, and fats) (30). Based on our

findings, the effects of this policy might be more pronounced if instead of simply

eliminating the price premium on 'better-for-you' products, it were shifted to the 'worse-

for-you' products.

7

Table 1: Response of whole milk market share to price. OLS estimates with standard

errors.

Parameter Std. Error

Intercept -1.695 0.047

Whole milk price -1.352 0.046 -2.73 0.093

2% milk price 0.959 0.085 1.86 0.166

1% price 0.382 0.060 0.74 0.117

Skim price 0.372 0.052 0.70 0.098

Demographic controls: Income, Age, Race, and Population Density

Adjusted r-square 0.534

Number of observations 6835

Dependent variable: ln(Whole milk share/1-Whole milk share)

Elasticity of Whole

Milk wrt price of:

8

Figure 1: Market share and price by milk type, under uniform and non-uniform price

structures.

36.4%

29.7%

16.2%17.7%

29.6%

36.3%

15.5%

18.6%

$2.91 $2.91 $2.91 $2.90

$2.87

$2.73$2.71

$2.60

$2.00

$2.10

$2.20

$2.30

$2.40

$2.50

$2.60

$2.70

$2.80

$2.90

$3.00

10%

15%

20%

25%

30%

35%

40%

45%

50%

Whole milk 2% milk 1% milk Skim milk

Pri

ce p

er

ga

llo

n

Ave

rag

e M

ark

et

Sha

re

Uniform Share NonUniform ShareUniform Price Non-Uniform Price

9

Figure 2: Whole milk market share, by level of whole milk price premium over 2% milk.

Estimates are based on regression results in table S5. Vertical bars show 95% confidence

intervals.

35%

30%

24%

23%

19%

18%

14%

19%

24%

29%

34%

0-1% (Uniform) 1-5% 5-10% 10-15% 15-20% >20%

Wh

ole

m

ilk

mar

ket

sh

are

Percentage price premium of whole milk over 2% milk

10

Figure 3: Whole milk market share by income group, by level of whole milk price

premium over 2% milk. Estimates are based on regression results in table S6. Vertical

bars show 95% confidence intervals.

43%

37%

29%

27%

21%

17%

26%

23%

18%

20%21%

19%

10%

15%

20%

25%

30%

35%

40%

45%

0-1% (Uniform) 1-5% 5-10% 10-15% 15-20% >20%

Wh

ole

mil

k m

ark

et s

har

e

Percentage price premium of whole milk over 2% milk

Low Income

High Income

11

References

1. U.S.Department of Agriculture and U.S. Department of Health and Human Affairs,

Dietary Guidelines for Americans, 7th Edition. (2010).

2. C. Ogden, M.D. Carroll, Prevalence for Overweighy, Obesity and Extreme Obesity

Among Adults: United States, Trends 1976-1980 through 2007-2008. National Center for

Health Statistics, 2010.

3. —. Prevalence of Obesity Among Children and Adolescents: United States, Trends

1963-1965 through 2007-2008, National Center for Health Statistics, 2010.

4.U.S. Department of Health and Human Services, The Surgeon General's Vision for a

Healthy and Fit Nation. Office of the Surgeon General (2010).

5. K. Flegal et al., Excess deaths associated with underweight, overweight, and obesity.

JAMA 293 (2005).

6. E. Finkelstein et al., Annual medical mpending attributable to obesity: payer-and

service-specific estimates. Hlth. Aff. 28 (2009).

7. J. Hill et al., Obesity and the environment: where do we go from here? Science 299,

5608 (2003).

8. E. Battle, K. Brownell, Confronting a rising tide of eating disorders and obesity:

treatment vs. prevention and policy. Addict. Behav. 21 (1996).

9. K. Brownell, Get slim with higher taxes. New York Times, December 15, 1994, p. A29.

10. Waist banned: Does a tax on junk food make sense? The Economist, July 30, 2009.

11. R. Epstein, Let the shoemaker stick to his last: a defense of the 'old' public health.

Persp. Biol. Med. 46 (2003).

12. L. McGranahan, D. Schanzenbach, Who would be affected by soda taxes? Essays on

Issues, Federal Reserve Bank of Chicago 284 (March 2011).

13. L. Powell, F. Chaloupka, Food prices and obesity: evidence and policy implications

for taxes and subsidies. Milbank Q. 87, 1 (2009).

14. H. Chouinard et al., Fat taxes: big money for small change. Forum Health Econ.

Policy 10, 2 (2007).

15. L. Epstein et al., The influence of taxes and subsidies on energy purchased in an

experimental purchasing study. Psychol. Sci. 21 (2010).

16. S. French, Pricing effects on food choices. J. Nutr. 133 (2003).

12

17. S. French et al., Pricing and promotion effects on low-fat vending snack purchases:

the CHIPS study. Am. J. Public Health 91 (2001).

18. K. Brownell et al., The public health and economic benefits of taxing sugar-

sweetened beverages. New Engl. J. Med. 361 (2009).

19. E. Finkelstein et al., Impact of targeted beverage taxes on higher- and lower-income

households. Arch. Intern. Med. 170, 22 (2010).

20. T. Andreyeva, M. Long, K. Brownell, The impact of food prices on consumption: a

systematic review of research on the price elasticity of demand for food. Am. J. Public

Health 100, 2 (2010).

21. F. Kuchler et al., Taxing snack foods: what to expect for diet and tax revenues.

Current Issues in Economics of Food Markets, USDA ERS, No. 747-08 (2004).

22. L. Powell, J. Chriqui, F. Chaloupka, Associations between state-level soda taxes and

adolescent body mass index. J. Adolescent Health 45, 3 (2009).

23. R. Sturm et al., Soda taxes, soft drink consumption, and children's body mass index.

Health Affairs 29, 5 (2010).

24. J. Fletcher, D. Frisvold, N. Tefft, The effects of soft drink taxes on child adolescent

consumption and weight outcomes. J. Public Econ. 94 (2010).

25. R. Chetty, A. Looney, K. Kroft, Salience and taxation: theory and evidence. Am.

Econ. Rev. 99, 4 (2009).

26. B. Bronnenberg, M. Kruger, C. Mela, The IRI marketing data set. Mktg. Sci. 27, 4

(2008).

27. T. Bijmolt, H. vanHeerde, R. Pieters, New empirical generalizations on the

determinants of price elasticity. J. Mktg Res. 42, 2 (2005).

28. C. Ogden et al., Obesity and Socioeconomic Status in Adults: United States, 1988-

1994 and 2005-2008. National Center for Health Statistics (2010).

29. V. Bauerlain, B. McKay, Soda Tax Uncaps a Fight, Wall St. Journal, May 23, 2010.

30. S. Stolberg, Walmart Shifts Strategy to Promote Healthy Foods, New York Times,

January 20, 2011.

31. L. Hao, D. Q. Naiman, "Quantile Regression", SAGE publications (2007).

32. R. Koenker, "Quantile Regression", Cambridge University Press (2005).

13

Supporting Online Material for

Will a Fat Tax Work?

Romana Khan, Kanishka Misra, Vishal Singh

Correspondence to: [email protected]

This file includes:

Materials and Methods

SOM Text

Figs. S1 to S5

Tables S1 to S6

mailto:[email protected]

14

1. Materials and Methods

1.1 Store Level Data

Our analysis uses store level sales data provided by IRI (26). Store sales, price and

promotion information is recorded weekly at the UPC level. The data covers a period of

six years, from 2001 to 2006. There are a total of 1,567 reporting stores from 101 chains,

operating in 47 markets across 39 states. There are 416 counties represented in the data;

the population of these 416 counties accounts for 47% of the total US population.

Demographic Information

The customer base of each store is profiled with an extensive set of demographic

variables. Summary statistics are reported in Table S1. The variables fall under four

categories: (1) Age: percent of households with children under age 17, percent of

population under age 5, percent of population over age 55, percent of households with

more than 5 members (large households); (2) Income: per capita income, median income,

percent of population below the poverty level, percent of population unemployed, percent

of population employed in blue collar jobs; (3) Urban/Suburban: population density; (4)

Ethnicity: percent of population that is white. The summary statistics demonstrate the

broad spectrum of demographic profiles served by the stores reporting in the database.

This is a key factor as it allows us to also relate observed variation in demand across

stores to the underlying demographic characteristics of customers served.

Competition Information

We collected additional data to characterize local competition and cost factors. This

includes the median hourly wage, the total number of grocery retailers located within 5

miles of each store, and the number of discount grocery retailers (e.g. Wal-Mart

Supercenter) located within 10 miles of each store. We use a larger radius to measure

competition from discount stores since they typically have a larger trading area than

regular supermarkets. The competitive data is used to understand the underlying factors

impacting the retailer's pricing decision.

15

1.2 The Milk Category: Products used in the analysis

The category of plain milk contains a large number of UPCs representing various brands,

sizes, packaging, and fat content. The volume-based market shares are:

by fat content: whole - 31.5%, 2% - 34.4%, 1% - 15.7%; skim - 18.3%;

by brand: private label - 80.5%, national brands - 19.5%;

by size: 128oz - 79.3%, 64oz - 20.7%;

by packaging: plastic jug - 94.3%, carton - 5.7%.

Our analysis uses the store level sales and price data of private label plain milk in the 128

oz plastic jug at the four major fat content levels (whole, 2 %, 1% , and skim). These four

products represent 67% of the total volume share of plain milk. The ubiquity and high

market penetration of private label plain milk facilitates comparison across a large

number of stores and demographic profiles. We do not include organic, lactose free, and

other variants as they are a small share of the market.

An important feature of the data is that the relative prices of milk across fat content vary

extensively across stores. In approximately one-third of stores, prices are uniform across

all fat. Prices at the remaining stores span an array of non-uniform structures which share

two key features: (1) Whole milk is either the most or one of the most expensive types

(this fails to hold in only 3.2% of observations) and (2) Prices are generally decreasing

with fat content. Incidentally, decreasing prices with fat content reflect the actual

underlying costs of milk production - since butterfat is the more expensive component,

the cost of milk increases with its fat content. 7

Before analyzing the impact of these

7 This relates to the process by which milk is produced. The fat content of unprocessed

raw milk varies with factors such as the breed of cow, the weather, and the cow's feed.

After raw milk from different sources is combined, the milk fat is separated from the milk

liquid. The components are then reconstituted to produce the standardized milk found at

grocery stores. Whole milk has 3.25% fat, the fat content of 2% and 1% milk are revealed

in their names, and skim milk has between 0 and 0.5% fat. The cost of whole milk is

highest because the value of butterfat exceeds the value of the liquid component.

Although not reported, we have access to monthly city-specific Cooperative wholesale

prices for milk that are obtained from the USDA's Agricultural Marketing Services

(AMS). These data show that the wholesale prices for milk are strictly declining with fat

content.

16

pricing structures on market shares, we first conduct analyses to understand what drives

the retailer's choice between uniform and non-uniform prices.

2. SOM Text

2.1 Factors Accounting for Variation in Price Structure Across Stores

Given the differences in relative prices across retailers, a question to address is what

drives the retailer's decision to offer uniform versus non-uniform prices. The observed

choices could potentially be driven by factors such as underlying demand characteristics,

competition, actions of downstream processors, state regulations, and chain policy. Note

that it will be problematic to directly compare outcomes under uniform and non-uniform

pricing structures if this decision is based on the underlying demand characteristics at the

store level. For example, if stores tend to choose uniform prices where demand for whole

milk is already high, then a comparison between uniform and non-uniform stores will

yield biased inferences. It will attribute differences in demand to the price structure, when

it is the price structure that is determined by underlying demand. If however the

observed price structure is driven by factors exogenous to local conditions, an analysis of

the impact of the prices on demand for whole milk is valid.

We investigate this issue with a series of analyses reported in Table S2. Column (1)

shows the results of a regression of the price ratio of whole to 2% milk on an extensive

set of explanatory variables. The first set of factors are demographics - median income,

percent of households with kids under 17, population density, and percent of population

that is white - which serve as measures of local demand characteristics. We also include

measures of the competitive environment that may impact the pricing decision - median

hourly wage, number of retailers within 5 miles, and number of discount retailers within

10 miles of the store. The impact of state regulations, milk cooperatives or processors,

and chain level policy are captured with state and chain fixed effects. The demographic

and competitive variables are standardized. While the effects of median income and

percent households with kids are statistically significant, they are negligible in the size of

their impact. For example, a price ratio increase by 1%, from 1.00 to 1.01, is associated

with a median income decrease by three standard deviations, or $47,268. Column (2)

reports the results of a variance decomposition to understand the explanatory power of

17

the included variables. Chain fixed effects account for 80% of the explained variation,

with state fixed effects accounting for 19%. The analysis suggests that the included

demographic and competitive measures have a limited role in the pricing decision. In

column (3) we report the results of a logit regression where the dependent variable is a

dummy for when prices are uniform. For the included demographics, the coefficients of

median income and percent of population that is white are significant. However, the

marginal effects for a one standard deviation increase in each are relatively small at 3.5%

and 4.3%. The marginal effects of wage and number of retailers within 5 miles are

smaller, -1.86% and 0.6% respectively. The analysis indicates that chain and state level

factors are the key determinants of the observed pricing policy.

Given the analysis above, we conclude that the choice of uniform and non-uniform

pricing structure is primarily driven by chain policy rather than a store level reaction to

local demand conditions. Our empirical findings were corroborated by interviews with

the dairy managers of 3 retail chains (one nationwide chain, one regional chain, one local

chain). They referred to the practice of pricing similar products uniformly as 'line

pricing', and indicated that the practice is a means of simplifying the price decision when

the products in question are very similar.

2.2 Impact of Price Structure on Whole Milk Market Share

Summary Analysis

As noted above, in approximately one-third of stores in the data we observe uniform

prices for all fat content. Table S3 reports the summary statistics for market shares and

prices at each fat content level under uniform and non-uniform price structures. It is

evident that under uniform pricing the market share of whole milk is significantly higher,

and that under non-uniform pricing the majority of substitution from whole milk is to 2%

milk.

Before discussing the regression analyses, we consider a descriptive 'model free' measure

of the impact of price structure based on store matching. In particular, we match (or pair)

every store that charges uniform prices to the (geographically) closest store that charges

non-uniform prices. The key variable of interest is the difference in whole milk share

18

( ) between the store that charges Uniform prices and the store that charges non-

Uniform prices (

).

We consider two store matching criteria. First, we match stores based on their whole milk

prices; therefore every uniform pricing store is matched with the geographically closest

non-uniform pricing store that charges the same whole milk price. For example, a

uniform pricing store A that charges $3 for both whole milk and 2% milk is matched with

the geographically closest non-uniform pricing store that charges $3 for whole milk and a

price less that $3 (say $2.80) for 2% milk. By construction, the measure (

) provides a model free estimate for the impact of a price “subsidy” for 2%

milk. Second, we repeat this matching based on 2% milk prices. For example, here a

uniform pricing store A that charges $3 for both whole and 2% milk is matched with the

closest non-uniform pricing store that charges $3 for 2% milk and a price greater that $3

(say $3.20) for whole milk. Here, the measure (

) provides a

model free estimate for the impact of a “tax” on whole milk. The results of these two

matching methods are shown in figure S1. The dark bars represent the results from

matching based on whole milk prices, an estimate of the impact of a 'fat subsidy' for 2%

milk. The light bars show the results from the matching based on 2% milk prices, an

estimate of the impact of a 'fat tax' for whole milk. The graphs show that both price

incentives, either a subsidy for 2% milk or a tax on whole milk, result in a lower market

share for whole milk. This reduction is increasing with the price difference in non-

uniform pricing stores.

We extend this matching by adding a criterion that the matched stores must serve the

same income group. Therefore a uniform pricing store serving a low (high) income

community that charges $3 for full fat milk and 2% milk is matched with the closest non-

uniform pricing store also serving a low (high) income community that charges $3 for

full fat milk and a price less that $3 (say $2.80) for 2% milk. The results are shown in

figure S2. Here we find that low income consumers respond more to the price incentives

than high income consumers. Finally, we repeat the store matching with limiting the

allowed distance between stores. Therefore we only consider store matches (pairs) where

the distance between the uniform pricing store and the non-uniform pricing store is less

than some threshold. Figure S3 shows that our results are robust to limiting the distance

19

between stores. These summary statistics provide a preview of the results from the main

regression model discussed below.

Regression Results

Dependent Variable: Logit transformation of whole milk market share

Our focal outcome variable is the market share of whole milk in each store-year. The

average market share of whole milk across all years and stores is 31.5%. Figure S4 plots

the distribution of whole milk market share, and demonstrates the variation in share

across stores. An overall goal is to understand the impact of price in explaining this

variation in market share. In our regressions analysis below, we use the logit

transformation of whole milk share ( ) as the dependent variable:

. This

rescales the market share from the (0,1) interval to the real line, and facilitates the use of

linear regression as the main analysis method.

Demand Model

In table S4 column (1), we report the full results of the model in Table 1 of the main text,

with a complete listing of the demographic controls. The dependent variable is the logit

transformation of whole milk market share. The key explanatory variables are the prices

of whole, 2%, 1% and skim milk. Discussions of the price parameters as well as the

resulting own and cross price elasticity estimates are provided in the main text. The

demographic controls include Median Income, Percent of population under age 5, Percent

of population over age 55, Population Density, and Percent of Population that is White.

The demographics enter as standardized variables and account for differences in

preference for whole milk. We find that the market share of whole milk is lower in areas

with higher income and higher percentage of white population. In terms of age, whole

milk consumption is higher where there are a higher proportion of children under age 5,

and in areas with a higher proportion of people over 55.

Robustness Check: Results Using Instruments for Price

A potential concern often raised in the economics literature is the endogeneity of prices.

This refers to the situation where price levels may be correlated with factors that are

20

unobserved by the analyst, but are known to the retailer in setting the price. If this were

the case, it would bias the price coefficients in our demand model. For example, a local

positive demand shock may cause the retailer to raise prices, which would lead us to

under-estimate the true price sensitivity. To address this concern, we use the farm cost of

milk and chain fixed effects as instruments for price. In Table S4 column(2), we report

the results from estimating the model using a two-stage least squares approach, with

instruments for price. We find that instrumenting for prices increases the own and cross-

price sensitivity parameters. In relative terms, 2% milk continues to be the strongest

substitute for whole milk. There is minimal change in the control demographic parameter

estimates.

Robustness Check: Quantile Regression

There are a few potential concerns with using least squares estimates (LS) from the

analysis presented in Table 1 of the main text. First, least squares estimates can give

additional weight to outliers and these can skew the estimates. Second, LS only provides

estimates in the central location rather than the entire distribution. In other words, a least

squares model assumes a homogeneous response to all independent variables while this

response can be heterogeneous. While our main regression results estimate the average

impact of prices, we cannot claim the effects hold true for the entire distribution. Third,

LS makes parametric assumptions about the error term and the errors could be non-

normal. An alternative to least squares regressions is quantile regression that can account

for each of these (31, 32). The main difference between these two approaches is that LS

estimates parameters at the conditional (conditional on independent variables) mean

whereas quantile regression estimates the parameters at different conditional quantiles

(e.g. median) of the whole milk share distribution. To show our estimates are robust to

these potential problems with a LS model, in figure S5 we plot the estimated coefficients

on whole, 2%, 1% and skim milk prices for all quantiles of the conditional distribution of

whole milk share. There are 4 important takeaways: (1) estimates for the whole milk

price (price elasticity) are negative and significant for the entire distribution, (2) estimates

for 2% milk price are positive (cross price elasticity) and significant for the entire

distribution (3) estimates for 1% milk and skim milk are not significantly different from

21

zero for some parts of the distribution, again suggesting that 2% milk is a closer

substitute to whole milk than 1% or skim milk, (4) the OLS estimates are a good

approximation for the price sensitivity estimates.

Capturing Non-linear Effects

In the analysis above, we evaluated the impact of whole, 2%, 1% and skim milk prices on

the market share of whole milk. However, the response to price differences across the

milk types could be non-linear. In particular, the response to a price premium (or

discount) could vary with the level of discount in a non-linear fashion.

To address this, we estimate a regression of the market share of whole milk on the “price

ratio” of whole to 2% milk. The ratio measures the price premium of whole milk over 2%

milk. We use this ratio since the majority of movement in response to changes in relative

price is between whole and 2% milk (as displayed in the summary analysis and cross

price elasticity). To identify potential non-linearities in response, we create a sequence of

dummy variables to discretize the price ratio. Five dummy variables are defined to

indicate when the price ratio is between: 1.01 and 1.05; 1.05 and 1.10; 1.10 and 1.15;

1.15 and 1.20; and greater than 1.20. The base is where the ratio is between .99 and 1.01,

representing the situation where prices of whole and 2% milk are equal. The model

results are shown in table S5. Figure 2 in the main text plots the model-based market

shares of whole milk as the price ratio between whole milk and 2% milk increases. The

plot shows that as the premium of whole milk increases, the market share of whole milk

falls. The response is non-linear, with a decreasing marginal impact of increases in the

price premium.

In addition to demographic controls, we include demographic interactions with the price

ratio to identify whether response to the price differential between whole and 2% milk

varies with demographic characteristics. In terms of income, as before, consumption of

whole milk is lower in higher income areas. However, the positive coefficient on the

interaction between price ratio and per capita income indicates that as the price ratio

increases, the reduction in market share is higher in lower income areas. This suggests

that lower income groups respond more when the premium of whole milk increases. This

finding motivates the analysis in the following section.

22

Non-Linear Effects by Income Group

Next we investigate how response to the price premium of whole over 2% milk varies

across income levels. To do this, we use interactions between two sets of dummy

variables. The first set is the previously discussed dummy variables based on the price

ratio of whole to 2% milk. The second set of dummy variables indicates the top, middle-

two, and the bottom quartiles of per capita income. The analysis uses interactions

between these two sets of variables to capture how response to the price premium varies

across income groups.

Table S6 presents the results of a regression of whole milk market share on the income

dummies, and the interactions between the income and price ratio dummy variables. Note

that this regression does not have an intercept. The coefficients on the dummies for low,

middle and high income quartiles capture whole milk market share under uniform prices.

The coefficients on the interaction dummies capture how market share for each income

level responds as the price ratio between whole and 2% milk increases. Figure 3 in the

main text plots the regression-based market shares for different levels of income and

price ratio. Under uniform prices, the discrepancy between income groups is large -

whole milk share for lower income exceeds the higher income group by 17%. As the

whole milk premium increases, the share for both income groups falls, but the response is

stronger for lower income, driven by higher price sensitivity for this group. At a premium

of 5-10%, the market share for low income falls from 43% to 29%, while for high income

it falls from 26% to 18%. The discrepancy between income groups continues to fall as the

premium increases, and disappears with a premium of 15-20%. These results provide

strong evidence that policies based on price incentives can be particularly useful in

shifting the purchases of lower income consumers, who are most vulnerable to obesity.

23

Figure S1. The difference whole milk market shares between uniform and non-uniform

stores, by whether there is a subsidy for 2% milk versus a tax on whole milk. Matching-

based estimates are shown for different levels of the tax/subsidy. The dark bars represent

the results from matching stores based on whole milk prices, to estimate the impact of a

'fat subsidy' for 2% milk. The light bars represent the results from matching stores based on 2% milk prices, to estimate the impact of a 'fat tax' on whole milk.

-14%

-12%

-10%

-8%

-6%

-4%

-2%

0%

1%-5% 5%-10% 10%+

Wh

ole

milk

sh

are

in U

NIF

OR

M s

tore

MIN

US

wh

ole

m

ilk s

har

e in

NO

N U

NIF

OR

M s

tore

Difference between whole and 2% milk price in the NON UNIFORM store

UNIFORM and NON UNIFORM have same Whole milk price

UNIFORM and NON UNIFORM have same 2% milk price

24

Figure S2. The difference in whole milk market share between uniform and non-uniform

stores, by income level. Matching-based estimates are shown for stores that serve the

same income group.

-25%

-20%

-15%

-10%

-5%

0%

Low income High income

Wh

ole

milk

sh

are

in U

NIF

OR

M s

tore

MIN

US

wh

ole

m

ilk s

har

e in

NO

N U

NIF

OR

M s

tore

25

Figure S3. The difference in whole milk market share between Uniform and Non-

Uniform stores, by maximum distance between stores. Matching-based estimates are

shown for stores that are no more than 0, 10, 100, and 150 miles apart.

-9%

-8%

-7%

-6%

-5%

-4%

-3%

-2%

-1%

0%

0 (n = 22) 10 (n = 191) 100 (n = 848) 150 (n = 1141)

Wh

ole

milk

sh

are

in U

NIF

OR

M s

tore

MIN

US

wh

ole

milk

sh

are

in N

ON

UN

IFO

RM

sto

re

Distance between (ZIP code centers of) UNIFORM and NON UNIFORM STORES

26

Figure S4. Distribution of whole milk market share across stores.

27

Fig. S5

Response of whole milk market share to price. Using Quantile Regression

Dependent variable is ln(Whole milk market share/1-Whole milk market share)

The black line represents the quantile regression estimates, the grey bars are the 95% confidence

intervals. The solid red line is the OLS estimate and the dotted lines are the 95% confidence

intervals.

28

Table S1.Summary statistics on demographics and competitive environment of stores

Variable Mean Std

Dev

Minimum Maximum

Age:

% households with kids <17 34.8 8.6 1.2 66.4

% population < age 5 6.6 1.3 1.2 12.1

% population > age 55 24.5 5.7 9.4 83.4

%households with >5 members 24.1 8.0 1.1 60.2

Income:

Per capita income 23639 8294 8476 69054

Median income 48140 15756 13971 153259

Poverty rate 10.1 6.4 1.0 45.1

Unemployment rate 5.2 2.8 0.6 30.4

% Blue collar 33.6 10.0 7.8 67.2

% White 71.3 21.4 1.2 99.1

Population density 3406 3938 20 54713

Competitive factors:

All retailers within 5 miles 7.0 5.3 1.0 79.0

Discount retailers within 10

miles

3.6 2.2 0.0 17.0

Hourly wage 18.4 3.6 8.4 29.8

Number of Observations

(stores)

1567

29

Table S2. Factors accounting for the variation in the price ratio of whole to 2% milk.

(1)OLS estimates with standard errors (2) Results of variance decomposition, showing

percentage of explained variance accounted for by each factor. (3) Logit regression.

Dependent Variable: Uniform Dummy (=1)

(2) (3)

Estimate

% of explained variation

accounted for

Intercept 1.10955 6.2322

(0.0186) (21.036)

Median Income -0.00355* 0.52% 0.1583*

(0.0008) (0.061)

% HH Kids 0.00195* 0.22% -0.0854

(0.0007) (0.048)

Pop Density -0.00011 0.00% 0.139

(0.0009) (0.080)

% White -0.00066 0.01% 0.1985*

(0.0009) (0.077)

Wage 0.00128 0.29% -0.0842*

(0.0004) (0.029)

All retailers within 5 miles -0.00013 0.02% 0.0285*

(0.0002) (0.014)

Discount retailers within 10 miles 0.00022 0.01% -0.0375

(0.0003) (0.027)

State Fixed Effects Included 18.69% Included

Chain Fixed Effects Included 80.24% Included

R2 0.541

Number Observations 6835 6835

(1)

Ratio : Price Whole/Price 2%

30

Table S3. Summary statistics of market shares and prices by uniform/non-uniform price structure.

Mean Std Dev Mean Std Dev

Market Share:

Whole 36% 16% 30% 15%

2% 30% 9% 36% 9%

1% 16% 9% 16% 7%

Skim 18% 9% 19% 10%

Price:

Whole $2.91 $0.47 $2.87 $0.40

2% $2.91 $0.47 $2.73 $0.40

1% $2.91 $0.47 $2.71 $0.40

Skim $2.90 $0.47 $2.60 $0.41

No. of Observations 1982 4853

Uniform Non-Uniform

31

Table S4. Response of whole milk market share to price. (1) Shows the Full results for

Table 1 text. (2) Shows the 2SLS parameter estimates using instruments for the prices.

Variables

Parameter Std. Error Parameter Std. Error

Intercept -1.695 0.047 -2.111 0.079

Whole milk price -1.352 0.046 -2.352 0.075

2% milk price 0.959 0.085 1.567 0.148

1% price 0.382 0.060 1.051 0.103

Skim price 0.372 0.052 0.291 0.074

Per capita income -0.249 0.008 -0.245 0.008

% Age<5 0.043 0.010 0.044 0.011

% Age>55 0.123 0.009 0.115 0.010

Population density 0.026 0.008 0.032 0.009

% White -0.412 0.009 -0.384 0.011

Adjusted r-square 0.534 0.572

Number of observations 6835 6835

(1) (2)


32

Table S5. Impact of Price Premium of Whole milk on Whole Milk Share



Intercept -0.611 0.010

Dummy for price whole milk > price 2% milk by:

1-5% -0.233 0.017

5-10% -0.559 0.018

10-15% -0.626 0.027

15-20% -0.808 0.040

20%+ -0.908 0.052

Per capita income -1.914 0.142

% Age<5 -0.089 0.187

%Age>55 0.788 0.174

Population density 0.765 0.235

% White 0.599 0.181

Demographic Interactions with Price Ratio

Per capita income 1.612 0.136

% Age<5 0.126 0.180

%Age>55 -0.630 0.168

Population density -0.722 0.231

% White -0.997 0.174



33

Table S6. Impact of Price Premium of Whole Milk on Whole Milk Share, by Income

Dependent variable: ln(Whole milk market share/1-Whole milk market share)


Low Income (Bottom Quartile PC Income) -0.266 0.020

Middle Income (Middle Quartiles PC Income) -0.617 0.015

High Income (Top Quartile PC Income) -1.044 0.021

Dummy for price whole milk > price 2% milk by: * Low income

1-5% -0.247 0.032

5-10% -0.629 0.034

10-15% -0.749 0.049

15-20% -1.081 0.076

20%+ -1.330 0.091

Dummy for price whole milk > price 2% milk by: * Middle Income

1-5% -0.242 0.025

5-10% -0.555 0.027

10-15% -0.703 0.040

15-20% -0.911 0.055

20%+ -0.975 0.074

Dummy for price whole milk > price 2% milk by: * High Income

1-5% -0.178 0.033

5-10% -0.453 0.037

10-15% -0.338 0.054

15-20% -0.252 0.084

20%+ -0.436 0.120

% Age<5 -0.057 0.189

%Age>55 0.661 0.175

Population density 0.177 0.234

% White 0.334 0.181

Demographic Interactions with Price Ratio

% Age<5 0.077 0.182

%Age>55 -0.531 0.169

Population density -0.175 0.230

% White -0.742 0.175



Documents

Will a Fat Tax Work? Romana Khan , Kanishka Misra , Vishal ...faculty.london.edu/kmisra/assets/documents/Fat_Tax.pdf · 1 Will a Fat Tax Work? Romana Khan1, Kanishka Misra2, Vishal