Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
How Reliably Do Empirical Tests Identify Tax Avoidance?
LISA DE SIMONE, Stanford Graduate School of Business
JORDAN NICKERSON, Boston College
JERI SEIDMAN, University of Virginia – McIntire School of Commerce
BRIDGET STOMBERG, Indiana University
AbstractResearch on the determinants of tax avoidance have relied on tests using GAAP and cash effective tax rates (ETRs) and total and permanent book-tax differences (BTDs). Two new proxies have emerged that overcome documented limitations of these proxies: one, developed by Henry and Sansing (2018), allows for more meaningful interpretation of results estimated in samples that include loss observations. The other, reserves for unrecognized tax benefits (UTB), provides new data on tax uncertainty. We offer empirical evidence on how well tests using these new proxies perform relative to those extensively used in prior research. The paper finds that tests using the proxy developed by Henry and Sansing (2018) have lower power relative to those using other proxies across all samples, including a sample that includes loss observations. In contrast, when firms accrue reserves for uncertain tax avoidance, tests using the current year addition to the UTB have the highest power across all proxies, samples, and levels of reserves. In the absence of reserves, tests using the GAAP ETR best detect uncertain tax avoidance, on average. This study contributes to the literature by using a controlled environment to provide the first large-scale empirical evidence on how the power of tests varies with the use of relatively new proxies, the inclusion of loss observations, and the advent of FIN 48.
Acknowledgements: We thank Linda Bamber, Mary Barth, Shuping Chen, Michelle Hanlon, Amy Hutton, Steve Kachelmeier, William Kinney Jr., David Larcker, Eduardo P. Lotor, Edward Maydew, John McInnis, Lillian Mills, Tom Omer (Editor), Barry Schaudt, Terry Shevlin, two anonymous reviewers, and workshop participants at the University of Texas for valuable comments and suggestions.
Contact author: Jeri Seidman. Physical address: McIntire School of Commerce, University of Virginia, P.O. Box 400173, Charlottesville, VA, 22904. Telephone: 434-294-8976. Fax: 434-924-7074. Email address: [email protected].
How Reliably Do Empirical Tests Identify Tax Avoidance?
Abstract
Research on the determinants of tax avoidance have relied on tests using GAAP and cash effective tax rates (ETRs) and total and permanent book-tax differences (BTDs). Two new proxies have emerged that overcome documented limitations of these proxies: one, developed by Henry and Sansing (2018), allows for more meaningful interpretation of results estimated in samples that include loss observations. The other, reserves for unrecognized tax benefits (UTB), provides new data on tax uncertainty. We offer empirical evidence on how well tests using these new proxies perform relative to those extensively used in prior research. The paper finds that tests using the proxy developed by Henry and Sansing (2018) have lower power relative to those using other proxies across all samples, including a sample that includes loss observations. In contrast, when firms accrue reserves for uncertain tax avoidance, tests using the current year addition to the UTB have the highest power across all proxies, samples, and levels of reserves. In the absence of reserves, tests using the GAAP ETR best detect uncertain tax avoidance, on average. This study contributes to the literature by using a controlled environment to provide the first large-scale empirical evidence on how the power of tests varies with the use of relatively new proxies, the inclusion of loss observations, and the advent of FIN 48.
JEL: H25, H26, M41Keywords: Uncertain Tax Avoidance, Loss Firms, Tax avoidance Simulated data
1. Introduction
We evaluate the relative power of empirical tests of the determinants of tax avoidance.1 A
vast literature exploring the determinants of tax avoidance exists (Wilde and Wilson 2018). Yet,
researchers and shareholders still must infer tax avoidance from financial statements because
firms typically do not disclose the details of their tax strategies. Researchers have thus relied on
GAAP and cash effective tax rates (ETRs) and total and permanent book-tax differences (BTDs)
as proxies for avoidance. However, limitations of these proxies have been documented, including
the confounding effects of financial reporting rules and earnings management, their inability to
capture the uncertainty associated with tax avoidance, and the difficulty of interpreting them for
loss firms (e.g., Blouin 2014; Donohoe, McGill and Outslay 2014; Drake, Hamilton and Lusch
2019; Edwards, Kubata and Shevlin 2018; Hanlon 2003; Henry and Sansing 2018; Schwab,
Stomberg and Xia 2019). Two new proxies have recently emerged that may address some of
these issues. In particular, one developed by Henry and Sansing (2018) better allows for
inclusion of loss observations in a sample. Similarly, reserves for unrecognized tax benefits
(UTB), accrued under FIN 48, provide new data on tax uncertainty. These proxies may enable
researchers to enhance their understanding of tax avoidance. We aim to offer empirical evidence
on how well tests using these new proxies perform relative to those extensively analyzed in prior
research (e.g., Hanlon and Heitzman 2010). We do this by using a controlled environment to
seed a known quantity of tax avoidance in Compustat data and evaluate the relative power of
statistical tests to detect the tax avoidance that we seed.
1 We follow Hanlon and Heitzman (2010, 137) and define tax avoidance as “all transactions that have any effect on the firm’s explicit tax liability.” We focus our analysis on empirical or statistical tests of the determinants of tax avoidance where the empirical model uses a tax avoidance proxy as the dependent variable and includes controls for known economic determinants of tax avoidance. Our view of empirical or statistical tests encompasses the model specification (i.e., independent and dependent variables), sample selection criteria, and skewness correction method (winsorizing, robust regression, etc.)
1
Our methodology resembles those of Dechow, Sloan, and Sweeney (1995) and Dechow,
Hutton, Kim, and Sloan (2012). Those authors seed a known quantity of earnings management
into a random subset of Compustat data and evaluate the relative performance of discretionary
accrual models to detect the earnings management that they seed. We use a similar approach and
evaluate the relative performance of statistical tests to detect the tax avoidance that we seed. The
known quantity of tax avoidance allows testing of how well determinants-of-tax-avoidance
models detect tax avoidance as well as exploration of opportunities to increase the power of
these tests. Using simulated data in a regression framework is necessary to evaluate the relative
power of empirical tests because it allows us to assess a number of different research design
choices (i.e., proxies, samples, etc.), while controlling for known economic determinants of tax
avoidance. Tax avoidance is not easily observable in publicly available data. For example,
because of differences between financial and tax reporting rules, observing an effective tax rate
(ETR) below the statutory tax rate does not necessarily indicate avoidance (e.g., Drake et al.
2019; Schwab et al. 2019).
We separately seed three types of tax avoidance: a permanent tax savings strategy (e.g., a
tax credit), a temporary tax savings strategy (e.g., accelerated tax depreciation), and a hybrid
strategy. Hybrid strategies are those for which the entire cash tax savings is not reflected in some
of the tax expense-based proxies because of financial reporting choices made by the firm. For
example, cross-jurisdictional income shifting is one such strategy because management asserts
that a portion of foreign earnings is permanently reinvested (APB 23). Permanent tax avoidance
strategies that are partially offset by UTBs are another example. We hold median cash taxes
saved as a percentage of pre-tax income constant across all strategies for comparability. For each
specification, we run 10,000 simulated regressions. To assess power—that is, the ability of a test
2
to detect tax avoidance when it exists—we calculate the percentage of simulated regressions that
detect the tax avoidance that we seed. A higher rate of correct detection implies greater power.
We first examine the power of tests, using the proxy developed by Henry and Sansing
(2018) as the dependent variable, relative to tests using ETRs and BTDs as proxies. Because this
new proxy aims to facilitate more meaningful interpretation of results estimated in a sample that
includes loss observations, we assess relative power using three distinct samples: (i) one that
includes all observations required to calculate variables; (ii) one that omits loss observations; and
(iii) another that omits both loss observations and negative values of tax expense or taxes paid.
Across all samples, tests using the Henry and Sansing (2018) proxy as the dependent variable are
not the most powerful. The power of these tests resembles that of tests using BTDs, with tests
using the Henry and Sansing (2018) proxy offering only modest power gains over total book-tax
differences. Further, although one of the purported advantages of the proxy developed by Henry
and Sansing (2018) is that it provides a meaningful calculation of tax avoidance even for
unprofitable firms, tests using it have lower power in a sample that includes loss observations,
relative to tests using other proxies in the same sample. Despite the difficulty interpreting
negative ETRs, tests using the cash ETR have significantly more power to detect all three tax
avoidance strategies in the sample that includes loss observations. Simply put, using the Henry
and Sansing (2018) proxy in a sample that includes loss observations does not significantly
improve power to detect tax avoidance. However, Henry and Sansing (2018) acknowledge that
not all tax avoidance studies should include loss observations because loss firms may have
different incentives to pursue tax avoidance.
The notion of uncertain tax avoidance has recently gained increasing attention from
stakeholders and researchers. We therefore also examine the relative power of tests using proxies
3
for uncertain tax avoidance as the dependent variable. The dependent variables we test are the
ending balance of the UTB and current year additions to the UTB, two commonly-used proxies
for uncertain tax avoidance (e.g., Drake, Goldman and Lusch 2016; Dyreng, Hanlon and
Maydew 2019; Gallemore, Gipper and Maydew 2019). As expected, tests using the UTB proxies
have low power to detect uncertain tax avoidance if firms do not accrue reserves for tax
uncertainty. Tests using the GAAP ETR have the most power, on average, across the three
samples to detect uncertain, permanent tax avoidance for firms not accruing reserves. However,
when firms do reserve for tax uncertainty, tests using current year additions to the UTB have the
highest power across all three samples. In particular, tests using current year additions to the
UTB are always more powerful than those using the ending balance of the UTB. Further, the
power of tests using current year additions to the UTB increases with higher levels of accrued
reserves for tax uncertainty and with the inclusion of loss observations. Researchers aiming to
identify uncertain tax avoidance may be able to improve the power of their tests by retaining loss
observations and using the current year addition to the reserve in lieu of the ending reserve
balance.
This study makes several contributions. First, it adds to the tax avoidance literature by
using a controlled environment to provide the first large-scale empirical evidence on the relative
power of tests using the proxy developed by Henry and Sansing (2018) and UTB proxies. The
paper also provides new insights into how various research design choices affect the power of
tests using GAAP and cash ETRs as well as total and permanent BTDs to detect tax avoidance.
Studies that validate tax avoidance proxies rely on aggressive tax avoidance that is either
disclosed to the tax authority (Lisowsky 2010; Lisowsky, Robinson and Schmidt 2013) or results
in legal action (Wilson 2009). We extend this literature by assessing the relative power of
4
statistical tests using newer proxies, and a range of avoidance strategies that extends beyond tax
shelters. Our results may assist researchers in understanding the strengths and weaknesses of
tests using these proxies and can highlight research design choices that can improve power to
detect tax avoidance. Additionally, the results have implications for interpreting tax avoidance
studies. For example, research that finds a significant result using only a subset of proxies may
be able to use these results to infer the type of tax avoidance in the sample.
Second, we illuminate how the inclusion of loss observations affects power, thus
informing the ongoing debate over this issue in the literature (e.g., Drake et al. 2018; Dyreng,
Hanlon, Maydew and Thornock 2017). This paper finds that the inclusion of loss observations
generally impairs the power of tests using most tax avoidance proxies—even the proxy
developed by Henry and Sansing (2018). The exception is that the power of tests using current
year additions to the UTB increases by roughly 10 percentage points when loss observations are
included in the sample. Thus, the cost of increased noise when including loss observations seems
to outweigh any benefits arising from increased sample size.
Third, the findings here speak to the impact of FIN 48 on tax avoidance research. For
firms unlikely to accrue reserves, tests using UTB proxies offer almost no power to detect
uncertain tax avoidance. This result is highly intuitive yet underscores the importance of
considering managers’ incentives and corporate governance when using UTB proxies to detect
uncertain avoidance. Depending on the percentage of these firms in the sample, inferences could
be affected. To mitigate these concerns, researchers often use multiple proxies in tandem to
strengthen inferences. Yet doing so could lead to mixed results because, in contrast to tests using
UTB proxies, the power of tests using the GAAP ETR and total and permanent BTDs to detect
tax avoidance monotonically decrease as accrued reserves for tax uncertainty increase.
5
Additionally, if firms increased tax reserves in response to FIN 48, as some studies suggest (e.g.,
Audit Analytics 2008; Blouin, Gleason, Mills and Sikes 2007, 2010; Dunbar, Kolbasovsky, and
Phillips 2007; Robinson, Stomberg, and Towery 2016), the accounting standard change could
impact the power of tests using the GAAP ETR and total and permanent BTDs. Researchers
examining a sample period spanning FIN 48 adoption could mitigate this concern by using a
proxy not impacted by changing reserve behavior (e.g., the cash ETR and the Henry and Sansing
(2018) proxy).
2. Related literature
In recent years, several studies exploring the determinants of corporate tax avoidance
have been published. Although their exact definitions of avoidance vary, they share a broad view
of transactions that affect the firm’s explicit tax liability (e.g., Hanlon and Heitzman 2010). Their
notion of tax avoidance includes strategies that generate permanent or temporary tax savings as
well as those that vary in terms of aggressiveness and uncertainty. Permanent strategies create
differences between book and taxable income that never reverse (e.g., tax-exempt municipal
bond interest). In contrast, temporary strategies only defer tax payments from one period to
another (e.g., accelerated tax depreciation). U.S. GAAP rules that govern accounting for income
taxes and managers’ financial reporting discretion can obscure whether tax savings are
permanent or temporary. For example, managers can defer recognition of incremental U.S. tax
on some foreign earnings, thereby recasting a portion of temporary tax savings as permanent.2
Because tax returns are confidential, stakeholders rely on financial statement proxies,
such as ETRs and BTDs, to detect tax avoidance. Studies have addressed the unique strengths
and weaknesses of these proxies (e.g., Guenther 2014; Hanlon 2003; Hanlon and Heitzman 2010;
2 Though the financial reporting incentives to assert indefinite reinvestment of foreign earnings have significantly decreased following recently tax law changes, they have not been eliminated.
6
Henry and Sansing 2018). Hanlon and Heitzman, in particular, (2010, 144) state: “we cannot
overemphasize that not all measures are equally appropriate for every research question …”
Studies also suggest when a particular proxy might be suitable to use in a statistical test of tax
avoidance. For example, Hanlon and Shevlin (2002) and Austin (2014) outline how accounting
for the tax benefits of stock options affects some tax avoidance proxies, such that they can either
overstate or understate firm tax avoidance. More recent studies note how profitability can
hamper ETRs as proxies for avoidance. Drake et al. (2018) note that failing to consider firms’
history of loses biases GAAP ETRs downward when valuation allowances are released. Schwab
et al. (2018) document a clustering of poorly performing firms in the tails of the GAAP ETR
distribution in a sample of firms with positive pre-tax income. The authors examine how not only
valuation allowance releases, but also valuation allowance accruals and nondeductible goodwill
impairments—all of which are common among firms with poor historical or current performance
—can distort GAAP ETRs as a proxy for tax avoidance. Edwards et al. (2018) demonstrate that
the cash ETR can change over time because of changes in pre-tax income, regardless of changes
in the actual level of tax planning or avoidance. The authors conclude that time-series studies on
changes in tax avoidance can over-attribute the decline in cash ETRs to tax planning. Finally,
Henry and Sansing (2018) note how little is known about the tax positions of unprofitable firms
because they are often eliminated from avoidance studies. The authors develop a proxy for
avoidance that facilitates the inclusion of loss firms in the sample and find, in contrast to other
studies, that the average firm is tax-disfavored over the last 20 years.
As taxes have become more salient to regulators, tax authorities, the public, and the
media, the uncertainty associated with tax avoidance has become of greater interest. Researchers
have struggled with how to measure it because, for example, firms generally did not publicly
7
disclose reserves for tax uncertainty prior to 2007. In 2006, the FASB issued ASC 740-10,
Accounting for Uncertainty in Income Taxes, to increase the comparability of accounting for tax
uncertainty. Since then, researchers have used both the level of and changes in these reserves to
explore the determinants and consequences of uncertain tax avoidance (Drake et al. 2016;
Dyreng et al. 2019; Gallemore et al. 2019).
We extend this literature by providing a large-scale empirical evaluation of the relative
power of statistical tests to detect tax avoidance. We focus on assessing the power of new
proxies for tax avoidance—the Henry and Sansing (2018) proxy as well as the level and current-
year change in reserves for unrecognized tax benefits. Even when it is clear that two proxies are
affected by a particular avoidance strategy (e.g., both the cash and GAAP ETRs detect
permanent tax avoidance), this study speaks to the relative power of tests using these proxies. Its
findings can therefore help address questions about whether and to what extent these new proxies
enhance the ability to detect cross-sectional differences in tax avoidance. Mirroring other studies,
the current study exploits samples that include and exclude loss observations and thus speaks
directly to how this important and much-debated research design choice affects the power of
tests to detect tax avoidance.
3. Research Design
Overview
This study examines the power of empirical tests using the proxy developed by Henry
and Sansing (2018) (hereafter HS), the ending balance of the UTB (UTB End), and current year
additions to the UTB (UTB Add) as dependent variables to capture cross-sectional differences in
tax avoidance, relative to tests using the GAAP ETR (GAAP ETR), the cash ETR (Cash ETR),
totals BTD (BTD), and permanent BTDs (Perm Diff) as the dependent variables. Throughout, we
8
use one-year measures for all proxies rather than long-run measures. Thus, all proxies are
calculated annually. HS is taxes paid less 35 percent of pre-tax income, all scaled by beginning
market value of assets.3 UTB End is the ending balance of the reserve for unrecognized tax
benefits scaled by total assets. UTB Add is current year additions to the reserve for unrecognized
tax benefits scaled by total assets. GAAP ETR is total tax expense scaled by pre-tax income.
Cash ETR is taxes paid scaled by pre-tax income. BTD is pre-tax income less estimated taxable
income, all scaled by total assets. We estimate taxable income by grossing up current federal and
current foreign tax expense by the highest statutory U.S. statutory rate during our sample period,
35 percent. Finally, Perm Diff adjusts total book-tax differences (BTD) by removing temporary
book-tax differences. We defined temporary book-tax differences total deferred tax expense
grossed up by 35 percent and scaled by total assets.
The power of a test is its ability to detect a difference in the tax avoidance proxy between
firms avoiding tax and all others, holding constant known economic determinants of avoidance.
Power is a function of multiple factors, including the underlying distribution of the tax avoidance
proxy in the sample and the degree to which control variables included in the model do not
explain variation in the proxy. By evaluating econometric models that include control variables,
a researcher can examine the relative power of regressions designed to account for these other
factors. Power is also affected by the magnitude of tax avoidance (i.e., the dollars of tax saved)
and the proportion of firms avoiding tax in the sample. We use these facts to validate our
methodology, as discussed below.
3 We calculate HS following Henry and Sansing (2018) and compute market value of assets as book value of assets plus market value of equity less stockholders’ equity. Market value of equity is price times shares outstanding, both from Compustat. We reset market value of equity and stockholders’ equity to zero if market value of equity is missing. We calculate BTD and Perm Diff following Frank, Lynch, and Rego (2009), except we scale by contemporaneous, rather than lagged, total assets. We do not reset missing ending balances of the reserve for unrecognized tax benefits or missing values of current year additions to the reserve to zero, following guidance from Lisowsky et al. (2013) and Robinson et al. (2016).
9
Simulated tax avoidance
Simulating tax avoidance strategies
The approach here relates closely to those of Dechow et al. (1995) and Dechow et al.
(2012). Their studies simulate earnings management in Compustat data to evaluate the relative
performance of models of discretionary accruals to detect the seeded earnings management.
Similarly, we use data simulations to study the relative power of tests to identify cross-sectional
differences in corporate tax avoidance. We seed three different strategies in Compustat data: a
permanent one, a temporary one, and a hybrid. We select these three because they differ
regarding whether tax benefits claimed in the current period will reverse in the future as well as
how they interact with financial reporting standards to affect the models we test. To ensure that
relative power is not driven by the magnitude of simulated tax avoidance, we hold cash taxes
saved as a percentage of pre-tax income roughly constant at the median across all three
strategies. After seeding tax avoidance, we winsorize the tax avoidance proxies to control for
outliers. We winsorize HS, BTD, and Perm Diff at 1 percent and 99 percent (e.g., Armstrong,
Blouin, and Larcker 2012; Cheng, Huang, Li, and Stanfield 2012; Higgins, Omer, and Phillips
2015; McGuire, Omer, and Wang 2012). We also winsorize UTB End and UTB Add at 1 percent
and 99 percent, consistent with how most papers address outliers with UTB proxies (e.g.,
Gallemore et al. 2019; Klassen, Lisowsky, and Mescall 2016; Law and Mills 2015; Law and
Mills 2017). We winsorize GAAP ETR and Cash ETR at zero and one (e.g., Armstrong et al.
2012; Badertscher, Katz, and Rego 2013; Brown and Drake 2014; Higgins et al. 2015).
We simulate a permanent tax avoidance strategy by assuming a percentage of selling,
general, and administrative expense is eligible for a credit, rather than a deduction. We expect
models using all proxies to capture this strategy. However, we have no expectation regarding
10
which tests will have the most power to detect the effects of permanent tax avoidance because
this is unknowable ex ante. For example, it is not possible to analytically determine whether tests
using GAAP ETR should better detect permanent tax avoidance than those using HS, UTB End,
UTB Add, Cash ETR, BTD, or Perm Diff because: (i) all proxies are affected by the permanent
strategy; and (ii) the power of a statistical test to detect seeded tax avoidance depends on the
distribution of the dependent and independent variables within the sample used.
We simulate a temporary tax avoidance strategy by assuming that a percentage of capital
expenditures is eligible for an accelerated deduction (e.g., bonus depreciation or cost
segregation). For simplicity, we assume 100 percent of the benefits claimed in year t reverse in
t+1.4 We expect tests using HS, Cash ETR, and BTD to have more power to detect the effects of
the temporary tax avoidance strategy than tests using GAAP ETR and Perm Diff, because GAAP
ETR and Perm Diff reflect only permanent tax avoidance. As such, we expect tests using GAAP
ETR and Perm Diff to detect the effects of temporary tax avoidance in approximately 5 percent
of simulations when we set α=0.05, because this is the rate at which these tests should reject the
null when there is no effect. For reasons outlined above, we can have no expectation about the
relative power of tests using HS, Cash ETR, or BTD to detect the effects of the temporary
strategy.
Finally, we simulate a hybrid strategy, the cash tax savings of which are not entirely
reflected in some tax expense-based proxies because of managers’ financial reporting choices.
We choose cross-jurisdictional income shifting as the hybrid strategy and implement it by
reclassifying a percentage of domestic pre-tax income to foreign pre-tax income taxed at 15
percent. We assume the firm intends to permanently reinvest half of the foreign income abroad
4 Assumptions about when tax benefits reverse affect our seeding because they change pre-seeded values in t+1 for observations we seed with tax avoidance in t. In untabulated tests, we alternatively assume benefits recorded in year t reverse ratably from t+1 to t+3 and find that power is not significantly affected.
11
such that incremental U.S. tax will never be due. Therefore, only half of the immediate cash tax
savings of cross-jurisdictional income shifting is reflected in tax expense; the other half is offset
by an increase in deferred tax expense because the firm records a deferred tax liability for the
foreign earnings it does plan to repatriate. We assume the deferred tax liability recorded in t
reverses in t+1. We seed the hybrid strategy only in observations with nonmissing and
nonnegative pre-tax domestic income to ensure that this strategy increases tax avoidance.
Although cross-jurisdictional income shifting has unique features, we expect the pattern of
results to generalize to other hybrid strategies. We expect tests using HS, GAAP ETR, Cash ETR,
BTD, and Perm Diff to detect the effects of the hybrid strategy but do not predict which test will
have the greatest power. In the Appendix, we provide specific journal entries for the three
strategies described above.
Simulating uncertain tax avoidance
To incorporate tax uncertainty into our analysis, we also seed the permanent tax
avoidance strategy in three samples of observations with nonmissing and nonzero ending
balances of reserves for unrecognized tax benefits accrued in accordance with ASC 740-10
Accounting for Uncertainty in Income Taxes. We focus on the permanent strategy because: (i)
reserves for temporary strategies do not affect total tax expense (Robinson et al. 2016); and,
therefore, would not affect many of the other proxies we test; and (ii) accruing reserves for the
hybrid strategy would require far more assumptions. We then consider the effects of accrued
reserves for tax uncertainty. We accrue either no reserve, a reserve equal to 50 percent of the
claimed tax savings, or a full reserve that offsets the claimed tax savings.5
5 Firms accrue and release reserves for UTBs each year, with releases offsetting the effects of accruals on UTB End; releases do not affect UTB Add. For parsimony, we do not separately simulate the effect of the timing of reserve releases. Our approach encompasses any scenario in which the net effect of the accruals and releases for the year is positive.
12
Seeding the uncertain, permanent strategy without reserves affects GAAP ETR, BTD, and
Perm Diff like a permanent strategy because the full amount of the tax savings are reflected in
these proxies. Seeding the permanent strategy with a full reserve does not change the value of
GAAP ETR, BTD, and Perm Diff because none of the claimed tax savings is reflected. As such,
we expect tests using GAAP ETR, BTD, and Perm Diff to detect fully reserved uncertain tax
avoidance in approximately five percent of simulations when we set α=0.05; this is the rate at
which these tests should reject the null when there is no effect. Finally, seeding the uncertain,
permanent strategy and accruing a nonzero but less-than-full reserve affects GAAP ETR, BTD,
and Perm Diff like a hybrid strategy because the full amount of claimed tax savings is not
reflected in these proxies. Thus, simulating uncertain tax avoidance with a less-than-full reserve
allows an alternative evaluation of the relative power of tests using these proxies to detect the
hybrid strategy. A key advantage of viewing uncertain tax avoidance with partial reserves as a
hybrid strategy is that it holds constant the underlying Compustat accounts (i.e., selling, general
and administrative expense) used to seed the tax avoidance and therefore removes a potential
source of variation when assessing the power of tests using these proxies to detect the hybrid
strategy. The power of tests using both UTB End and UTB Add to detect uncertain tax avoidance
should increase with the amount of reserves. Because neither UTB proxy is affected when
reserves are not recorded, we expect tests using UTB End and UTB Add to detect uncertain tax
avoidance in approximately five percent of simulations when we set α=0.05 and do not accrue
reserves. Finally, the power of tests using HS and Cash ETR to detect uncertain, permanent tax
avoidance is unaffected by the level of reserves. Thus, we expect the power of tests using these
proxies to remain constant, regardless of the level of the reserve we accrue.6 The Appendix
6 We do not expect the power of tests using Cash ETR and HS to approximate alpha. These proxies are affected by permanent tax avoidance, although they are affected similarly whether it is reserved.
13
provides specific journal entries for seeding the permanent strategy with a 50 percent reserve for
uncertain tax avoidance.
Benefits and limitations of using simulated data and evaluating econometric tests
The first part of our research design is the use of simulated data. Because firms need not
publicly describe their tax avoidance strategies and generally do not voluntarily disclose them, it
is not feasible to validate models of tax avoidance using actual tax avoidance data. Other studies
have attempted to validate proxies by correlating them with incidences of known or alleged tax
sheltering, but this method is limited to a subset of the most aggressive strategies. By using
simulated data, we control the nature and the amount of avoidance. This allows us to examine the
relative power of tests to detect the effects of a broader range of tax avoidance.
Intuitively, strategies that save few dollars will be more difficult to detect than those that
save many more. Likewise, when too few firms avoid taxes, the estimation error associated with
their average level of avoidance is too large to differentiate this group from other firms in the
sample. Furthermore, when tax avoidance is pervasive, there is not enough precision in the
control group. Thus, tax avoidance is more difficult to detect when either too many or too few
firms in the sample avoid tax. One cannot assess the power of a test simply by examining how
tax avoidance impacts a proxy. For example, even though HS more intuitively reflects tax
avoidance for loss observations than other proxies, it is unclear how well relatively powerful
tests using HS detect avoidance.
We rely on these intuitions about power to validate our methodology. Specifically, we
vary the median cash tax savings as a percentage of PI and compare the effects of these various
magnitudes of seeded tax avoidance on power. As expected, tests of tax avoidance have more
power when the magnitude of tax avoidance is larger. Similarly, we observe that power increases
with pervasiveness. Obtaining expected results helps validate our simulation methodology.
14
The second part of our research design is examining empirical tests designed to detect tax
avoidance. Thus, instead of using an analytical framework or using simulated tax avoidance data
to estimate univariate changes in tax avoidance proxies, we evaluate the relative power of
statistical tests employed by empirical researchers to detect tax avoidance. Focusing on statistical
tests provides three key benefits. First, we can assess power while holding constant known
economic determinants of tax avoidance. This feature matters because the power of a statistical
test depends on the distributions of the control variables and the tax avoidance proxy chosen as
the dependent variable as well as the correlation between the controls and the proxy. An
analytical framework could not accommodate the effects of these controls on the proxies in a
closed-form. Second, holding constant other research design choices, such as sample selection
and control variables, our approach allows us to compare power across tests that use different
proxies for tax avoidance as the dependent variable. Third, we can quantify the effect of different
sample selection criteria on the researcher’s ability to detect tax avoidance. For these reasons, it
is critical to evaluate the final econometric model, rather than simply calculate the sensitivity of
each proxy to a certain amount of tax avoidance.
Our method of simulating tax avoidance does have limitations. First, tax avoidance
already exists in Compustat data, and any pre-existing avoidance we do not control for is an
omitted variable. However, because we randomly seed additional tax avoidance, this omitted
variable is orthogonal to our coefficient of interest by definition and will not bias inferences
related to the relative power of the tests. This orthogonality also extends to control variables,
such that the incremental tax avoidance we seed does not disrupt the association between pre-
existing avoidance reflected in the proxies and the control variables we include in our
regressions. Second, we simulate each type of tax avoidance separately, thereby assuming firms
15
pursue only one additional strategy at a time. In reality, a firm’s specific incentives may induce
them to try multiple strategies at once or to prioritize one strategy over others. Although our
research design does not explicitly consider these incentives, the fact that we seed each strategy
randomly should not affect results because randomly seeded firms are as likely to have
incentives to pursue any one strategy as unseeded firms.
Third, the expected number of times a firm (or year) is seeded with tax avoidance is a
function of the number of times that firm (or year) is present in a given sample. For example, in
samples that require profitable observations, each firm will not be in the sample every year.
Thus, our seeding might not reflect the relative frequency of tax avoidance across firms or years
in reality. To explore whether our randomized seeding impacts the relative power of tests, we
conduct robustness tests in which we hold the unconditional likelihood of seeding constant but
intentionally seed tax avoidance so that seeded observations are clustered more in some years
relative to others or in more profitable observations. Although clustering seeded observations by
year or profitability does not impact inferences, we cannot rule out possible effects of other
forms of tax avoidance clustering. Finally, some strategies generate nontax costs (e.g., the
nondeductibility of interest expense associated with generating tax-exempt income) or implicit
taxes (e.g., expenditures become more costly when they are eligible for tax preferences, such as
credits or accelerated cost recovery), which could make them less attractive tax planning
opportunities. We attempt to mitigate this concern by separately simulating three strategies that
broadly reflect multiple tax planning opportunities.
Evaluation of econometric tests
We simulate tax avoidance in 15 percent of observations and seed median cash tax
savings of 0.25 percent of pre-tax income. We select this magnitude of cash tax savings based on
16
a review of papers examining the determinants of tax avoidance and believe it is an economically
plausible addition to the avoidance already embedded in Compustat data. We provide additional
detail in support of this magnitude in the Appendix.
To assess the relative power of statistical tests to identify simulated tax avoidance, we
estimate the rate at which we can reject the null when we estimate Equation (1).
TaxProxy = α + β1TaxSeeded + β2TaxSeedable + β3Size + β4Profitability + β5Leverage + β6CapEx + β7Intangibles + β8ForeignSales + β9R&D. (1)
TaxProxy is HS, UTB End, UTB Add, GAAP ETR, Cash ETR, BTD, or Perm Diff computed after
seeding additional tax avoidance in Compustat data. For each specification, we run 10,000
simulations and calculate the percentage of tests for which β1 is statistically significant at the five
percent level.
We set TaxSeeded to one if we seed tax avoidance in that observation and zero otherwise.
This represents the alternative hypothesis that the true value of β1 is not zero. A statistically
significant β1 means the test correctly detects a differential level of tax avoidance for the seeded
sample. A perfectly powerful test will reject the null hypothesis that β1 is zero 100 percent of the
time. We compute power as the percentage of the 10,000 simulations for which the null
hypothesis is rejected.
Given pre-existing tax avoidance in the data, Equation (1) also includes control variables
derived from the literature. We include controls for size (natural log of market capitalization),
profitability (pre-tax return on assets in models using ETR proxies and operating cash flows
scaled by assets in models using the remaining proxies), leverage, capital expenditures as a
percentage of total assets, intangible assets as a percentage of total assets, foreign sales from the
Compustat Segments database as a percentage of total sales, and research and development
expenditures as a percentage of total assets. We winsorize these continuous control variables at
17
1st and 99th percentiles in all specifications. Further, because not all firms report foreign income,
we include an indicator variable (TaxSeedable) in tests of the hybrid strategy. TaxSeedable
equals one if the firm-year has nonmissing, nonzero foreign pre-tax income and ensures we do
not seed the hybrid strategy of cross-jurisdictional income shifting in firms with no foreign
operations. We also interact each control variable with TaxSeedable to account for potential
differences in how the control variables affect the tax avoidance proxies between seedable and
nonseedable observations.
4. Sample composition
Sample selection
Table 1 describes our sample selection. We estimate regressions on Compustat data from
1993 through 2016. We select 1993 as the first year of the sample because it is the effective date
of ASC 740. We select 2016 as the last year of the sample to avoid the transitory financial
reporting effects of the 2017 tax law, which can substantially alter tax expense (Nichols et al.
2018; Seidman and Stomberg 2018). We exclude flow-through entities (e.g., partnerships and
trusts), entities not incorporated in the United States, financial firms (SIC 6000–6999), and
utilities (SIC 4900–4999). We also remove observations with zero or missing pre-tax income,
negative asset values, or negative market capitalization. Finally, we remove observations lacking
data required to calculate tax avoidance proxies and control variables. This sample is 92,754
firm-years (Full Sample). Because HS is intended to facilitate interpretation of results estimated
on a sample that includes loss observations, the Full Sample, which retains loss observations
(PI<0), is important to our analysis.
Our second sample retains only those observations from the Full Sample that also have
positive pre-tax Profit, which we define as both positive pre-tax income and positive pre-tax
18
operating cash flow resulting in a sample of 54,208 firm-years (Profit Sample). We test this
sample along with the Full Sample because it allows us to examine how the exclusion of loss
observations—a common sample selection criterion—affects power (e.g., Bauer 2015; Dyreng et
al. 2015; Hoi, Wu and Zhang 2013; Hoopes, Mescall and Pitman 2012). Our third sample
requires observations from the Full Sample to have both positive pre-tax Profit as well as
nonnegative total tax expense and taxes paid (Profit and Tax ≥ 0 Sample), resulting in a sample
of 49,491 firm-years. We include this third sample because studies often require both the
numerator and denominator of ETR proxies to be positive to facilitate interpretation (e.g., Hope,
Ma, and Thomas 2013; Jennings, Weaver, and Mayew 2012; Kubick, Lockhard, Mills, and
Robinson 2017; and Kubick, Lynch, Mayberry, and Omer 2015).
To examine the relative power of tests using UTB End and UTB Add as the dependent
variable, we construct three analogous samples that require nonmissing and positive values of the
UTB ending balance. The UTB sample of profitable and unprofitable observations (UTB > 0
Sample) contains 16,867 firm-years. The UTB sample of profitable observations (UTB > 0 and
Profit Sample) contains 11,914 firm-years. Finally, the UTB sample of profitable observations
with nonnegative tax expense or taxes paid (UTB > 0, Profit, and Tax ≥0 Sample) contains
10,733 firm-years.
Descriptive statistics
Table 2 presents descriptive statistics for the tax avoidance proxies across all samples.
Variables are winsorized but not yet seeded with tax avoidance. GAAP ETR and Cash ETR are
winsorized at zero and one; all other proxies are winsorized at 1st and 99 percentiles of their
distributions. Because we estimate 10,000 runs of each specification, we do not show descriptive
statistics for seeded data. However, because we randomly seed tax avoidance and then winsorize
19
variables, we expect the descriptive statistics in Table 2 to offer an unbiased representation of the
descriptive statistics for our average seeded sample.7
Recall that we compare tests using HS as the dependent variable to tests using ETR and
BTD proxies in the Full Sample of profitable and unprofitable firms, the Profit Sample, and the
Profit and Tax ≥ 0 Sample that ensures ETR proxies are positive. Panel A presents descriptive
statistics for all tax avoidance proxies other than the UTB proxies across the three samples. By
construction, HS is more positive as pre-tax income declines. Consistent with this, HS is positive
at the mean (0.0190) and median (0.0016) in the Full Sample, which includes unprofitable and
profitable firms, and is negative at the mean and median in both samples in Panel A that exclude
loss observations.
Panel B provides descriptive statistics for the UTB samples. Unlike other proxies, UTB
proxies are not significantly impacted by the inclusion of loss observations in the sample. Mean
and median values of UTB End and UTB Add are generally static across samples, suggesting that
even loss firms engage in uncertain tax avoidance. In the UTB > 0 Sample, the mean (median)
UTB End is 0.0144 (0.0067) and mean (median) UTB Add is 0.0015 (0.0005).
Turning to the ETR and BTD proxies in Panels A and B, the mean and median values of
the ETR proxies generally increase across samples as negative values of pre-tax income and tax
expense or taxes paid are eliminated. This pattern is consistent with negative values of the ETR
proxies (due either to negative pre-tax income or negative tax values) drawing down sample
averages. Similarly, we find positive mean and median values of the BTD proxies once we
eliminate loss observations from the sample. In Panel A, the mean (median) GAAP ETR is 23.7
7 We first seed additional tax avoidance in raw Compustat data and then winsorize. We do this to mimic archival tax research in which researchers observe data inclusive of tax avoidance and then winsorize. Results are not sensitive to this design choice: inferences are unchanged if we instead first winsorize and then seed.
20
(29.0) percent and mean (median) Cash ETR is 17.8 (7.7) percent in the Full Sample. The mean
(median) BTD is -0.2877 (-0.0003) and mean (median) Perm Diff is -0.2861 (0.0004).
The takeaway from Table 2 is that the distribution of the tax avoidance proxies changes
with sample selection, such that power will also vary across samples. However, because power is
affected by the distribution of both the dependent and independent variables in the sample, we
cannot infer from Table 2 which tests will be relatively more powerful.
5. Results for Tests Using HS
Main results
Figure 1 presents results for examining the power of tests using HS as the dependent
variable, relative to tests using the ETR and BTD proxies. We compare power across all samples
and tax avoidance strategies. We measure power using the percentage of simulations that
accurately identify, at statistically significant levels, observations where we have seeded tax
avoidance, holding known determinants of tax avoidance constant. In other words, power is the
percentage of the 10,000 simulations for which we reject the null hypothesis that β1, the
coefficient associated with TaxSeeded, is zero.
Across all samples and tax avoidance strategies, tests using HS as the dependent variable
are neither the most nor least powerful, compared to tests using GAAP ETR, Cash ETR, BTD,
and Perm Diff. 8 Tests using GAAP ETR or Cash ETR as the dependent variable have more power
to detect the effects of the permanent strategy across all samples relative to tests using HS (and
relative to tests using BTD and Perm Diff). Tests using Cash ETR have more power to detect the
8 When comparing power estimates across tests, we apply a rule of thumb that values at least two percentage points apart are significantly different. The variable we use to assess power has a Bernoulli distribution. The confidence interval for each power estimate is very small, because we estimate 10,000 regressions. In untabulated analysis, we test a sample of 20 power values from Table 3 that are within three percentage points of each other. We find the power estimates are significantly different in 100 percent of tests when they are at least two percentage points apart. However, the power estimates are not all significantly different when they are less than two percentage points apart. Thus, we believe using a rule of thumb that allows a reader to conclude power estimates are significantly different when the values are at least two percentage points different is appropriate.
21
temporary strategy across all samples than tests using HS.9 Test using HS also do not have the
most power to detect the hybrid strategy; tests using Cash ETR have more power to detect the
hybrid strategy across all samples and tests using BTD have more power in samples that require
Profit.10 Thus, in a sample that includes loss observations, tests using Cash ETR outperform tests
using HS in detecting the effects of all three tax avoidance strategies we seed, and tests using
GAAP ETR outperform tests using HS to detect the effects of permanent and hybrid tax
avoidance.11 We conclude from this analysis that tests using HS as the dependent variable are
generally not as powerful as tests using ETR proxies, even in samples that include loss
observations.
We next evaluate how the power of tests using HS varies across samples and tax
avoidance strategies. As we note above, tests using HS are not the most powerful for any strategy
in the sample that includes loss observations. Further, tests using HS experience a dramatic
increase in power to detect all strategies in samples that require pre-tax profit versus the Full
Sample. However, they remain some of the less powerful tests in these samples as well. In other
words, despite being intended to facilitate loss observations in the sample, the noise in a sample
that includes losses outweighs any power gains from including loss observations to increase the
sample size. Examining the power of tests using HS across strategies within each sample, we
find these tests have the greatest power to detect the temporary strategy in samples that exclude
9 As expected, tests using GAAP ETR or Perm Diff detect the temporary strategy at a rate very close to α (five percent), providing further validation for our simulation approach.10 Inferences with respect to seeding the hybrid strategy in Profit samples are unchanged if instead of using a Seedable indicator to identify observations with domestic pre-tax income greater than zero, we restrict the entire sample to include only those observations where domestic pre-tax income is greater than zero such that every observation in the sample could be seeded. 11 Because we randomly seed tax avoidance, there is no correlation between losses and TaxSeeded in the sample that includes loss observations. Therefore, the exclusion of a control for losses does not introduce a correlated omitted variable bias for TaxSeeded. Nonetheless, we confirm results are robust to: (i) winsorizing the ETR proxies at the 1st and 99th percentiles to hold constant the number of observations we winsorize across all proxies; (ii) including an indicator variable equal to one if the firm-year is not profitable and zero otherwise; and (iii) including an indicator variable equal to one if the firm-year has a tax net operating loss carryforward, and zero otherwise.
22
loss observations. Tests using HS have similar power to detect all three strategies in the Full
Sample. Table 3 details the results presented in Figure 1.
We acknowledge that researchers use multiple approaches to address the effect of
outliers, such as robust regression (e.g., Leone, Minutti-Meza and Wasley 2019; Powers,
Robinson and Stomberg 2016) or winsorizing to different values (e.g., winsorizing ETRs at one
percent and 99 percent, instead of at zero and one). We ensure inferences are unchanged if we
implement these alternative approaches. Specifically, in untabulated results, we find the relative
power of tests using HS does not change if we use different methods to winsorize GAAP ETR
and Cash ETR, if we use different scalars to calculate ETR proxies, or if we estimate robust
regressions.12
12 Across all samples, we reset GAAP ETR and Cash ETR values less than zero to zero and values greater than one to one. Inferences from Figure 1 are unchanged if we instead winsorize ETR proxies at one percent and 99 percent (as do Rego and Wilson 2012 and Higgins, Omer, and Phillips 2015) or if we scale tax expense or cash taxes paid by operating cash flows (as do McGuire et al. 2012 and Guenther, Krull, and Williams 2014). Results from robust regressions, which are estimated on raw data, also do not change inferences from Figure 1.
23
Implications for future research
24
Our results provide several implications for future research on the determinants of tax
avoidance. First, removing loss observations from the sample generally increases power across
tests using all proxies we examine. This result suggests the additional noise in samples that
include loss observations outweighs any power gains from an increased sample size. This finding
has important implications for sample selection in future studies, particularly in light of recent
research emphasizing the importance of loss firms in tax avoidance research (e.g., Drake et al.
2018; Henry and Sansing 2018; Hanlon and Heitzman 2010).13 Removing observations with
negative values of Tax, in addition to removing loss observations, has a less significant
incremental effect on power.
Second, if a researcher believes it is appropriate to include loss observations, our findings
indicate tests using GAAP ETR and Cash ETR have more power to detect tax avoidance in the
Full Sample than tests using HS. Because ETR proxies lack meaningful economic interpretation
for loss observations, Henry and Sansing (2018) develop HS to overcome these interpretation
issues and state that HS should be most useful in examining cross-sectional variation in tax
avoidance across the entire population of publicly traded firms. Henry and Sansing (2018) also
caution, however, that not all tax avoidance studies should include loss observations because loss
firms may have different incentives to pursue tax avoidance. If researchers find the inclusion of
loss years important to their study — perhaps because they are interested in tax avoidance across
all publicly-traded firms — our results suggest that tests using ETR proxies are more powerful
than tests using HS despite interpretation concerns.
13 Our tests cannot speak to whether loss firms engage in more or less tax avoidance, on average, than profitable firms or whether the determinants of tax avoidance differ between these two groups of firms. Researchers in this area face a trade-off: excluding loss observations may introduce sample selection bias that limits the generalizability of results to samples of only profitable firms, but including them reduces power to detect effects.
25
Third, tests using GAAP ETR and Cash ETR also generally have more power to detect tax
avoidance than tests using HS in samples that exclude loss observations. This finding is an
important consideration for researchers relying exclusively on tests using HS to detect the effects
of tax avoidance in samples that exclude loss observations. Fourth, across all proxies and
samples, we find tests detect the effects of the hybrid strategy, implemented here as cross-
jurisdictional income shifting with a permanently reinvested earnings assertion, with less power
than they detect the effects of the permanent or temporary strategies. Lower power to detect the
hybrid strategy could arise because some proxies do not reflect the temporary portion of the
hybrid strategy, which lowers the power of tests using these proxies. These results validate the
use of alternative proxies specifically designed to detect cross-jurisdictional income shifting
(e.g., Collins, Kemsley, and Lang 1998; Chen, Hepfer, Quinn, and Wilson 2018; De Simone,
Klassen, and Seidman 2019; De Simone, Mills, and Stomberg 2019).
Finally, because the most powerful test varies by tax avoidance strategy, the results in
Table 3 can potentially guide researchers in selecting the appropriate tax avoidance proxy to use.
For example, tests using BTD have the most power to detect the hybrid strategy in samples that
exclude loss observations. Table 3 results also have implications for interpreting results. Finding
that a variable of interest is significantly associated with one avoidance proxy but not others can
help readers isolate the types of tax planning strategies in place. For example, McGuire et al.
(2012) provide evidence (Table 6) that industry-expert financial statement auditors are associated
with lower GAAP and cash ETRs but not with different levels of BTDs. In light of our results,
this pattern could suggest that, when firms purchase tax services from their industry-expert audit
firm, they are more likely to pursue permanent tax avoidance strategies, such as federal and state
26
R&D credits (high-tech industries), tip credits (restaurant industries), or domestic production
manufacturing activities (manufacturing industries), rather than temporary or hybrid strategies.
6. Results for Tests using UTB End and UTB Add
Main results
Table 4 summarizes our results for examining the power of tests using UTB End and
UTB Add as the dependent variables, relative to tests using HS, GAAP ETR, Cash ETR, BTD, and
Perm Diff across all samples and tax avoidance strategies. In the following discussion, we focus
on the case where we seed the permanent tax avoidance strategy with a 50 percent reserve for
uncertain tax avoidance. We find that tests using UTB Add have the highest power of all tax
avoidance proxies across all three samples, ranging from 72 percent to 84 percent versus just
nine percent to 10 percent for tests using UTB End. Tests using UTB End also have lower power
than tests using Cash ETR in the UTB > 0 Sample. Thus, tests using UTB End have the third-
highest power, tied with tests using GAAP ETR and HS in this sample. Further, tests using UTB
End are dominated by tests using HS and Cash ETR in both samples that exclude loss
observations. UTB End reflects multiple years of uncertain tax avoidance, which adds additional
noise that likely contributes to the relatively lower power of tests using UTB End. We confirm
these inferences are unchanged if we estimate robust regressions.
Importantly, the inclusion of unprofitable firms does not impair the power of tests using
UTB End and UTB Add. In fact, the power of tests using UTB End is unaffected by sample
selection choices, and the power of tests using UTB Add is higher in the UTB > 0 Sample than in
the two samples that exclude loss observations. The reduced power of tests using UTB Add in
these other samples is not simply the result of reduced sample size, because the power of tests
using HS and the other proxies is lowest in the largest sample of profitable and unprofitable
27
firms. Examining our samples, we observe loss observations are less likely to have current year
UTBs. Thus, seeding these observations with tax avoidance that increases the current year UTB
from zero to a positive amount increases the likelihood of identifying tax avoidance. Overall, the
pattern of results presented in Table 4 for HS, GAAP ETR, Cash ETR, BTD, and Perm Diff is
generally consistent with those in panel A of Figure 1, though muted, as expected given reduced
sample size.
Next, we consider the effect of varying levels of tax uncertainty by changing the FIN 48
reserve. We assume the permanent strategy requires a FIN 48 reserve equal to zero percent, 50
percent, or 100 percent of the claimed tax benefits. Table 4 also presents results of this test. We
focus our discussion on the UTB > 0 Sample. We present results for all proxies, even though
seeding financial reporting for tax uncertainty does not affect HS or Cash ETR. This allows us to
compare the relative power of tests using these two proxies to detect uncertain tax avoidance to
tests using proxies that are affected by financial accounting for tax uncertainty.
Our first takeaway of tests using UTB proxies is that, for firms that undertake uncertain
tax avoidance but do not accrue reserves to preserve the financial reporting benefits of their
avoidance, tests using the UTB proxies are relatively powerless. In fact, tests using all other
proxies more powerfully detect uncertain, permanent tax avoidance in all three samples than tests
using UTB End or UTB Add seeded without reserves. Tests using GAAP ETR are most powerful
on average across the three samples when we do not accrue reserves. Further, the power of tests
using UTB End and UTB Add increases with the percentage of seeded permanent tax avoidance
reserved. Although highly intuitive, these findings have important implications for research
because the UTB proxies are commonly used to detect uncertain tax avoidance. However,
research suggests there is significant variation in reserves across firms, even when holding
28
constant industry, year, and transaction, some of which could be attributable to financial
reporting incentives (De Simone, Robinson, and Stomberg 2014). To mitigate these concerns,
researchers often consider using multiple proxies in tandem to triangulate results and strengthen
inferences. However, this approach could be problematic because the power of tests using GAAP
ETR, BTD, and Perm Diff to detect tax avoidance monotonically decrease, as accrued reserves
for tax uncertainty increase, while power increases monotonically for tests using UTB End and
UTB Add. The second takeaway of our UTB analyses is the effect of reserves on power is
significantly larger for tests using UTB Add than for those using UTB End. Even with only a 50
percent reserve, tests using UTB Add as the dependent variable are more powerful than tests
using all other proxies, including UTB End.
The UTB proxy results also provide an alternative analysis of the power of tests to detect
a hybrid strategy. Recall that, in our first sets of tests, the accounts we alter to seed tax avoidance
vary across strategies to reflect how companies implement a common type of each strategy (e.g.,
we alter selling, general, and administrative expense to implement the permanent strategy,
consistent with an R&D credit). Focusing on the Profit Sample, we find tests using all proxies
detect the hybrid strategy with the lowest power among the strategies the proxies are designed to
detect (e.g., GAAP ETR and Perm Diff do not detect the effects of temporary tax avoidance).
Because we use different underlying accounts, differences in power in Table 3 may be
attributable to how we seed tax avoidance. In our UTB tests, we seed only the permanent tax
strategy and vary the amount reserved. Therefore, we hold constant the underlying account (e.g.,
selling, general, and administrative expense) that we use for seeding tax avoidance. This feature
of our UTB tests allows us to assess the generalizability of the results related to the hybrid
strategy that are reported in Table 3.
29
Recall also that seeding the permanent strategy without reserves affects GAAP ETR,
BTD, and PermDiff like a permanent strategy, whereas seeding the permanent strategy with less
than full reserves impacts GAAP ETR, BTD, and Perm Diff like a hybrid strategy because only
the nonreserved portion of the permanent tax savings is reflected in these proxies. Thus, we can
compare the relative power of GAAP ETR, BTD, and Perm Diff to detect the effects of the
permanent and hybrid strategies across Tables 3 and 4. We observe similar patterns: tests using
all three proxies have greater power to detect the effects of permanent tax avoidance than to
detect the effects of hybrid tax avoidance. For example, as shown in Table 3, tests using Perm
Diff to detect the effects of the permanent strategy in the Profit Sample are roughly 2.5 times
more powerful than tests using Perm Diff to detect the effects of the hybrid strategy. In the
comparable sample in Table 4, tests using Perm Diff to detect the effects of uncertain tax
avoidance without reserves are roughly twice as powerful as tests using Perm Diff to detect the
effects of uncertain tax avoidance with a 50 percent reserve. We conclude that the relatively low
power of tests using these proxies to detect hybrid tax avoidance in Table 3 is not an artifact of
the particular hybrid strategy we seed (i.e., income shifting with a partial assertion of permanent
reinvestment), but rather arises because the effects of some strategies are not fully reflected in
some proxies. In contrast, Cash ETR and HS are not affected by reserves, so the power of tests
using these proxies is constant within a sample across all levels of reserves.
30
Implications for future research
Based on key findings presented in Table 4 and discussed above, tests using UTB Add
most powerfully detect the effects of uncertain tax avoidance. However, because the power of
UTB proxies to detect uncertain tax avoidance depends critically on firms’ accruing tax reserves
and partitioning the sample, based on managers’ overall financial reporting aggressiveness or
incentives to understate reserves, could strengthen tests. Retaining loss observations in the
sample when investigating uncertain tax avoidance could also improve power. Thus, for
researchers using multiple proxies to test their research question, this suggestion may require
them to maintain two separate samples to increase power—a sample that excludes loss
observations in tests using all other proxies and one that includes loss observations in tests using
UTB proxies.
Our UTB proxy results can also be interpreted as highlighting the potential effect of FIN
48 on the ETR and BTD proxies. Interpreting the tests in this way assumes FIN 48 did not itself
impact the level or type of tax avoidance firms undertake, and we acknowledge there is evidence
suggesting that assumption is invalid (e.g., McClure 2019). The interpretation also depends on
one’s beliefs about the financial reporting of firms before FIN 48. If reserves for tax uncertainty
were lower, on average, at that time (Audit Analytics 2008; Blouin et al. 2007, 2010; Dunbar et
al. 2007; Robinson et al. 2016), then comparing the power of tests when the reserves we seed are
lower to the power when the reserves we seed are higher informs the impact of FIN 48 adoption.
Because the power of tests using GAAP ETR, BTD, and Perm Diff to detect tax avoidance
decreases as the reserve percentage increases, tests using these proxies may have less power to
detect tax avoidance on average after FIN 48. One potential way to mitigate this concern is to
validate inferences from time-series tax avoidance tests that span the adoption of FIN 48 with
31
tests using Cash ETR or HS, because these proxies are unaffected by reserves for tax uncertainty.
7. Conclusion
This study simulates tax avoidance in Compustat data to evaluate the relative power of
econometric tests using new proxies for corporate tax avoidance, compared to those more
extensively analyzed in prior research. We simulate a tax avoidance strategy that yields
permanent tax benefits, one that yields temporary benefits, and one that we consider a hybrid,
because the entire cash tax savings is not reflected in some of the tax expense-based proxies, due
to financial reporting choices made by the firm. Because the purported benefits of the newer
proxies we test include the ability to analyze loss observations or to capture tax uncertainty, we
examine how the relative power of these tests varies across samples and levels of reserves for tax
uncertainty.
Although one of the purported advantages of the proxy developed by Henry and Sansing
(2018) is that it provides a more meaningful interpretation of results for samples that include
unprofitable firms, tests using the proxy have lower power in a sample that includes loss
observations, relative to tests using other proxies in the same sample. Tests using ETR proxies
are three to four times more powerful in these samples. When firms reserve for tax uncertainty,
tests using UTB Add have the highest power—significantly more power than tests using UTB
End or any other tax avoidance proxy—and this power increases with higher levels of reserves
for tax uncertainty and with the inclusion of loss observations. When firms do not accrue for
uncertain tax avoidance, tests using GAAP ETR have the greatest power to detect the effects of
uncertain, permanent tax avoidance on average.
Our results contribute to the tax avoidance literature by providing the first large-scale
empirical analysis of the relative power of HS and UTB proxies. Further, we inform the ongoing
32
debate over the appropriateness of including loss observations in large-sample tests of tax
avoidance. Our results suggest that the noise in tax avoidance proxies for loss observations
outweighs any power gains that result from a larger sample. Finally, our results can speak to the
effect of FIN 48 on the power of tests to detect tax avoidance. Our findings suggest the power of
tests using GAAP ETR, BTD, and PermDiff to detect uncertain tax avoidance may have declined
after FIN 48 if firms increased reserves for tax uncertainty on average after adoption. However,
we acknowledge that inferences related to the effect of FIN 48 on tax avoidance assume that FIN
48 itself did not directly impact the level of tax avoidance.
33
References
Armstrong, C., J. Blouin, and D. Larcker. 2012. Incentives for tax planning. Journal of Accounting and Economics 53: 391-411.
Audit Analytics. 2008. Fin 48 Adoption, Briefing. https://www.auditanalytics.com/0000/custom-reports.php, retrieved March 22, 2019.
Austin, C. 2014. Analysis of stock option tax benefits on the cash effective tax rate. Working paper, University of South Carolina.
Badertscher, B., S. Katz, and S. Rego. 2013. The separation of ownership and control and corporate tax avoidance. Journal of Accounting and Economics 56: 228-50.
Bauer, A. 2015. Tax avoidance and the implications of weak internal controls. Contemporary Accounting Research 33: 449-86.
Blouin, J. 2014. Defining and measuring tax planning aggressiveness. National Tax Journal 67: 875-900.
Blouin, J., C. Gleason, L. Mills, and S. Sikes. 2007. Pre-empting disclosure? Firms’ decisions prior to FIN 48. The Accounting Review 85(3): 791-815.
Blouin, J., C. Gleason, L. Mills, and S. Sikes. 2010. What can we learn about uncertain tax benefits from FIN 48? National Tax Journal 60: 521-35.
Brown, J. 2011. The spread of aggressive corporate tax reporting: A detailed examination of the corporate-owned life insurance shelter. The Accounting Review 86: 23-57.
Brown, J. and K. Drake. 2014. Network ties among low-tax firms. The Accounting Review 89: 483-510.
Chen, C.W., B. Hepfer, P. Quinn, and R. Wilson. 2018. The effect of tax-motivated income shifting on information asymmetry. Review of Accounting Studies 23: 958-1004.
Cheng, C., H. Huang, Y. Li, and J. Stanfield. 2012. The effect of hedge fund activism on corporate tax avoidance. The Accounting Review 87: 1493-526.
Collins, J., D. Kemsley, and M. Lang. 1998. Cross-jurisdictional income shifting and earnings valuation. Journal of Accounting Research 36: 209–29.
De Simone, L., K. Klassen, and J. Seidman. 2017. Unprofitable affiliates and income shifting behavior. The Accounting Review 92: 113-36.
De Simone, L., L. Mills, and B. Stomberg. 2019. Using IRS data to identify income shifting to foreign affiliates. Review of Accounting Studies, forthcoming.
34
Dechow, P., R. Sloan, and A. Sweeney. 1995. Detecting earnings management. The Accounting Review 70: 193-215.
Dechow, P., A. Hutton, J. Kim, and R. Sloan. 2012. Detecting earnings management: A new approach. Journal of Accounting Research 50: 275-334.
De Simone, L., J. Robinson, and B. Stomberg. 2014. Distilling the reserve for uncertain tax positions: the revealing case of black liquor. Review of Accounting Studies 19: 456-72.
Donohoe, M., G. McGill and E. Outslay. 2012. Through a glass darkly: What can we learn about a US multinational corporation’s international operations from its financial statement disclosures? National Tax Journal 65(4): 961-84.
Drake, K., N. Goldman, and S. Lusch. 2016. Do income tax-related deficiencies in publicly disclosed PCAOB Part II reports influence audit client financial reporting of income tax accounts? The Accounting Review 91: 1411-39.
Drake, K., R. Hamilton and S. Lusch. 2018. The sources of declining effective tax rates: Insight from effective tax rate reconciliations. Working paper, University of Arizona and Texas Christian University.
Dunbar, A., L, Kolbasovsky, and J. Phillips. 2007. FIN 48 adoption disclosures. Financial Reporting Watch. October 24, 2007.
Dyreng, S., M. Hanlon, and E. Maydew. 2010. The effects of executives on corporate tax avoidance. The Accounting Review 85: 1163-89.
Dyreng, S., M. Hanlon, and E. Maydew. 2018. When does tax avoidance result in tax uncertainty? The Accounting Review, forthcoming.
Dyreng, S., M. Hanlon, E. Maydew, and J. Thornock. 2017. Changes in corporate tax rates over the past 25 years. Journal of Financial Economics 124: 441-63.
Edwards, A., A. Kubata, and T. Shevlin. 2018. The decreasing trend in cash effective tax rates. Working paper, University of Toronto, University of Muenster, and University of California-Irvine.
Frank, M., L. Lynch, and S. Rego. 2009. Tax reporting aggressiveness and its relation to aggressive financial reporting. The Accounting Review 83: 467-96.
Gallemore, J., B. Gipper, and E. Maydew. 2018. Banks as tax planning intermediaries. Journal of Accounting Research 57: 169-209.
35
Gallemore, J. and E. Labro. 2015. The importance of the internal information environment for tax avoidance. Journal of Accounting and Economics 60: 149-67
Guenther, D. A. 2014. Measuring corporate tax avoidance: Effective tax rates and book-tax differences. Working paper, University of Oregon.
Guenther, D., L. Krull, and B. Williams. 2014. Are tax aggressive firms just inflating earnings? Working paper, University of Oregon and Indiana University.
Hanlon, M. 2003. What can we infer about a firm’s taxable income from its financial statements? National Tax Journal 56: 831-63.
Hanlon, M. and S. Heitzman. 2010. A review of tax research. Journal of Accounting and Economics 50: 127-78.
Hanlon, M. and T. Shevlin. 2002. Accounting for tax benefits of employee stock options and implications for research. Accounting Horizons 16: 1-16.
Henry, E. and R. Sansing. 2018. Corporate tax avoidance: Data truncation and loss firms. Review of Accounting Studies 23: 1042-70.
Higgins, D., T. Omer, and J. Phillips. 2015. The influence of a firm’s business strategy on its tax aggressiveness. Contemporary Accounting Research 32: 674-702.
Hoi, C., Q. Wu, and H. Zhang. 2013. Is corporate social responsibility (CSR) associated with tax avoidance? Evidence from Irresponsible CSR activities. The Accounting Review 88: 2025-59.
Hoopes, J., D. Mescall, and J. Pittman. 2012. Do IRS audits deter corporate tax avoidance? The Accounting Review 87: 1603-39.
Hope, O., M. Ma, and W. Thomas. 2013. Tax avoidance and geographic earnings disclosure. Journal of Accounting and Economics 56: 170-89.
Jennings, R., C. Weaver, and W. Mayew. 2013. The extent of implicit taxes at the corporate level and the effect of TRA86. Contemporary Accounting Research 29: 1021-59.
Klassen, K., P. Lisowsky, and D. Mescall. 2016. The role of auditors, non-auditors, and internal tax departments in corporate tax aggressiveness. The Accounting Review 91: 179-205.
Kubick, T. R., G. B. Lockhart, L. F. Mills, and J. R. Robinson. 2017. IRS and corporate taxpayer effects of geographic proximity. Journal of Accounting and Economics 63(2-3): 428-53.
Kubick, T., D. Lynch, M. Mayberry, and T. Omer. 2016. The effects of regulatory scrutiny of tax avoidance: An examination of SEC comment letters. The Accounting Review 91: 1751-80.
Law, K. and L. Mills. 2015. Taxes and financial constraints: Evidence from linguistic cues.
36
Journal of Accounting Research 53: 777-819.
Law, K. and L. Mills. 2017. Military experience and corporate tax avoidance. Review of Accounting Studies 22: 141-84.
Leone, A., M. Minutti-Meza, and C. Wasley. 2019. Influential observations and inference in accounting research. The Accounting Review, forthcoming.
Lisowsky, P. 2010. Seeking shelter: Empirically modeling tax shelters using financial statement information. The Accounting Review 85: 1693-720.
Lisowsky, P., L. Robinson, and A. Schmidt. 2013. Do publicly disclosed tax reserves tell us about privately disclosed tax shelter activity? Journal of Accounting Research 51: 583-629.
Lyon, J., B. Barber, and C. Tsai. 1999. Improved methods for tests of long-run abnormal stock returns. The Journal of Finance 53: 165-201.
McClure, C. 2019. Determinants of tax avoidance. Working paper, University of Chicago.
McGuire, S., T. Omer, and D. Wang. 2012. Tax avoidance: Does tax-specific industry expertise make a difference? The Accounting Review 87: 975-1003.
McGuire, S., D. Wang, and R. Wilson. 2014. Dual class ownership and tax avoidance. The Accounting Review 89: 1487-516.
Nichols, N., A. Duxbury, and I. Scott. 2018. Analysis of the TCJA’s impact on year-end financial statements. Tax Notes June 25, 2018.
Powers, K., J. Robinson, and B. Stomberg. 2016. How do CEO incentives affect corporate tax planning and financial reporting of income taxes? Review of Accounting Studies 21: 672-710.
Rego, S. and R. Wilson. 2012. Equity risk incentives and corporate tax avoidance. Journal of Accounting Research 50: 775-810.
Robinson, J., S. Sikes, and C. Weaver. 2010. Performance measurement of corporate tax departments. The Accounting Review 85: 1035-64.
Robinson, L., B. Stomberg, and E. Towery. 2016. One size does not fit all: How the uniform rules of FIN 48 affect the relevance on income tax accounting. The Accounting Review 91: 1195-217.
Schwab, C., B. Stomberg, and J. Xia. 2019. When do GAAP effective tax rates capture something other than tax avoidance? Working paper, Indiana University.
37
Seidman, J. and B. Stomberg. 2018. How the corporate rate cut will affect profits. Tax Notes February 5, 2018.
Stock, K. 2013. Tax shelters don’t always work. Just ask Amazon. Bloomberg Technology 21 May 2013. https://www.bloomberg.com/news/articles/2013-05-21/tax-shelters-don-t-always-work-dot-just-ask-amazon, retrieved September 19, 2019.
Wilde, J. and R. Wilson. 2018. Perspectives on corporate tax avoidance: Observations from the past decade. Journal of the American Taxation Association 40: 63-81.
Wilson, R. 2009. An examination of corporate tax shelter participants. The Accounting Review 84: 969-99.
38
Appendix – Additional detail on seeding tax avoidance in Compustat data
This section provides details on our methodology. We begin with a more detailed
discussion of how we simulate tax avoidance in Compustat data, including the specific
adjustments we make to underlying accounts to seed tax avoidance. We then discuss more details
of certain aspects of the method we use to assess power.
Detail of simulating tax avoidance
We seed a level of tax avoidance that generates cash tax savings equal to approximately
0.25 percent of pre-tax income at the median. We select this magnitude of cash tax savings based
on a review of papers examining the determinants of tax avoidance and believe it is an
economically plausible addition to the tax avoidance already embedded in Compustat data.
McGuire, Omer, and Wang (2012) estimate GAAP ETRs of firms with tax expert auditors are
one percent lower than firms without tax expert auditors. Robinson, Sikes, and Weaver (2010)
estimate a 4.3 percentage point difference in ETRs for firms whose tax departments are classified
as profit centers. Brown (2011) estimates that the average tax shelter firm in her sample saved
three percent of pre-tax income from this relatively uncertain type of tax avoidance. We believe
the savings from tax avoidance we simulate should be much lower than these savings for two
reasons. First, Compustat data already includes some level of tax avoidance and the point of our
study is to evaluate the power of statistical tests to detect seeded tax avoidance. The median cash
ETR in unadjusted Compustat data for our primary sample is 25 percent, suggesting tax savings
equal to 14 percent of pre-tax income versus a combined U.S. rate of 39 percent. Seeding an
additional 0.25 percent of pre-tax income increases the tax savings of the firm by 1.8 percent,
which we believe is economically significant. Second, given these studies estimate the effect of
certain firm characteristics on all types of tax avoidance while we seed only one type of tax
39
avoidance at a time, we believe the estimates in these studies represent an upper bound of the
magnitude of additional tax avoidance we should seed.
Below, we outline the journal entries required to seed each tax avoidance strategy,
showing the strategy’s effect on the most granular tax accounts in Compustat. We acknowledge
that all journal entries in this Appendix affect multiple accounts and adjusted those accounts in
our data accordingly. For example, a decrease to current federal tax expense also decreases total
tax expense (TXT), net income (NI) and stockholders’ equity (SEQ).
1) Permanent Strategy
To seed a strategy that generates permanent tax savings, we assume the firm shifts a
percentage of selling, general, and administrative expense (XSGA) from a deduction to a credit.
For example, a firm could elect to claim an R&D credit instead of deducting R&D or a FICA tip
credit instead of deducting payroll taxes. We therefore seed an additional benefit equal to 65
percent of a portion of XSGA to represent an increase in benefit for the change from one dollar
of deduction to one dollar of credit. We set the portion of XSGA shifted to yield additional cash
tax savings approximately equal to 0.25 percent of pre-tax income (PI) at the median, which
requires us to assume approximately 0.15 percent of XSGA is converted from a deduction to a
credit. We summarize the journal entry as follows:
Taxes paid (TXPD) (0.15% × XSGA) × 65%Current federal tax expense (TXFED) (0.15% × XSGA) × 65%
2) Temporary Strategy
To seed a strategy that generates temporary tax savings, we assume the firm accelerates
deductions related to a percentage of capital expenditures (CAPX). For example, a firm could
elect bonus depreciation or conduct a cost segregation study that accelerates tax depreciation
deductions for certain components of real property. We set the portion of CAPX deducted to
40
yield additional cash tax savings approximately equal to 0.25 percent of PI at the median, which
requires us to assume an additional 1.52 percent of CAPX is deductible in the current year. We
summarize the journal entry as follows:
Taxes paid (TXPD) (1.52% × CAPX) × 35%Current federal tax expense (TXFED) (1.52% × CAPX) × 35%
Deferred federal tax expense (TXDFED) (1.52% × CAPX) × 35%Deferred tax liability (TXNDBL) (1.52% × CAPX*35%)
3) Hybrid Strategy
We model our seeding of a strategy that can yield both permanent and temporary tax
benefits from cross-jurisdictional income shifting. We shift a percentage of pre-tax domestic
income (PIDOM) to pre-tax foreign income (PIFO). We assume an advantageous foreign tax rate
on the shifted income of 15 percent. To achieve median cash tax savings approximately equal to
0.25 percent of PI, we shift 1.88 percent of PIDOM to PIFO. We require positive PIDOM, which
ensures this strategy decreases current federal tax expense and increases current foreign tax
expense.
In many cases, foreign source income is not subject to U.S. taxation until it is repatriated.
In accordance with APB 23, firms can assert that they intend to permanently reinvest U.S. tax-
deferred foreign-source income and avoid accruing the incremental U.S. tax due upon
repatriation. We assume the firm asserts that 50 percent of the income we shift is permanently
reinvested. We summarize the journal entry as follows:
Taxes paid (TXPD) (1.88% × PIDOM) × 20%Current foreign tax expense (TXFO) (1.88% × PIDOM) × 15%
Current federal tax expense (TXFED) (1.88% × PIDOM) × 35%
Deferred federal tax expense (TXDFED) (1.88% × PIDOM) × 10%Deferred tax liability (TXNDBL) (1.88% × PIDOM) × 10%
4) Uncertain tax avoidance
41
To examine the effects of uncertain tax avoidance, we seed the permanent tax avoidance
strategy as above and then layer on the financial reporting effects of accruing reserves for
uncertain tax avoidance. We seed multiple levels of reserves, ranging from zero to 100 percent of
claimed tax benefits. Below we present the incremental adjustment made after adjusting for the
effects of permanent tax avoidance, assuming a 50 percent reserve, which means the other 50
percent of the benefit is reflected in tax expense. We summarize this journal entry as:
Current federal tax expense (TXFED) (0.15% × XSGA) × 65% × 50%FIN 48 reserve (TXTUBPOSINC) (0.15% × XSGA) × 65% ×
50%
Assessing power
We apply the seeding methodology to 10,000 independent draws of 15 percent of
observations for each specification we test. We calculate the percentage of tests for which β1
from the following equation is statistically significant.
TaxProxy = α + β1TaxSeeded +β2TaxSeedable +ΣβiControls (1)
TaxProxy is HS, UTB End, UTB Add, GAAP ETR, Cash ETR, BTD, or Perm Diff
computed after seeding additional tax avoidance in Compustat data. TaxSeeded equals one for all
observations (15 percent of each independent sample) seeded with tax avoidance, and zero
otherwise. This variable represents the alternative hypothesis that the true value of β1 is not zero
(i.e., that the observation has not been seeded with tax avoidance). A statistically significant β1
means the test correctly detects a differential level of tax avoidance for the seeded sample. A
perfectly powerful test will reject the null hypothesis that β1 is zero 100 percent of the time. We
compute power as the percentage of the 10,000 simulations for which the null hypothesis that β1
is zero is rejected.
42
Because we hold the amount of cash taxes saved constant at the median across all three
strategies, one might expect power to be the same across all strategies for tests using dependent
variables that are only affected by changes to TXPD (i.e., Cash ETR and HS). However, because
we seed the strategies using different underlying accounts, the amount of seeded tax savings
varies by strategy for the same observation. If we had chosen to seed all three strategies using the
same underlying account (e.g., if the permanent strategy was income shifting with no indefinite
reinvestment assertion, the hybrid strategy was income shifting with a partial indefinite
reinvestment assertion, and the temporary strategy was income shifting with a full indefinite
reinvestment assertion), then power would be identical across strategies. Results for Cash ETR
and HS in Table 4 confirm this: we see identical power across different levels of tax reserves for
tests using these proxies that are unaffected by reserves for tax uncertainty. We chose our
approach so we could evaluate how well tests detect examples of different strategies.
A well-specified test will reject the null hypothesis that there is no effect of tax avoidance
on the dependent variable when the null is true at a frequency equal to the significance level. In
each simulation, we randomly seed tax avoidance in 15 percent of observations. We then
randomly set TaxSeeded to one for 15 percent of observations independent of the actual seeding.
This represents the null hypothesis that TaxSeeded is uncorrelated with seeded tax avoidance.
Because we randomly assign TaxSeeded, it is also uncorrelated with any pre-existing tax
avoidance in the Compustat data. A statistically significant β1 at a given significance level, α,
reflects a type I error in which the null hypothesis that the coefficient β1 equals zero is rejected in
favor of the alternative when the null is true. In our tests which use a one-tailed five percent
significance level (α = 0.05) (negative β1 for HS, GAAP ETR, Cash ETR and positive for UTB
End, UTB Add, BTD and PermDiff) and assuming a binomial distribution, a well-specified test
43
will indicate a positive and significant estimate of β1 between 4.444 percent and 5.556 percent of
the time. Whether a test is well specified is a function of the underlying distribution of the tax
avoidance proxy and the explanatory variables. Skewness in the dependent variable can cause
misspecification because unless explanatory variables sufficiently explain the skewness, the
resulting error term will also be skewed thereby violating the normality assumption underlying
tests of significance for coefficients in OLS regressions. In contrast, this same skewness does not
violate any assumptions and is less concerning when the skewed variable is an explanatory
variable.
Because we can assess power only in well-specified tests, if a test is misspecified when
estimated using OLS, we report power estimated using the empirical p method following Lyon,
Barber and Tsai (1999). The empirical p method uses Monte Carlo methods to approximate the
distribution of an OLS regression coefficient. That is, rather than attempt to minimize skewness
in the sample (as winsorizing does) the empirical p method generates a distribution of
coefficients using random samples that remain skewed. This distribution of coefficients is then
compared to the actual coefficient to generate an empirical p-value. The empirical p method
eliminates misspecification by design; tests employing empirical p will always be well specified.
It is appropriate to compare the power of a well-specified test using OLS with the power of a test
using the empirical p method. We note in the tables any instances where we assess power using
empirical p.
Our methodology is similar to Dechow et al. (1995), who adopt a two-step approach
whereby proxies for discretionary accruals are the residuals from various prediction models for
total accruals. These proxies then become the dependent variables in a second-stage regression.
We base our methodology on a single-stage regression, consistent with how researchers test for
44
tax avoidance. That is, we include control variables in a multivariate regression in addition to an
indicator of tax avoidance. The two approaches yield identical inferences when standard errors
are adjusted for heteroscedasticity in the two-stage approach.14
14 We confirm this in untabulated analysis by repeating a subset of our simulations using the two-step approach in Dechow et al. (1995). We estimate each tax avoidance proxy as a function of controls and capture the residual. This unexpected tax avoidance becomes Tax Proxy in the following regression: TaxProxy= α + β1TaxSeeded + β2TaxSeedable. Results are perfectly correlated with results generated using our single-stage methodology.
45
TABLE 1Sample selection
Observations from Compustat 1993-2016
289,029 Less:
Flow-through entities (4,8
26) Entities not incorporated in the U.S. (62,078) Financial firms and utilities (36,335) Observations with zero or missing PI (38,093) Observations where AT<0 or market capitalization <0 (20,074) Observations with missing TXT or TXPD (28,815) Observations with missing lagged assets (5,807) Observations with missing data to calculate control variables (247)
Full Sample 92,
754
UTB > 0 Sample: Observations with TXTUBEND > 0 16,
867
Full Sample 92,
754
Observations with negative pre-tax income (PI) or pre-tax operating cash flow (OANCF + TXPD) (38,54
6)
Profit Sample: Observations with pre-tax Profit 54,
208
UTB > 0 and Profit Sample: Observations with TXTUBEND > 0 11,
914
Profit Sample 54,
208
Observations with negative tax expense (TXT) or taxes paid (TXPD) (4,71
7)Profit and Tax ≥ 0 Sample: Observations with pre-tax Profit and non-negative Tax 49,
491 UTB > 0, Profit, and Tax ≥ 0 Sample: Observations with TXTUBEND > 0 10,
733 Table 1 outlines sample selection criteria. Variables in capitalized letters refer to Compustat variable names. Flow-through entities are those with company names containing “Trust”, “LP”, or “Partners”. Country of incorporation is determined using foreign incorporation code (FIC). Financial firms are firms with SIC codes in 6000-6999 and utilities are firms with SIC codes in 4900-4999. Market capitalization is PRCC_F × CSHO. Observations have pre-tax Profit if both pre-tax income (PI) and pre-tax operating cash flows (OANCF + TXPD) are greater than zero. Observations have nonnegative Tax if both TXPD and TXT are greater than or equal to zero.
46
TABLE 2Descriptive statistics
Panel A: Samples for HS tests Sample Full Profit Profit and Tax ≥ 0
Proxy Mean Median Min Max Mean Median Min Max Mean Median Min MaxHS 0.0190 0.0016 -0.0894 0.4264 -0.0077 -0.0048 -0.1058 0.0348 -0.0067 -0.0042 -0.1000 0.0360
GAAP ETR 0.2372 0.2903 0.0000 1.0000 0.3233 0.3546 0.0000 1.0000 0.3423 0.3600 0.0000 1.0000Cash ETR 0.1778 0.0769 0.0000 1.0000 0.2716 0.2534 0.0000 1.0000 0.2856 0.2688 0.0000 1.0000
BTD -0.2877 -0.0003 -11.161 0.2395 0.0283 0.0196 -0.0943 0.2567 0.0261 0.0186 -0.0969 0.2418Perm Diff -0.2861 0.0004 -11.122 0.3280 0.0226 0.0089 -0.0806 0.3513 0.0154 0.0081 -0.0843 0.1975
Panel B: Samples for UTB tests
Sample UTB > 0 UTB > 0 and Profit UTB > 0, Profit, and Tax ≥ 0
Proxy Mean Median Min Max Mean Median Min Max Mean Median Min MaxUTB End 0.0144 0.0067 0.0001 0.1574 0.0112 0.0062 0.0001 0.0828 0.0109 0.0061 0.0001 0.0783UTB Add 0.0015 0.0005 0.0000 0.0167 0.0014 0.0005 0.0000 0.0138 0.0014 0.0005 0.0000 0.0131
HS 0.0067 -0.0012 -0.0537 0.1727 -0.0058 -0.0046 -0.0570 0.0240 -0.0049 -0.0042 -0.0475 0.0246GAAP ETR 0.2580 0.2882 0.0000 1.0000 0.3080 0.3239 0.0000 1.0000 0.3301 0.3317 0.0000 1.0000Cash ETR 0.2081 0.1711 0.0000 1.0000 0.2751 0.2523 0.0000 1.0000 0.2865 0.2642 0.0000 1.0000
BTD -0.0293 0.0087 -0.8784 0.1784 0.0260 0.0213 -0.0884 0.1823 0.0245 0.0207 -0.0896 0.1708Perm Diff -0.0275 0.0059 -0.8802 0.3095 0.0235 0.0128 -0.0860 0.3279 0.0158 0.0116 -0.0914 0.1319
Table 2 provides descriptive statistics for tax avoidance proxies. In Panel A, the Full Sample contains Compustat observations from 1993-2016 with non-missing data required to calculate tax avoidance proxies and control variables. We further eliminate flow-through entities, entities not incorporated in the U.S., utilities (SIC 4900-4999), and financial firms (SIC 6000-6999). The two sub-samples eliminate observations 1) with Profit less than or equal to zero, where Profit is pre-tax income (PI) and pre-tax operating cash flow (OANCF + TXPD), and 2) with Tax less than zero, where Tax is tax expense (TXT) and taxes paid (TXPD). In panel B, the samples are analagous to those in panel A but also require a positive ending balance in unrecognized tax benefits (TXTUBEND > 0). HS is (TXPD - .35 × PI)/(lag MVA) where MVA = AT + (MVE - SEQ) and MVE = PRCC_F × CSHO. GAAP ETR is TXT/PI. Cash ETR is TXPD/PI. BTD is (PI - (TXFED + TXFO)/.35)/AT. Perm Diff is (PI - (TXFED + TXFO + TXDI)/0.35)) /AT. UTB End is TXTUBEND/AT. UTB Add is TXTUBPOSINC/AT. We winsorize HS, BTD and Perm Diff, UTB End and UTB Add at one and 99 percent and winsorize GAAP ETR and Cash ETR at zero and one.
47
TABLE 3Summary of power in HS tests across multiple proxies, strategies and samples
Sample
ProxyFull Profit Profit and Tax ≥ 0
PERM HYBRID TEMP PERM HYBRID TEMP PERM HYBRID TEMPHS 6 7 7 22 22 30 22 23 31
GAAP ETR 24 14 5 77 17 5 81 19 5Cash ETR 26 23 32 47 28 63 46 27 62
BTD 5 10 7 36 37 47 36 38 47Perm Diff 5 6 5 31 14 4 54 22 4
Table 3 shows the percentage of simulations (out of 10,000) that correctly identifies seeded tax avoidance for various strategy-proxy pairs. Journal entries showing the effects of seeded tax avoidance for the three strategies – a permanent strategy (PERM), a temporary strategy (TEMP) and a hybrid strategy (HYBRID) – are in the Appendix. We seed tax avoidance such that median cash tax savings from each strategy equal 0.25 percent of pre-tax income (PI). We seed tax avoidance in 15 percent of observations from each of three samples. The Full Sample contains Compustat observations from 1993-2016 with non-missing data required to calculate tax avoidance proxies and control variables. We further eliminate flow-through entities, entities not incorporated in the U.S., utilities (SIC 4900-4999), and financial firms (SIC 6000-6999). The two sub-samples eliminate observations 1) with Profit less than zero, where Profit is pre-tax income (PI) and pre-tax operating cash flow (OANCF + TXPD), and 2) with Tax less than or equal to zero, where Tax is tax expense (TXT) and taxes paid (TXPD). Tests identify seeded tax avoidance using the following regression: TaxProxy = α + β1TaxSeeded + β2TaxSeedable + ΣβiControls. TaxProxy is either HS (TXPD - 0.35 × PI)/(lag MVA) where MVA = AT + (MVE - SEQ) and MVE = PRCC_F × CSHO, GAAP ETR (TXT/PI), Cash ETR (TXPD/PI), BTD (PI - (TXFED + TXFO)/0.35)/AT), or Perm Diff (PI - (TXFED + TXFO + TXDI)/0.35))/AT). TaxProxy is computed after we seed additional tax avoidance in Compustat data. Controls are: size (natural log of MVE); profitability (pre-tax return on assets (PI/AT) in models using ETR proxies and operating cash flows scaled by assets (CFO/AT) in models using other proxies); leverage (LT/AT); capital expenditures (CAPX/AT); intangible assets (INTAN/AT); foreign sales from the Compustat Segments database as a percentage of total sales (SALE); and research and development expenditures (XRD/AT). These continuous control variables are winsorized at one and 99 percent in all specifications. Further, in tests of the hybrid strategy, we include the interaction of each control variable with TaxSeedable to account for differences between seedable and non-seedable observations. We winsorize HS, BTD and Perm Diff at one and 99 percent and GAAP ETR and Cash ETR at zero and one. All tests in Table 3 are well-specified using OLS.
48
TABLE 4Summary of power in tests of tax uncertainty across proxies, samples, and levels of reserves
Sample
Proxy
UTB > 0 UTB > 0 and Profit UTB > 0, Profit and Tax ≥ 0
no reserve
50% reserve
full reserve
no reserve
50% reserve
full reserve
no reserve
50% reserve
full reserve
UTB End 5 9 15 5 10 17 5 10 77UTB Add 5 84 100 4 72 100 4 72 100
HS 8 8 8 13 13 13 13 13 13GAAP ETR 12 8 5 22 11 5 24 11 5Cash ETR 14 14 14 19 19 19 19 19 19
BTD 10 7 5 16 9 5 16 9 5Perm Diff 9 7 5 13 7 5 22 11 5
Table 4 shows the percentage of simulations (out of 10,000) that correctly identifies seeded tax avoidance for various strategy-proxy pairs. Journal entries showing the effects of seeded permanent and uncertain tax avoidance are in the Appendix. We seed permanent tax avoidance such that median cash tax savings equals 0.25 percent of pre-tax income (PI). We then create a reserve equal to 0 percent, 50 percent, or 100 percent of the claimed benefits to include the effect of tax uncertainty. We seed permanent tax avoidance in 15 percent of observations from each of three samples. The UTB > 0 Sample contains Compustat observations with a positive ending balance in unrecognized tax benefits (TXTUBEND > 0) and non-missing data to calculate required variables. We further eliminate flow-through entities, entities not incorporated in the U.S., utilities (SIC 4900-4999), and financial firms (SIC 6000-6999). The two sub-samples eliminate observations 1) with Profit less than or equal to zero, where Profit is pre-tax income (PI) and pre-tax operating cash flow (OANCF +TXPD), and 2) with Tax less than zero, where Tax is tax expense (TXT) and taxes paid (TXPD). Tests identify seeded tax avoidance using the following regression: TaxProxy = α+ β1TaxSeeded + β2TaxSeedable + ΣβiControls. TaxProxy is either UTB End (TXTUBEND/AT), UTB Add (TXTUBPOSINC/AT), HS (TXPD - 0.35*PI)/(lag MVA) where MVA = AT + (MVE - SEQ) and MVE = PRCC_F × CSHO, GAAP ETR (TXT/PI), Cash ETR (TXPD/PI), BTD (PI - (TXFED + TXFO)/0.35)/AT), or Perm Diff (PI - (TXFED + TXFO + TXDI)/0.35))/AT). TaxProxy is computed after we seed additional tax avoidance in Compustat data. Controls are: size (natural log of MVE); profitability (pre-tax return on assets (PI/AT) in models using ETR proxies and operating cash flows scaled by assets (CFO/AT) in models using other proxies); leverage (LT/AT); capital expenditures (CAPX/AT); intangible assets (INTAN/AT); foreign sales from the Compustat Segments database as a percentage of total sales (SALE); and research and development expenditures (XRD/AT). These continuous control variables are winsorized at one and 99 percent in all specifications. We winsorize UTB End, UTB Add, HS, BTD and Perm Diff at one and 99 percent and GAAP ETR and Cash ETR at zero and one. Percentages in bold indicate tests that are not well-specified using OLS so we evaluate power using the empirical p method.
49
Figure 1: Relative power of tests using HS across three samples
Panel A: Permanent strategy Panel B: Temporary strategy
0%10%20%30%40%50%60%70%80%90%
100%
Full Profit Profit and Tax ≥ 0
HS GAAP ETR Cash ETR BTD Perm Diff
0%10%20%30%40%50%60%70%80%90%
100%
Full Profit Profit and Tax ≥ 0
HS GAAP ETR Cash ETR BTD Perm Diff
Panel C: Hybrid strategy
0%10%20%30%40%50%60%70%80%90%
100%
Full Profit Profit and Tax ≥ 0
HS GAAP ETR Cash ETR BTD Perm Diff
Figure 1 graphs the results in Table 3. All details and variable definitions are in the notes to Table 3.
50