Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Estimating the Effect of Incarceration on Recidivismand Employment: An RD Approach
Shawn D. BushwayCatalina FrancoDavid J. HardingJeffrey Morenoff∗
Abstract
This paper analyzes how incarcerating convicted offenders affects their future recidi-vism and employment outcomes. Using data on felony offenders in Michigan sentencedbetween 2003 and 2006 and outcomes collected from various administrative sources,we exploit discontinuities in the structure of the Michigan Sentencing Guidelines asan exogenous source of sentence type assignment. By comparing observations on ei-ther side of the cut-points generated by prior criminal record scores in the SentencingGuidelines, we provide evidence on the causal effect of incarceration on recidivism andemployment for low-level offenders who are on the sanction margin. Preliminary resultsshow that there is a sizable incapacitation effect for offenders sentenced to prison inrecidivism one and three years after sentence and in employment in the fourth quarterafter sentence. This suggests that incarcerated offenders are less likely to recidivate orbe employed as a result of being incarcerated. There is, however, no difference in theoutcomes of prisoners and probationers when the relevant risk period of analysis startsafter release.
∗Bushway: Rockefeller College of Public Affairs & Policy, University at Albany (SUNY) (e-mail: [email protected]);Franco: Economics, University of Michigan (e-mail:[email protected]); Harding: Sociology, University of California, Berkeley(e-mail: [email protected]); Morenoff: Sociology, Population Studies Center, and Survey Research Center, University ofMichigan (e-mail: [email protected]). This project is funded by NSF, NICHD, PSC, and UM Office of Research. We arethankful to Anh Nguyen, Charley Chilcote, and Paulette Hatchett for all their help with assembling and cleaning the data, andJeff Smith for useful feedback. We would like to acknowledge the support from the Eunice Kennedy Shriver National Instituteof Child Health and Human Development (1R21HD060160 01A1), and the National Science Foundation (SES 1061018).
1
1 Introduction
The dramatic increase in the number of people incarcerated in the United States over the
last three decades has generated a discussion among policy makers, criminal justice offi-
cials, researchers, and citizens about the causes and consequences of mass incarceration
and ways of reducing the size of the nation’s prison population without compromising
public safety. One of the central questions in this discussion is whether sentencing a
convicted felon to prison - at considerably higher cost than alternative sentences such
as probation - will reduce the likelihood that the person will reoffend in the future.1 A
related question is whether serving time in prison reduces one’s employability and lowers
wages after prison.
Despite the centrality of these questions to scholarly and policy debates, studies on the
economic and social consequences of incarceration often base their inferences on nebulous
counterfactual comparisons and usually fail to adequately rule out competing explana-
tions for the putative effects of incarceration they estimate, leading some observers to
conclude that existing research is not nearly sufficient for making firm evidence-based
conclusions for either science or public policy (Nagin, Cullen, and Jonson 2009). In re-
cent years, a new wave of studies have used quasi-experimental designs that leverage the
random assignment of judges to felony cases to estimate the effects of incarceration on
measures of subsequent recidivism, employment, and earnings (Abrams 2009; Aizer and
Doyle 2013; Berube and Green 2007; Green and Winik 2007; Kling 2006; Loeffler 2013;
Mueller-Smith 2014).
The current paper contributes to this literature by implementing a novel quasi-
experimental design to estimate the effect of being sentenced to prison compared to
probation on the probability of prospectively being (a) convicted of a new felony offense
1Criminologists use the term specific deterrence to refer to this idea the punishment given for a priorcrime may deter a person from committing a future crime. This is distinguished from general deterrence,which refers to the idea that the threat of punishment may deter people in the general population fromcommitting crime, regardless of their specific experience with prior crime and punishment. Each of thesequestions has spawned its own criminological literature, but in this paper we focus primarily on studies ofspecific deterrence.
2
and (b) employed, using data on a sample of convicted felony offenders in the state of
Michigan sentenced between 2003 and 2006. The design emerges from the structure of
the sentencing guidelines and capitalizes on discontinuities in the probability of being
sentenced to prison based on the official system that is used for scoring and classifying
convicted offenders in pre-sentence investigation reports, as dictated by the Michigan
Sentencing Guidelines.
Empirically, the discontinuities in the probabilities of receiving a prison sentence can
be analyzed under a fuzzy regression discontinuity (RD) design framework. Given the
resemblance between the fuzzy RD and the instrumental variables (IV) estimators, we
propose two IV approaches to provide the causal effect of incarceration on recidivism
and employment. The first approach pools together the multiple cutoffs that are present
in the data to construct a single cutoff which is used to instrument the prison sentence
indicator. The second approach uses the individual cutoffs as multiple instruments in
the IV regression. The IV results are complemented by reduced-form regressions that
give insights into the heterogeneity of the treatment effects that are behind the summary
effects estimated by the IV analysis.
The rest of this document is organized as follows: Section 2 provides a brief theo-
retical motivation and discussion of previous studies, section 3 presents the details of
the Michigan sentencing guidelines in which our research design in based, and section
4 discusses the data sources and presents descriptive statistics of our analytical sample.
The last two sections discuss the results and conclude.
2 Theoretical motivation and prior research
Our research questions derive their significance from well-developed bodies of theory and
research linking incarceration to future behavior, and so we review some of the key dis-
tinctions between different perspectives.
Contemporary criminological accounts emphasize three general mechanisms through
3
which incarceration can reduce the likelihood that a person will reoffend in the future:
(1) incapacitation, (2) rehabilitation, and (3) specific deterrence. Although being in-
carcerated has a clear mechanical effect on suppressing crime for the duration of one’s
custodial sentence, the effectiveness and efficiency of incapacitation as a crime-control
strategy are open to question. One limitation on its effectiveness is that by removing
offenders from the community, incarceration may create criminal opportunities for new
offenders through so-called replacement effects (Miles and Ludwig 2007). Incapacitation
is also a financially costly way to control crime, with a bed in a state prison or local
jail costing an average of roughly $26,000 per year, compared to average expenditures of
$2,800 per parolee and $1,300 per probationer (Schmitt, Kris Warner, and Gupta 2010).
It is very difficult to disentangle the incapacitative effects of prison from its behavioral
effects, which could operate through rehabilitation, specific deterrence, or other mech-
anisms. Finally, the magnitude of any incapacitation effects depends on the criminal
behavior of the comparison group, those who are not sentenced to prison but rather re-
main in the community.
After the decline in the support for rehabilitation as the guiding philosophy of the
American penitentiary system (Bushway and Paternoster 2010; Cullen 2002), there has
been a resurgence of interest in and support for rehabilitation in recent decades brought
about by new research on corrections programs (Cullen 2005; Cullen 2002). A general
conclusion of this research is that there are successful programs that curb recidivism,
but their effectiveness hinges on the way they are matched to the needs of individual
offenders and the extent to which they maintain program integrity (Bushway and Pa-
ternoster 2010; Cullen 2002). Some scholars also argue that exposure to programs and
interventions that are inappropriately matched to an offender’s needs especially those
that violate the risk principle by exposing low-risk offenders to excessive interventions
can have criminogenic effects (Nagin, Cullen, and Jonson 2009).
Specific deterrence is the primary theoretical framework used to motivate studies of
the effects of incarceration on reoffending. It refers to the possibility that an offender
will be less likely to engage in future criminal activity after being punished for a previous
4
crime (Bushway and Paternoster 2010; Nagin, Cullen, and Jonson 2009). Its focus on
deterring people who have already been punished for previous crimes distinguishes it
from the notion of general deterrence, which refers to the broader deterrent effects that
punishments may have on members of society, regardless of their specific experience with
crime and punishment.
Despite the emphasis placed on the three mechanisms outlined above in the litera-
ture, other theoretical perspectives have also been brought to bear on the question of
how incarceration affects subsequent behavior. First, prisons and jails can have labeling
effects that can operate through stigma (and social reaction to the label) or through
transformation of one’s identity (the internalization of the label), and both of these can
be reinforced through interactions inside and outside of prison (Nagin, Cullen, and Jon-
son 2009). Labeling is often evoked as one of the main reasons that former prisoners
have trouble finding jobs (Pager 2009). Also, insufficient opportunities for education and
job training in prisons, along with the atrophy of job skills one brings to prison and
lost job experience can all be viewed from a human capital perspective as reasons why
returning prisoners may have more difficulty (re)connecting with the labor market than
probationers (Kling 2006; Loeffler 2010; Tyler and Kling 2007). Prisons and jails are
viewed by social learning theorists as sites where procriminal attitudes, values, skills,
and roles can be transmitted through informal interactions. To the extent that prisoners
acquire procriminal skills and experience human capital deficits that make it harder for
them to find jobs in the formal labor market, they may face more strain and differential
opportunities that make crime more accessible and profitable than legal forms of work.
Finally, incarceration especially imprisonment can deplete the social capital that one
can access after prison (Loeffler 2010). The combination of time and distance away from
home can make it difficult to stay connected to relatives and friends, especially weak ties
that can be especially useful for finding jobs. In the parlance of control theory, prison
may diminish one’s social bonds, to family, friends, work, and engagement in other forms
of social organization.
In terms of prior research, a small set of studies have utilized quasi-experimental or
5
experimental designs to study the effects of incarceration on recidivism and employment.
In a pair of studies using data from the Superior Court of the District of Columbia
(Berube and Green 2007; Green and Winik 2010) use randomly assigned judges as in-
struments and find no statistically significant relationship between incarceration and
reoffending. Abrams (2009) also uses an instrumental variables design in a comparison of
recidivism among prisoners and probationers in Clark County, Nevada, but in his case,
the instruments come from the random assignment of public defenders and his treatment
is sentence length (rather than sentence type). He finds that there is a relationship be-
tween sentence length and recidivism but it is complex and non-monotonic negative for
both the shortest/weakest and longest sentences, and positive for mid-range sentences.
Nagin and Snodgrass (2013) use the random assignment of judges to felony defendants
in Pennsylvania who were sentenced during 1999 to estimate the effects of incarcera-
tion (compared to non-custodial sanctions) on recidivism. Aizer and Doyle (2013) use
the random assignment of judges to defendants in a juvenile court in Chicago, Illinois
between 1991 and 2006 to study the effects of juvenile incarceration on high school com-
pletion and incarceration in adult facilities later in life. Finally, Mueller-Smith (2014)
uses the random assignment of “courtrooms” (combinations of judges and prosecutors)
to misdemeanor and felony defendants sentenced in Harris County, Texas between 1980
and 2009, to estimate the effects of sentence type and length on recidivism, employment,
wages, take-up of food stamps, marriage, and divorce.
In a study of sentences in Washington juvenile courts, Hjalmarsson (2009) used a re-
gression discontinuity design that capitalizes on large discrepancies between neighboring
cells of the sentencing grid in terms of the probability of being incarcerated in a state
detention facility (for 15-36 weeks) vs. a “local sanction (which could include combina-
tions of time served at a local detention center, community supervision, and community
service) and found that incarceration reduced future offending by 35%. Although similar
in spirit to our regression discontinuity design, this study had a very different substan-
tive focus the juvenile justice system where the treatments are qualitatively different.
Manski and Nagin (1998) also studied the effect of a juvenile offender being sentenced
to confinement in a residential treatment facility vs. nonresidential treatment on recidi-
6
vism in Utah. Their seminal contribution was to illustrate the importance of studying
(and ideally modeling) the behavioral patterns of human actors (e.g., judges) who make
decisions regarding treatment assignments. They demonstrate how bringing different
behavioral models about judicial decision-making to bear on the identification of causal
effects can be much more informative than the conventional approach of assuming noth-
ing about how judges act. They also show how assumptions about the behavioral model
driving selection can lead to very different conclusions about the causal effect and urge
that researchers focus on better understanding the behaviors of the causal agents who
make the treatment assignments.
Studies of incarceration and employment also find that offenders who have been in-
carcerated or served longer sentences tend to have weaker labor force attachment after
prison (Bushway and Reuter 2002; Bushway and Paternoster 2010; Nagin, Cullen, and
Jonson 2009; Wakefield and Uggen 2010). However, some studies have also shown that
prisoners experience temporary employment increases after their release relative to their
employment status just before going to prison, which may be partially attributable to
the effects of post-prison parole supervision and reentry programs (Bloom et al. 2007;
Kling 2006; Loeffler 2010; Pettit and Lyons 2007; Tyler and Kling 2007). Similar results
were found in two quasi-experimental studies. Kling (2006) used data from the Florida
state and California federal court systems, using judges as instruments to estimate the
effect of sentence length on subsequent employment and earnings. He found that in
the short-term, longer lengths of incarceration are associated with more positive labor
market outcomes, while there is no effect in the medium term. Loeffler (2010) analyzed
employment data on felony cases from the Circuit Court of Cook County, Illinois and
likewise found that incarceration was associated with temporarily higher employment.
He also conducted a non-experimental analysis of data on wages and job tenure and
found that the temporary increase in employment among the incarcerated group was not
accompanied by wage gains or tenure increases, as one might expect if the employment
gains were the result of improvements to human or social capital. Instead, he concludes
that the evidence is more consistent with the benefits that returning prisoners may get
from reentry programs, especially when compared to their employment outcomes just
7
before going to prison (as he does in a difference-in-difference analysis).
As a whole, this small group of studies using quasi-experimental designs to analyze
the impacts of incarceration on employment and recidivism yield several general conclu-
sions. First, most of the quasi-experimental studies of the adult criminal justice system
found no significant effects of either sentence type (e.g., incarceration vs. a non-custodial
sanction) or length on recidivism outcomes. The only exception was the Mueller-Smith
(2014) study, one of the few to separate the effects of incapacitation (comparing individ-
uals currently in prison or jail to those who received non-custodial sanctions) from the
longer-run effects of incarceration after the incarcerated group has been released back
to the community. This study found that (a) incarceration was negatively associated
with recidivism when currently incarcerated individuals were compared to those released
to the community on non-custodial sanctions (i.e., an incapacitation effect), but (b) in-
carceration was positively associated with recidivism when both groups were compared
post-release. The two studies of the juvenile justice system produced discrepant results.
Using the identification strategy based on random judge assignment, Aizer and Doyle
(2013) found that juvenile incarceration increased the likelihood of recidivism, defined
as future incarceration as an adult. Hjalmarsson (2008), however, found that juvenile
incarceration was associated with lower probability of future incarceration as a juvenile.
As a recent summary of this literature by the National Research Council (2014)
noted, in the larger literature on the effects of incarceration on labor market outcomes,
there are many observational studies of survey data suggesting that being incarcer-
ated weakens ones future employment opportunities and earnings. The results from
the quasi-experimental literature are more equivocal. Neither Kling (2006) nor Loeffler
(2013) found significant associations between incarceration and employment or wages,
but Mueller-Smith (2014) found that incarceration reduces post-incarceration employ-
ment and wages.
8
3 Michigan Sentencing Guidelines
The sentencing guidelines manual contains recommendations in terms of type of sen-
tence and sentence length that judges use in the sentencing process. With the exception
of offenses for which there is no sentencing discretion,2 the sentencing guidelines describe
in detail the recommended sentences and sentence lengths for an offender based on the
current offense, prior criminal history, and type of crime. The current version of the
Michigan sentencing guidelines applies to felonies committed on or after January 1, 1999.
The guidelines are indeterminate in that they (a) provide a range of minimum sen-
tences within each cell for judges to choose from and (b) present recommended rather
than mandatory minimum sentences (Deming, 2000).3 Because the sentencing guidelines
are only recommendations, judges are free to depart from the recommended range,4 but
departures are relatively rare, occurring in less than 2 percent of the cases analyzed in
this sample. The guidelines divide offenses into nine classes based on their severity as
defined by the maximum term of imprisonment allowed by statute for the offense (classes
A-H, with A being the most severe, H the least severe, and class M reserved for second-
degree murder). Each class has its own sentencing grid, with cells divided according
to scores on two measures, the offender prior record (PR) and offense severity (OS),
which are each computed as sums of scores on component measures. There are seven
components to the PR score and 20 components to the OS score.5 The total PR scores
are divided into seven intervals to generate the prior record variable (PRV) level. The
PRV cut points are the same for all grids. The OS scores are also divided in intervals
which determine the offense severity variable (OSV) level. The number of OSV levels
2Examples of felonies excluded from the guidelines are first degree murder, which carries a mandatorylife sentence, or felony firearm, which carries a mandatory two-year flat sentence (sentence to prison for aminimum of 2 years and maximum of 2 years).
3Maximum sentences are set by statute in Michigan.4Judges must justify any departure in writing and are precluded from basing departures on any in-
formation already taken into account in the guidelines or on race, gender, ethnicity, nationality, religion,employment, or similar factors.
5Our understanding is that many other states have a more discrete sentencing guidelines system forclassifying offenders based on prior record. Our application of the regression discontinuity method in thisstudy depends on the fairly continuous nature of the prior record variables in Michigan.
9
and the cut points defining them are not the same across grids. Each cell defined by the
intersection of PRV and OSV levels contains a range of possible minimum sentences in
months. In the example grid (see appendix) the lowest minimum sentence (in months) is
the large number on the left of the cell while the four numbers on the right of the cell are
the highest minimum sentence lengths in months. These four subdivisions correspond to
the offender’s habitual status for offenders with prior felony records (Michigan Judicial
Institute, 2010), and their function is basically to increase the upper limit of the mini-
mum sentence of the appropriate cell by a fixed percentage.
Judges are responsible for guideline score calculations, but in practice this work is
done as part of the pre-sentence investigation and sentencing information report that is
provided to the judge by the Michigan Department of Corrections and typically prepared
by an MDOC probation officer. 6 The officer relies on police reports, interviews with
victims, and criminal history searches to calculate the continuous prior record (PR) and
offense severity (OS) scores and to determine the offender’s habitual status. The proba-
tion officer is also the person who typically places the offender in a cell on the relevant
grid based on the calculated guidelines scores. Our conversations with probation officers
suggest that judges rarely request that scores be recalculated.
There is the possibility of manipulation in assigning points to the components of the
scores, but we consider this far more likely for offense severity scores than for prior record
scores. Offense severity scores include potentially subjective aspects of the crime, such
as whether there was psychological injury to a victim or a victim’s family member or
whether a firearm was discharged in the direction of a victim, whereas prior record scores
include objective characteristics of the offender’s prior criminal history, such as whether
the offender was on parole or probation at the time of the offense and how many prior
misdemeanors, low severity felonies, or high severity felonies the offender had been con-
victed of in the past (with severity defined by the exact crime of the prior conviction).
6Michigan is somewhat unique compared to other states in that the Department of Corrections handleprobation supervision of all offenders sentenced to felony probation. Offenders sentenced to jail or jailfollowed by probation for a felony also appear in MDOC records because MDOC conducts all pre-sentenceinvestigations for all circuit courts throughout the state.
10
For this reason, we focus on variation in sentence type generated by prior record scores,
as described below.
Another potential source of manipulation is the plea bargaining process, as prose-
cutors and defense attorneys are well aware of the details of the sentencing guidelines
system. In our analytic sample, 97% of convictions occurred through a plea bargain (as
opposed to a bench or jury trial). If a prosecutor were to base plea agreements on the
exact grid cell that the individual would be placed in and on her expectations of the
probability of recidivism (or employment) from the likely sentence in that cell, then such
manipulation would be a threat to the validity of our regression discontinuity design.
However, our conversation with the probation officers who prepare pre-sentence investi-
gations and sentencing information reports for judges lead us to doubt that such extreme
and intentional manipulation is occurring. First, the cases in our analytical sample are
typical cases that are processed very quickly, leaving little time and attention for such
careful calculation or concern. Second, we believe that most plea bargaining occurs over
the exact crime the offender will plead guilty to, and therefore which crime severity grid
will govern his or her sentencing. Our analysis only makes comparisons within sentencing
grids.
For our purposes, a key aspect of the sentencing guidelines is that cells on most
grids (classes B-G) are divided into three categories based on the types of sentences
recommended: (1) Intermediate cells, including jail, probation and other (rarely used)
sentences like fines, drug treatment, or house arrest; (2) straddle cells, in which any type
of sentence is possible, and (3) prison only cells. 7 In the example grid in the appendix,
intermediate cells are marked with asterisks, straddle cells are shaded, and prison cells are
unmarked. As we will explain in section 5, our research design exploits the discontinuous
7Grids M and A contain only prison cells. Grid H contains intermediate and straddle cells but no prisoncells. Intermediate cells have ranges in which the upper recommended limit is 18 months or less. Whenoffenders in intermediate cells are sentenced to jail, their jail term can be 0-12 months (or zero to thestatutory maximum if the statutory maximum is less than 12 months). Straddle cells have ranges in whichthe lower limit of the range of the minimum sentence is 12 months or less and the upper limit is at least 18months. When offenders in straddle cells are sentenced to jail, their jail terms can be 0-12 months.
11
jump in the probability of going to prison when crossing from an intermediate cell to a
straddle cell.
4 Data
The data used in the analysis comes from different sources. We draw primarily on admin-
istrative data from the Michigan Department of Corrections (MDOC), which provided
information on all individuals convicted of a felony between 2003 and 2006. The pre-
sentence investigation records, called the Basic Information Report (BIR), contain the
individual sentencing guidelines scores and components, identifiers for the sentencing grid
and cell for each case, legal codes for the offense first charged and convicted, habitual
offender status, type of conviction (plea, bench trial, jury trial, etc.), offense date, convic-
tion date, sentence date, days spent in jail (jail credits), sentence(s) imposed, and IDs for
judges, defense attorneys, counties, and circuits. Additionally, the BIR records offender
demographics, prior convictions and arrests, and substance abuse history.8
The two outcomes of interest we analyze in this study are recidivism and employment
in different time frames. Recidivism is measured in three ways: new felony convictions,
arrests, and entry to prison. Arrest records were provided by the Michigan State Police
and are available through 2012. Data on new felony convictions (convictions recorded af-
ter the original sentence and for offenses occurring after the original sentence) are drawn
from BIR from MDOC. Lastly, supervision records from MDOC document subsequent
incarceration in prison, for a technical violation or a new sentence.9 Conviction and incar-
ceration records are available through 2013. We analyze recidivism outcomes 1, 3 and 5
years after sentence and after release. In this draft we analyze only new felony recidivism.
8Demographic and economic characteristics used in the analysis include age, race, gender, marital statusat arrest, and years of schooling. A few characteristics in the PSI are crudely measured (i.e., whether or notthe offender has a history of mental illness, drug abuse, or alcohol abuse) but were nonetheless retained inthe analysis as they serve as important pre-sentence variables.
9Our access to data on multiple forms of recidivism and to MDOC data on the supervision of all paroleesand probationers allows us to capture moves to prison for parole and probation violations that are notrecorded in arrest records, a potentially important form of censoring that is not addressed in many studies.
12
Employment data comes from the Michigan Unemployment Insurance Agency (UIA)
which is matched to the MDOC administrative records using social security numbers and
aliases.10 These data provide information on employment histories in the formal labor
market prior to sentencing and employment outcomes post-sentence and post-release.
The employment variables are whether the individual is observed to be employed in the
4th, 12th, and 20th quarters after sentence and after release. An individual is observed
to be employed if he or she was recorded as having any wages during the quarter.
An important distinction that we make concerns the start of the risk period for the
outcome. One approach taken by many previous studies is starting the risk period at
release. This means that for a probationer, the risk period starts at sentence but for the
other three sentence types it starts once the incarceration period ends.11 An alternative
approach is to define the risk period as beginning at the date of sentencing for offenders
in all sentence types. We view both approaches as having strengths and weaknesses.
Starting the risk period at release allows for comparisons with prior research and re-
moves any incapacitation effects during incarceration. However, this method introduces
a number of potential problems. It confounds the effects of incarceration with effects of
10Records for the study population were matched with UI records using the following procedures. First,all social security numbers available in MDOC databases for the population were sent to the MichiganUnemployment Insurance Agency and Workforce Development Agency for matching. In some cases, morethan one social security number was available for each subject. Returned UI records were matched withnames from MDOC databases, including aliases, to eliminate incorrect social security numbers. If more thanone social security number that MDOC had recorded for the same person matched records in the UI data,project staff selected the best match by comparing employer names listed in the UI records with those listedin MDOC records (from parole or probation agent reports). For cases in which no single match could bedetermined, UI data were retained for all social security numbers listed in the MDOC records for a givenindividual, under the assumption that such people worked under multiple social security numbers.
11For those sentenced to jail or jail followed by probation, we must estimate the date of release from jailbased on the jail credits at sentencing and the sentence length because MDOC does not run the jails or trackjail inmates who are not also under MDOC supervision or custody (e.g. parolees or probationers servingjail time, prisoners temporarily housed in local jails for court appearances). In an unknown number of casesthese release dates are overestimates due to early release from jails, which is at the discretion of the localjail and often due to overcrowding. Given the short length of most jail sentences anyway and because we arenot concerned with estimating the effects of jail or jail followed by probation sentences, we do not see thisas a problem for the present analysis.
13
elapsed time, both period and aging effects. Measuring outcomes from time since release
also risks allowing some endogeneity to creep back into our estimates, as release dates
are potentially a function of behavior during prison. In addition, those sentenced to
particularly long minimum sentences will not have post-release outcomes, especially for
time periods furthest from release. Measuring outcomes starting at sentence avoids these
problems, but produces estimates that may be dominated by incapacitation effects.12 In
what follows, we report results from both approaches.
The analytic sample excludes resentences, flat or mandatory sentences (including life
sentences), community service and fines sentences, as well as records from specialty courts
(e.g., drug and family courts). We retained only the carrying or most severe offense and
associated sentencing outcome when the offender was convicted of multiple offenses. We
perform all analyses using records for non-habitual offenders only as this category con-
tains the vast majority of observations. The analytic sample for the RD analysis consists
of around 21,500 individual records from 83 counties in Michigan which correspond to
individuals whose PRV score (running variable) is within 16 points of the relevant cutoff.13
Table 1 shows basic descriptive statistics by sentence type. About 90% of offenders
in the sample are sentenced to intermediate sentences while only 10% are sentenced to
prison. Minimum sentence lengths are largest for probation and jail with probation with
27 and 31 months on average, respectively. The table also shows means of the baseline
covariates. The sample of offenders is primarily male, white, and non-married. Across all
sentence types, around a third of individuals have very low education, about 20% have
a mental illness, and between 30 and 50% have an addiction to alcohol. On average, at
the time of sentence the offenders were 30 years old, and were first arrested when they
were 20 years old. Finally, employment was very low with the average offender working
12Starting the risk period at sentence may also has more policy relevance because legislators and judgessurely consider incapacitation effects in making decisions or policies related to sentencing or release fromprison.
13The OSV score could potentially be used as an alternative or secondary running variable; nevertheless,it is more likely to be subject to manipulation than the PRV scores. The explanation of why the bandwidthof 16 point from the cutoff is chosen is given in the results section.
14
only about a third of the time before sentence.
Table 1: Descriptive statistics
Sentence type
Prison Probation JailJail withprobation
% of observations in sample 10.69 28.82 7.69 52.80
Average sentence length (months) 17.75 27.00 7.14 30.64
Median sentence length (months) 18.00 24.00 7.00 24.00
% of women 0.08 0.18 0.11 0.14
% white 0.59 0.49 0.57 0.69
% married 0.14 0.14 0.12 0.14
% with less than high school 0.44 0.45 0.47 0.42
Average age at sentence 32.90 30.80 32.75 31.98
Average age at first arrest 19.46 20.43 19.57 20.46
% with mental illness 0.19 0.18 0.19 0.21
% with drug addiction 0.54 0.50 0.52 0.51
% with alcohol addiction 0.50 0.32 0.42 0.52
Proportion of quarters employed within 3years before sentence
0.30 0.35 0.27 0.38
Proportion of quarters employed within 6years before sentence
0.33 0.36 0.29 0.39
Notes: All figures correspond to observations within 16 points from the cutoff. Sample sizes are
between 18,300 and 21,700 depending on the variable. Sentence length for jail with probation is
for the probation part of the sentence. Less than high school does not include GED.
Descriptive statistics of the outcomes are presented in Table 2. As mentioned above,
we analyze both after sentence and after release risk periods. Panel A shows average
recidivism for the four sentence types. Within 6 months after sentence the incidence of
new felonies is close to zero for all offenders (this reflects in part the fact that we measure
new felonies at the conviction date, not the offense date, because offense dates were more
frequently missing and in some cases unreliable). It increases monotonically with time
15
for all sentence types, although the rates of increase are highest for probationers. This
group exhibits the highest recidivism rates after sentence with around 20% of offenders
committing a new felony within 5 years of sentence. This is in contrast to prisoners,
for whom the recidivism rate is around 8% in the same time frame, reflecting in part
incapacitation in prison, as the median prison minimum sentence length is 18 months.
Individuals sentenced to jail and jail with probation exhibit recidivism rates similar to
those of probationers.
The incidence of recidivism is also low within 1 year after release for all groups. How-
ever, in contrast to the after sentence statistics, the increase in the rate is similar in all
cases, reaching 12% and 20% on average within 3 and 5 years after release, respectively.
The fact that after sentence but not after release the Lower recidivism rates for recidi-
vism measured after sentence than after release for individuals in prison, jail, and jail
with probation reflects the incapacitation effect of incarceration in prison and jail. The
differences are smaller for sentences involving jail because jail stays are relatively short.
The employment statistics are shown in Panel B of Table 2. Four calendar quarters
after sentence offenders sentenced to prison seem to be at a large disadvantage in terms
of the probability of being employed than individuals in other sentence types. Presum-
ably, this is also a consequence of the incapacitation effect. It does not show up in other
incarceration sentences such as jail and jail with probation because incarceration in jail
is typically fairly short. Further out in time, at 12 and 20 quarters after sentence, the
employment rate oscillates around 20% for individuals in all sentence types, although
those in jail with probation seem to have slightly better employment outcomes.
After release, it is interesting that employment rates start out higher in the fourth
quarter after release but then decline substantially in the 12th and 20th quarters. In the
case of prisoners whose decline from the 4th quarter to the 12th and 20th quarters is the
largest, almost 30% are employed soon after release but only around 17% have a job in
the 20th quarter after release. A post-prison peak in employment has been observed in
other studies using UI records (see Bushway et al., 2007).
16
Table 2: Descriptive statistics of outcomes of interest
Panel A. Any new felony within:
After sentence After release
1 year 3 years 5 years 1 year 3 years 5 years
Prison 0.003 0.116 0.247 0.063 0.237 0.350
Probation 0.075 0.233 0.324 0.075 0.233 0.324
Jail 0.031 0.207 0.301 0.073 0.234 0.317
Jail with probation 0.036 0.189 0.282 0.065 0.210 0.296
Panel B. Any employment in:
After sentence After release
4th quarter 12th quarter 20th quarter 4th quarter 12th quarter 20th quarter
Prison 0.103 0.200 0.166 0.281 0.190 0.169
Probation 0.324 0.256 0.208 0.324 0.256 0.208
Jail 0.203 0.175 0.153 0.185 0.173 0.140
Jail with probation 0.357 0.290 0.237 0.342 0.277 0.227
Panel C. Fraction released
After sentence
1 year 3 years 5 years
Prison 0.304 0.806 0.930
Probation 1 1 1
Jail 1 1 1
Jail with probation 1 1 1
Notes: All figures correspond to observations within 16 points from the cutoff. Sample sizes are between 21,300 and 21,700
depending on the variable.
17
5 Empirical Strategy
Our analysis exploits the exogenous change in the probability of being sentenced to prison
arising from the marginal increase in PRV scores that moves an offender from an interme-
diate cell (where the presumptive sentence is something other than prison) to a straddle
cell (where recommended sentence types include prison). In other words, offenders with
similar PRV scores face different probabilities of going to prison depending on whether
their PRV score lies to the left or right of a cut point that determines the boundary
between an intermediate and a straddle cell. This setting naturally leads to a fuzzy
regression discontinuity (RD) design because the increase in the probability of going to
prison from crossing the boundary between an intermediate and a straddle cell is less
than 1.
One complexity in our application of fuzzy RD is that the Michigan sentencing guide-
lines contain many potential discontinuities. Depending on the grid and OV level where
an offender is placed, the offender will be affected by a different cutoff. For example,
in the grid shown in the appendix, an offender in OV level I would be affected by the
PRV score cutoff of 50 points while the relevant cutoff for an offender in OV level II is 25
points. Therefore, only one cutoff is relevant for each individual. This differs from the
setup in other RD designs with multiple discontinuities in which the same individual is
affected by all the cutoffs (see for example van der Klaauw, 2002).
A second complexity is that our treatment, sentence type, is not dichotomous but
rather has four categories. The sentence types contemplated in the Michigan sentencing
guidelines can be many as discussed in section 3. In particular, the vast majority of of-
fenders in our sample are sentenced to prison, jail, jail with probation, or probation. We
do not consider other possible sentences such as fines or community service because we
believe individuals who receive such sentences have zero probability of receiving a prison
sentence and are therefore not appropriate comparisons for those sentenced to prison.
Our main theoretical question relates to the prison vs. probation comparison but since
other sentence types are possible in intermediate and straddle cells, we must incorporate
18
them in the analysis.
A common strategy to analyze the data from multiple cutoffs in a regression disconti-
nuity setup is to normalize all cutoffs to zero to run a pooled regression on all observations
to obtain the treatment effect based on a single-cutoff estimator. Cattaneo et al. (2015)
show that normalization of the cutoffs estimates an average of local average treatment
effects that is weighted by the relative density of observations around each cutoff. Under
this framework and given the fuzzy nature of our RD design, we are only able to identify
one treatment effect using the single cutoff as an instrumental variable. Since we have
four sentence types, we construct indicator variables for which the reference category is
probation. The sentence type variables are defined as prison vs. everything else, jail vs.
everything else, and jail with probation vs. everything else. When including all three
sentence type indicators in the regression, the indicator variables give the difference in
the outcome between prison and probation, jail and probation, and jail with probation
and probation, respectively. However, our treatment of interest is sentence to prison as
compared to sentence to probation, so we include the indicators for jail and jail with
probation as controls in the regression and do not give a causal interpretation to their
coefficients. As we will show below, the discontinuities at the borders between interme-
diate and straddle cells only produce meaningful variation in treatment assignment for
the prison vs. probation comparison.
The fuzzy RD can be described by the following two-equation system:
yi = β0 + τDi + β1PRVi + β2PRVi ·Di +Xγ + εi (1)
Di = α0 + ηTi + α1PRVi + α2PRVi · Ti +Xθ + νi (2)
Where (2) is the first stage equation relating the treatment dummy (Di) with an
indicator for crossing the cutoff (Ti), and (1) is the outcome equation relating the out-
come yi with the treatment dummy. The matrix X includes the treatment dummies
that are not being instrumented as well as controls for age. The outcomes we consider
19
are recidivism and employment over different time frames as described in the data section.
As shown by Angrist and Pishke (2009), the fuzzy RD estimation is mathematically
equivalent to instrumental variables (IV) estimation. In this sense, the indicator for
crossing the cutoff can be used as an instrument for the treatment dummy, and the two
equations above can be estimated by 2SLS. This method provides the causal effect of
the treatment on the outcomes of interest for those who are affected by the instrument
(crossing the cutoff) provided that the instrument only affects the outcome through its
effect on the probability of going to prison (the exclusion restriction), and that crossing
the cutoff makes offenders more likely to go to prison (monotonicity). One possible threat
to the exclusion restriction is that crossing the cutoff could affect the outcome through
changes in sentence lengths as well as sentence types. However, in this setting crossing
the cutoff does not produce variation in either prison length for those sentenced to prison
on both sides of the cutoff or probation sentence length for those sentenced to probation
on both sides of the cutoff.
In the results shown later, we estimate the models with a linear functional form of the
running variable allowing for a different slope on either side of the cutoff. The running
variable (PRV scores) is centered to zero so that equation (1) estimates the treatment
effect at the cutoff. Consistent with current best practice in RD designs, we use local
linear regressions (LLR) around each cutoff are only using observations that are within
a narrow bandwidth from the cutoff.
We also conduct a reduced-form analysis that estimates the effects of crossing the
cutoffs on the oucomes. From the equations above, the reduced-form equation is obtained
by substituting equation (2) on equation (1) as follows:
yi = β0 + τRTi + β1PRVi + β2PRVi ·Di +Xγ + ϵi (3)
In this case, the coefficient τR is the intent-to-treat effect, that is, the effect of being
eligible for a prison sentence (by crossing the boundary between an intermediate and a
20
straddle cell). We perform reduced-form analyses for the 11 cutoffs separately.
To account for potential heterogeneity in the first stage across multiple cutoffs in
our data, we reformulate the IV setup so that we can use the 11 cutoffs as 11 separate
instrumental variables. This relaxes an assumption implicit in an analysis that pools all
cutoffs together, that the effect of crossing each cutoff on treatment assignment is the
same. Such an assumption may be unreasonable here because the various cutoffs are for
different crime classes (severities), for offenders with different criminal histories, and for
offenses with different characteristics. The two-equation system for a setup that treats
each cutoff as a separate instrument is:
yic = β0 + τDi + β1PRVi + β2PRVi ·Di + δcCi + πcPRVi · Ci +Xγ + εi (4)
Di = α0 + λcTic + α1PRVi + α2PRVi · Tic + ιcCi + θcPRVi · Ci +Xµ+ νi (5)
Where yic is the outcome for individual i in cutoff c, Di is the treatment of interest,
Tic indicates whether the PRV score is to the right of cutoff c, and Ci is an indicator
for whether the individual’s relevant cutoff is cutoff c. We control for a full set of cutoff
dummies as well as their interactions with the running variable . Again, we center
the running variable at zero for each cutoff, and include controls for the non-relevant
treatments and age. Similar to the pooled single-cutoff model, we estimate the model by
2SLS.
6 Results
This section presents graphic and econometric evidence of the magnitude of the rela-
tionship between the probability of receiving a prison sentence and crossing the cutoff,
the so-called first stage. We also show reduced-form results for the recidivism and em-
ployment outcomes. We conclude by showing instrumental-variable regressions based on
pooling all cutoffs together and using the 11 cutoffs as multiple IVs.
21
All estimates in this section are obtained from local linear regressions (LLR) within
a bandwidth of 16 points from the cutoff. The procedure to arrive at this bandwidth
is based on the application of randomization inference to RD proposed by Cattaneo et
al. (2015a). This procedure emerges from the assumption that within a narrow win-
dow around the cutoff, the RD design is analogous to a randomized experiment. Even
though the estimation can be performed relying on parametric extrapolation from flexible
polynomials that use all the observations, we avoid this approach because it produces
biased point-estimates and standard errors if done incorrectly.14 What the randomization
inference procedure proposes is to find the smallest window in which local randomiza-
tion is an appropriate assumption. It consists of running a series of balance tests of
the baseline covariates for each candidate window, and choosing the smallest window in
which covariate balance is not rejected. We implement this test for each of the 11 cut-
offs by running seemingly unrelated regressions (SUR) in which we test the hypothesis
that the covariates in Table 1 are jointly balanced in a given window. We found that a
window of 16 is the smallest window in which the covariates are balanced for all cutoffs.15
Another substantive reason to implement the test derived from randomization infer-
ence is that the nature of the PRV scores (running variable) does not allow to perform
McCrary type tests (McCrary, 2008). As discussed in the sentencing guidelines section,
the PRV scores are constructed from 7 different prior record variables. These variables
are coded in multiples of 5 and 2, so it is impossible to get certain values of the score.
14As we explored the possibility of a strategy involving polynomial functional forms for the running variableand all observations, we conducted AIC tests to obtain the optimal polynomial order for each outcome /cutoff combination when using the whole range of observations in Grids D, E and F. The optimal ordersvaried considerably across cutoffs, so to avoid imposing the same functional form for all cutoffs, we opted forthe randomization inference approach in which the linear approximation is valid for all outcomes / cutoffsonce we restrict the observations to a narrow window around each cutoff.
15For two of the cutoffs (Grid E, OV II and Grid F, OV II) we found that there is no balance of thecovariates in any of the candidate windows. However, once we control for a quadratic on age at sentencein the SUR tests, the covariates are balanced within the bandwidth of 16 points. Hence, we control for aquadratic on age in all estimations. This is appropriate because age is mechanically correlated with therunning variable, a composite of the offender’s prior record. Older offenders will on average have higherprior record scores since they have had more time to commit and be apprehended for crimes. As a result,we understand the RD design to be valid in this setting only once we have conditioned on age.
22
In this sense, the running variable is not only not continuous but very rugged, which
makes the McCrary test non-informative, as the test will appear to detect evidence of
manipulation where there are merely mathematically impossible values of the running
variable. Hence, we rely on balance tests of the covariates to establish the validity of our
RD design.
6.1 First stages
Figure 1 shows the basic relationship between the probability of going to prison and the
dummy for crossing the cutoff when all cutoffs are pooled into a single one and centered
at zero. The y axis shows the probability of going to prison relative to the other three
types of sentences, and the x axis shows the PRV scores. Scores greater or equal to zero
indicate that the individual’s PRV score is at or to the right of the cutoff relevant for that
individual. In the graph, each dot represents the average probability of going to prison
for each value of the PRV scores, and the lines are the fitted values from a regression
of the prison indicator on a dummy for crossing the cutoff, the running variable and an
interaction between the two.16 There is a clear discontinuity in the probability of being
sentenced to prison for individuals with PRV scores at and to the right of the cutoff,
i.e. offenders whose PRV score places them in a straddle cell. Visually, the jump in the
probability of prison at the cutoff is around 10 percentage points.
The first stage graph in Figure 1 illustrates the raw discontinuity, that is, without
controlling for age and cutoff fixed effects. Table 3 presents the first stage regressions
for the pooled sample and each cutoff individually. In the pooled case, the regression
controls for a quadratic on age, cutoff fixed effects and their interactions with the running
variable. Crossing the cutoff increases the probability of receiving a prison sentence by
9.3 percentage points, very similar to what we observe in Figure 1.
We estimate the first stage for each of the cutoffs in Grids D, E and F. The magnitude
16Note that although all dots are drawn the same size, they may represent vary different numbers ofobservations. This is the case with all graphs we present in this paper.
23
0.1
.2.3
.4P
r (P
rison
)
−20 −10 0 10 20PRV scores centered at zero
First stagePrison vs. all other sentences
Figure 1: First stage - pooled cutoffs
of the first stage oscillates between 4.85 percentage points in Grid E, OV I to 39.9
percentage points in Grid F, OV IV. Notice the variation in the size of the jump across
cutoffs, as well as the variation in the sample sizes. The small sample sizes in each cutoff
suggest that estimating separate models for each cutoff would lead to underpowered
estimates. The cutoffs that seem to provide stronger first stages in terms of the size of
the jump and the statistical significance are those in Grid D. This makes sense based
on the classification of the crime classes provided by the Michigan sentencing guidelines;
high severity crimes are in grids letters closer to A. Offenders in Grid D have committed
more serious crimes, and are therefore more likely to be sentenced to prison if their scores
place them in a straddle cell. Although not smaller in magnitude than other first stages,
two of the cutoffs (Grid E, OV IV and Grid F, OV I) are not statistically significant.
This is probably due to lack of power because other cutoffs with similar sample sizes have
a significant first stage when the point estimate is much higher than those in these two
cutoffs, and because other cutoffs with similar first stage coefficients have a significant
first stage when the sample size is larger. The first stage graphs for each of the 11 cutoffs
24
Table 3: First stages
1{PRVi ≥ cutoff} S.E. Observations R-squared
All cutoffs pooled 0.0931*** 0.0091 21,737 0.13
Grid DOV I 0.1894*** 0.0545 1,034 0.13
OV II 0.1186*** 0.0354 1,597 0.14
OV III 0.2313*** 0.0517 586 0.10
Grid EOV I 0.0485*** 0.0130 6,000 0.10
OV II 0.1166*** 0.0151 6,631 0.139
OV III 0.1386*** 0.0415 1,192 0.16
OV IV 0.0640 0.0423 1,017 0.12
Grid FOV I 0.0680 0.0541 730 0.05
OV II 0.0811*** 0.0225 2,132 0.08
OV III 0.1418** 0.0566 607 0.11
OV IV 0.3985*** 0.0921 211 0.10
Notes: Robust standard errors. All models regress the prison dummy on a dummy for
crossing the cutoff, the PRV scores, the interaction between the two, cutoff fixed effects,
their interaction with the PRV scores and a quadratic on age at sentence.
*** p
than the cutoff, a linear term on the running variable, an interaction between the cutoff
dummy and the PRV scores that allows for a different slope on either side of the cutoff,
and a quadratic on age at sentence. In the case of the pooled regressions, cutoff fixed
effects and their interactions with the running variable are also included. All regressions
include observations within a 16-point window from the respective cutoff.
Table 4 presents the results from the recidivism reduced form regressions for all cut-
offs pooled together and for each individual cutoff. Six outcome variables are examined:
binary indicators for whether the offender recidivates within 1 year, 3 years, and 5 years
after sentence and after release. The after-sentence estimates capture in part any incapac-
itation effect that emerges from the fact that a significant portion of offenders sentenced
to prison are still incarcerated in the time frames at which the outcome variables are
measured. Recall from Table 2 that, in this sample, only 30% of prisoners have been
released to the community 1 year after sentence. The first row of the table shows that
compared to being in an intermediate cell, being to the right of the cutoff translates into a
probability of committing a new felony that is 1.24 percentage points lower. Three years
after sentence the probability is lower by 3.27 percentage points, and 5 years later, it is
lower by 2.52 percentage points. All these figures are statistically significant, although
the last one is only significant at the 10% level. After release, the picture is substantially
different. None of the effects are significant at the 5% level, meaning that after being
released to the community, the likelihood of committing a new crime is basically the
same for an individual who was sentenced to prison than for somebody with a different
sentence type. This is consistent with what was shown in Table 2, i.e., all sentence types
had very similar recidivism rates after release.
Table 4 also shows a great deal of heterogeneity across cutoffs. While most of the
signs of the point estimates are negative, some are positive, suggesting that crossing the
cutoff increases the probability of committing a new crime. The majority of positive
signs are found in Grids E and F in which the severity of the crime for which individu-
als were originally sentenced is lower than in grid D. It is worth noticing that at times
the coefficients are sizeable in magnitude but the small number of observations increases
26
Table 4: Reduced forms by cutoff (recidivism)
Any new felony within:
After sentence After release1 year 3 years 5 years 1 year 3 years 5 years
All cutoffs -0.0124** -0.0327*** -0.0252* -0.0063 -0.0196* -0.0172pooled (0.0059) (0.0113) (0.0129) (0.0073) (0.0118) (0.0132)
Grid D
OV I -0.0137 -0.0939* -0.0319 0.0225 -0.0667 -0.0295(0.0297) (0.0549) (0.0635) (0.0393) (0.0600) (0.0649)
OV II -0.0329 -0.0458 -0.0045 -0.0208 -0.0320 -0.0031(0.0221) (0.0434) (0.0485) (0.0283) (0.0450) (0.0495)
OV III -0.0106 -0.1432* -0.1628* -0.0137 -0.1818** -0.1452(0.0355) (0.0777) (0.0939) (0.0477) (0.0844) (0.0981)
Grid E
OV I -0.0284** -0.0538** -0.0679** -0.0162 -0.0488** -0.0704***(0.0142) (0.0243) (0.0265) (0.0161) (0.0248) (0.0267)
OV II -0.0090 -0.0417** -0.0202 -0.0109 -0.0209 -0.0063(0.0092) (0.0191) (0.0224) (0.0119) (0.0202) (0.0230)
OV III 0.0263 0.0492 0.0409 0.0658** 0.0709 0.0892*(0.0225) (0.0455) (0.0518) (0.0308) (0.0481) (0.0537)
OV IV 0.0134 -0.0338 -0.0644 -0.0178 -0.0338 -0.0643(0.0191) (0.0546) (0.0656) (0.0313) (0.0585) (0.0692)
Grid F
OV I 0.0107 0.0547 0.0138 -0.0381 0.0533 0.0124(0.0312) (0.0646) (0.0744) (0.0430) (0.0695) (0.0775)
OV II -0.0074 0.0189 0.0386 -0.0021 0.0292 0.0403(0.0166) (0.0332) (0.0388) (0.0199) (0.0348) (0.0399)
OV III 0.0220 -0.0658 -0.0592 0.0140 -0.0629 -0.0620(0.0274) (0.0590) (0.0691) (0.0385) (0.0642) (0.0738)
OV IV -0.0529 0.0621 -0.1820 -0.0081 0.1073 -0.1162(0.0520) (0.1305) (0.1479) (0.0796) (0.1388) (0.1575)
Notes: Robust standard errors in parentheses. Results are from regressions of each outcome on a
dummy for crossing the cutoff, a linear term on the PRV scores and the interaction between
the two. A quadratic on age at sentence is also included.
*** p
0.0
5.1
.15
−20 −10 0 10 20PRV scores centered at zero
Reduced formNew offense within 1 year after sentence
(a)
.06
.08
.1.1
2.1
4.1
6
−20 −10 0 10 20PRV scores centered at zero
Reduced formNew offense within 1 year after release
(b)
.15
.2.2
5.3
.35
−20 −10 0 10 20PRV scores centered at zero
Reduced formNew offense within 3 years after sentence
(c)
.1.2
.3.4
.5
−20 −10 0 10 20PRV scores centered at zero
Reduced formNew offense within 3 years after release
(d)
.2.3
.4.5
.6
−20 −10 0 10 20PRV scores centered at zero
Reduced formNew offense within 5 years after sentence
(e)
.2.3
.4.5
.6
−20 −10 0 10 20PRV scores centered at zero
Reduced formNew offense within 5 years after release
(f)
Figure 2: Recidivism reduced forms (all cutoffs pooled)
the uncertainty with which they are estimated. Grid E is perhaps an exception to the
small sample problem. In fact, in this grid we find most of the significant estimates. In
28
particular, for OV level I we find that being in a straddle cell reduces the probability of
committing a new crime 1, 3, and 5 years after sentence, and 3 and 5 years after release.
The magnitudes of these reductions are important: 3 and 5 years after sentence, the
probability of recidivating is about 5 and 7 percentage points lower, respectively, after
sentence and after release. In this case, it seems that more than having an effect through
the incapacitation effect, being in a straddle cell is effectively reducing the probability
of committing new crimes once the offenders are released to the community. This is of
course not generally the case. For example, in OV II in the same grid we find the op-
posite effect (that individuals in a straddle cell are more likely to recidivate) both after
sentence and after release, and these estimates are statistically significant 1 and 5 years
after release.
Regarding employment, Table 5 shows the reduced form coefficients in the same
format discussed above. The employment outcomes are measured by binary variables
indicating whether the individual is observed receiving formal sector wages in a given
period (4th, 12th, and 20th quarters) after sentence and after release. Pooling all cut-
offs together, the coefficients across all outcomes are very close to zero and insignificant.
This suggests that employment outcomes are similar for individuals sentenced to any of
the sentence types. Recall also that the employment rates for this sample are very low,
almost never surpassing 35% in any of the sentence groups or time frames (see Table 2).
Examining individual cutoffs separately supports our conclusions from the pooled
case. Most of the regressions give very small coefficients that are statistically insignifi-
cant. As with recidivism, there seems to be heterogeneity, as the signs of the coefficients
are not the same across the table. The few coefficients that are significant are very large
in magnitude. For example, in the 20th quarter after release in Grid D, OV III, crossing
the cutoff is associated with a probability of having a job that is 21.3 percentage points
larger than being to the left of the cutoff. Similarly, for the same quarter after release,
this probability is 15 percentage points larger in Grid F, OV I.
Graphical presentations of the reduced forms are provided in Figures 2 and 3. As
29
Table 5: Reduced forms by cutoff (employment)
Any employment in:
After sentence After release4th quarter 12th quarter 20th quarter 4th quarter 12th quarter 20th quarter
All cutoffs -0.0175 0.0078 0.0039 -0.0071 -0.0023 0.0039pooled (0.0133) (0.0127) (0.0119) (0.0135) (0.0127) (0.0121)
Grid D
OV I -0.0149 0.0081 -0.0527 -0.0707 -0.0528 0.0260(0.0506) (0.0518) (0.0460) (0.0549) (0.0507) (0.0466)
OV II 0.0319 0.0404 0.0123 0.0650 0.0790* -0.0068(0.0471) (0.0452) (0.0422) (0.0481) (0.0451) (0.0437)
OV III 0.0554 0.1265 0.0711 0.0895 0.0726 0.2128**(0.1142) (0.1135) (0.1084) (0.1165) (0.1108) (0.1080)
Grid E
OV I 0.0229 -0.0053 -0.0237 -0.0035 -0.0103 -0.0087(0.0251) (0.0235) (0.0220) (0.0252) (0.0235) (0.0222)
OV II -0.0329 0.0053 0.0245 -0.0037 0.0014 0.0209(0.0246) (0.0238) (0.0225) (0.0250) (0.0237) (0.0227)
OV III -0.0880 -0.0352 0.0225 -0.0868 -0.0581 -0.0092(0.0556) (0.0535) (0.0502) (0.0563) (0.0527) (0.0511)
OV IV -0.1036 -0.0329 -0.0756 -0.0730 0.0303 0.0073(0.0856) (0.0836) (0.0790) (0.0868) (0.0830) (0.0820)
Grid F
OV I 0.0114 0.0833 0.0840 0.1048 0.0886 0.1499**(0.0663) (0.0637) (0.0587) (0.0696) (0.0633) (0.0581)
OV II -0.0571 0.0465 0.0297 -0.0332 -0.0049 0.0267(0.0428) (0.0395) (0.0366) (0.0429) (0.0398) (0.0371)
OV III -0.0717 0.0010 -0.0382 0.0131 -0.0924 -0.0730(0.0780) (0.0727) (0.0674) (0.0798) (0.0710) (0.0727)
OV IV -0.3509 -0.1211 0.3496* -0.2665 -0.1171 0.3133(0.2140) (0.2019) (0.1931) (0.2205) (0.2149) (0.1897)
Notes: Robust standard errors in parentheses. Results are from regressions of each outcome on a dummy for
crossing the cutoff, a linear term on the PRV scores and the interaction between the two. A quadratic on age
at sentence is also included.
*** p
.15
.2.2
5.3
.35
.4
−20 −10 0 10 20PRV scores centered at zero
Reduced formAny employment in 4th quarter after sentence
(a)
.2.2
5.3
.35
.4
−20 −10 0 10 20PRV scores centered at zero
Reduced formAny employment in 4th quarter after release
(b)
.05
.1.1
5.2
.25
.3
−20 −10 0 10 20PRV scores centered at zero
Reduced formAny employment in 12th quarter after sentence
(c)
.05
.1.1
5.2
.25
.3
−20 −10 0 10 20PRV scores centered at zero
Reduced formAny employment in 12th quarter after release
(d)
.05
.1.1
5.2
.25
−20 −10 0 10 20PRV scores centered at zero
Reduced formAny employment in 20th quarter after sentence
(e)
.05
.1.1
5.2
.25
−20 −10 0 10 20PRV scores centered at zero
Reduced formAny employment in 20th quarter after release
(f)
Figure 3: Employment reduced forms (all cutoffs pooled)
before, the graphs show the average of the outcome variable for each value of the running
variable, and the lines show the predicted values from regressions as described above.
31
Due to space constraints, we only present graphs based on the pooled sample. These
graphs are consistent with the reduced form estimates for the pooled models presented
in Table 4 and Table 5.
6.3 OLS and 2SLS estimates
Our last set of results exploits the fact that our RD is a fuzzy design because the prob-
ability of going to prison does not increase from zero to one when crossing the cutoff.
As discussed in section 5, a fuzzy RD design is basically an instrumental-variable design
in which the first stage is given by the change in the probability of receiving the treat-
ment when crossing the cutoff. In this sense, 2SLS is an appropriate estimation method
to obtain a consistent estimate of the treatment effect for observations near the cutoff
and for whom crossing the cutoff changes the treatment assigned. As explained in the
beginning of this section, we implement local linear regressions within a bandwidth of
16 points around the cutoff. This is the widest bandwidth for which we found that the
covariates were balanced across all 11 cutoffs. As we have done throughout this paper, in
the results that follow we differentiate between the effects after sentence and after release.
Tables 6 and 7 present results for the recidivism and employment outcomes, respec-
tively, for the pooled sample only.17 We show only the coefficient for the treatment of
interest, i.e. the indicator for whether the individual was sentenced to prison as compared
to probation. Each table contains three panels indicating the estimator that was used to
obtain the coefficient. In Panel A, the nave OLS regression coefficient is reported. This is
a regression of the outcome on three indicators for sentence type (excluding probation),
a linear term on the PRV scores, interactions of the sentence type indicators and the
PRV scores, interaction of the cutoff dummy with the PRV scores, a quadratic on age
at sentence, and cutoff fixed effects and their interactions with the PRV scores. Since
the omitted treatment dummy is probation, the coefficient on the dummy for prison is
measuring the difference, at the cutoff, in the outcome of interest between an individual
17As we saw previously, most coefficients in the reduced form regressions by cutoff were not significant.Hence, presenting 2SLS regressions by cutoff would not be reasonable since the standard errors in the 2SLSregressions will be higher than those in the reduced form regressions.
32
sentenced to prison relative to one sentenced to probation.
The OLS coefficient is naive in the sense that it does not instrument for the treat-
ment using the change in probability of treatment attributable to crossing the cutoff.
In other words, it does not take into account the fact that the probability of going to
prison does not automatically jump to one to the right of the cutoff. For this reason, we
implement two IV approaches. The first approach is based on pooling all cutoffs together
to create a single cutoff that will serve as a single instrumental variable for the prison
treatment indicator. The IV is simply a dummy equal to one if the PRV score is to the
right of the cutoff. Under this approach, we have one endogenous variable that we want
to instrument and one IV, so the model is exactly identified. In the second approach
we make use of the multiple instruments that the 11 cutoffs provide. As before, we can
construct the same type of instrumental variables by generating 11 dummy variables for
whether the PRV is greater than or equal to each individual cutoff value. In this case,
we will use 11 IVs to instrument the prison dummy, so the model is overidentified. We
expect that sampling variability will be lower under this approach because the first stages
more accurately capture the change in treatment probability that comes from crossing
each cutoff (rather than assuming that change is the same for all cutoffs as the pooling
approach does), leading to a (slightly) stronger first stage. In 2SLS, a stronger first stage
leads to smaller SEs for the treatment effect coefficient(s) (Wooldridge, 2002). In each
of the two approaches we include an interaction between the treatment dummy and the
running variable, which is instrumented by interactions between the IV and the PRV
scores in the first case, and by interactions between each of the IVs and the PRV scores
in the second.
The results for the recidivism outcomes are presented in Table 6. The OLS estimates
show that the probability of committing a new crime is reduced by 7.2, 12.6, and 9.8
percentage points 1, 3, and 5 years after sentence, respectively, when the individual is
sentenced to prison as compared to probation. Similarly, the probability of recidivating
is 1.68 points lower 1 year after release. For longer periods after release, there does not
seem to be any difference between individuals sentenced to prison rather than probation.
33
Table 6: LLR results (recidivism)
Any new felony within:
After sentence After release1 year 3 years 5 years 1 year 3 years 5 years
Panel A. OLS
1{Sentence = prison} -0.0723*** -0.1256*** -0.0979*** -0.0168** -0.0138 -0.0096(0.0046) (0.0101) (0.0123) (0.0071) (0.0120) (0.0137)
Observations 21,737 21,737 21,737 21,641 21,594 21,399
Panel B. 2SLS - All cutoffs pooled
1{Sentence = prison} -0.1549** -0.3639*** -0.2908** -0.0742 -0.2252 -0.2213(0.0670) (0.1309) (0.1481) (0.0853) (0.1434) (0.1673)
Observations 21,737 21,737 21,737 21,641 21,594 21,399
Panel C. 2SLS - Multiple cutoffs
1{Sentence = prison} -0.1391*** -0.2461** -0.1782 -0.0473 -0.0765 -0.0554(0.0539) (0.1056) (0.1206) (0.0702) (0.1173) (0.1390)
Observations 21,737 21,737 21,737 21,641 21,594 21,399
Notes: Robust standard errors in parentheses. OLS results are from regressions of each outcome on three treatment dummies
which indicate the sentence type: prison, jail, and jail with probation. The reference category is probation. Regressions include
a linear term on the PRV scores and the interaction between the treatment dummies and the PRV scores. In Panel B, 2SLS
regressions run the same OLS regression instrumenting the dummy for prison with a dummy indicating whether the PRV score
is to the right of the pooled cutoff. In Panel C, the dummy for prison is instrumented with 11 dummies indicating whether the
PRV score is to the right of each of 11 cutoffs. All models include cutoff fixed effects and their interactions with the PRV scores,
as well as a quadratic on age at sentence.
*** p
this is primarily capturing an incapacitation effect. After release, none of the coefficients
are statistically significant, although the coefficients are large in magnitude. With more
than 21,000 observations, the large standard errors are attributable to the fact that the
standard errors in IV estimation tend to be large (Wooldridge, 2002).
The last panel of Table 6 shows the coefficients obtained from the IV approach with
multiple instruments. The magnitudes are fairly consistent with those found in the IV
approach with only one instrument. For example, being sentenced to prison reduces the
probability of committing a new crime by 13.9, 24.6, and 17.8 percentage points when
measured 1, 3, and 5 years after sentence. Even though the standard errors are smaller
under this approach, we find that the effect 5 years after sentence is not statistically
significant. Moreover, the reduction in standard errors is not sufficient to make the after-
release coefficients statistically significant, although the magnitude of these estimates is
more consistent with a no post-release effects than those from the pooled cutoffs single
IV approach, for which the coefficients were very large in magnitude but not significant.
In summary, we see a large incapacitation effect arising from the fact that a signif-
icant fraction of offenders sentenced to prison are still in prison when measuring these
outcomes while individuals in other sentences have already been released or have been
released for longer periods of time. The effects shrink considerably once we compare pris-
oners and probationers after release, and the estimated effects are no longer statistically
significant. We note, however, that the point estimates are still potentially substantively
large in their potential implications, as they represent changes in recidivism rates of
approximately 5-8 percentage points. Even in the models with multiple cutoffs treated
as multiple instruments, our models are underpowered to detect effects of such magnitude.
The employment results presented in Table 7 are consistent with the findings from the
reduced-form regressions. First, the OLS estimates are statistically significant for 5 out
of 6 outcome variables. In the fourth quarter after sentence, prisoners are 19.5 percentage
points less likely to have a job than a probationer. In the 12th and 20th quarters, the
35
negative coefficients are smaller (3.2 and 2.1 percentage points, respectively).18 After
release, there is no difference between a prisoner and a probationer in the fourth quarter,
but the coefficients in the 12th and 20th quarters are almost identical to those in the
corresponding after-sentence quarters.
Table 7: LLR results (employment)
Any employment in:
After sentence After release4th quarter 12th quarter 20th quarter 4th quarter 12th quarter 20th quarter
Panel A. OLS
1{Sentence = prison} -0.1947*** -0.0324*** -0.0213** -0.0172 -0.0394*** -0.0252**(0.0103) (0.0115) (0.0108) (0.0130) (0.0118) (0.0119)
Observations 21,504 21,504 21,504 21,387 21,278 20,612
Panel B. 2SLS - All cutoffs pooled
1{Sentence = prison} -0.1821 0.0921 0.0527 -0.1171 -0.0343 0.1842(0.1501) (0.1443) (0.1351) (0.1594) (0.1515) (0.1912)
Observations 21,504 21,504 21,504 21,387 21,278 20,612
Panel C. 2SLS - Multiple cutoffs
1{Sentence = prison} -0.2326* -0.0123 0.0259 -0.2231* -0.0436 0.0968(0.1210) (0.1160) (0.1088) (0.1310) (0.1249) (0.1452)
Observations 21,504 21,504 21,504 21,387 21,278 20,612
Notes: Robust standard errors in parentheses. OLS results are from regressions of each outcome on three treatment dummies which indicate
the sentence type: prison, jail, and jail with probation. The reference category is probation. Regressions include a linear term on the PRV
scores and the interaction between the treatment dummies and the PRV scores. In Panel B, 2SLS regressions run the same OLS regression
instrumenting the dummy for prison with a dummy indicating whether the PRV score is to the right of the pooled cutoff. In Panel C, the
dummy for prison is instrumented with 11 dummies indicating whether the PRV score is to the right of each of 11 cutoffs. All models
include cutoff fixed effects and their interactions with the PRV scores, as well as a quadratic on age at sentence.
*** p
differences among sentence types were not important as we found in our analyses above.
The only two coefficients that are significant at the 10% level are those for the 4th quarter
after sentence and after release. According to these, a prisoner is about 23 percentage
points less likely to be working in a formal job in the 4th quarter after receiving the
sentence or after being released from prison. Some of the coefficients switch sign but we
cannot tell whether it is because prisoners are really more likely to be working (some
of the point estimates are sizeable in magnitude) or because of the imprecision of our
estimates. We do note, however, that in both 2SLS approaches, the point estimates for
employment effects trend to more positive effects of prison on employment over time,
starting most negative in the 4th quarter and moving to most positive in the 20th quar-
ter, suggesting yet another axis along which effects of incarceration in prison may be
heterogeneous.
All in all, the employment results are not supportive of effects of incarceration in
prison vs. probation on employment outcomes. These conclusions must be tempered
by the fact that our standard errors are too large to give very precise estimates, and
as a result our results are not inconsistent with incarceration effects. However, the
2SLS results are consistent with the descriptive statistics and reduced-form regressions
discussed above.
7 Conclusion
Whether and how being incarcerated in prison affects one’s future behavior are questions
that criminologists have asked for decades. This paper leverages discontinuities in the
probability of being sentenced to prison arising from the structure of the Michigan Sen-
tencing Guidelines to provide new estimates of the total effects of incarceration on future
recidivism and employment outcomes. The sentencing guidelines provide a framework in
which low-level offenders (classified in Grids D, E, and F, i.e. low-severity crime classes)
may receive a prison sentence if their prior record scores (PRV) are at or above a certain
cutoff determined by the specific grid and the offense severity level. This setup leads to a
fuzzy RD design in which the running variable is the PRV scores. To estimate the causal
37
effect of receiving a prison sentence relative to probation, we perform reduced-form and
instrumental-variable analyses based on the logic of instrumental variables applied to the
fuzzy RD setting. We conduct two types of IV analyses: one in which we pool all cutoffs
together to construct a single binary variable indicating whether the individual’s PRV
score is at or to the right of the pooled cutoff. This indicator is used as an instrument
for the dummy indicating the receipt of a prison sentence. In the second IV approach,
we construct 11 IVs in the same way, one for each of the 11 cutoffs present in Grids D,
E, and F of the Michigan Sentencing Guidelines.
From our analytical sample of around 21,000 observations within a bandwidth around
the cutoff of 16 points of the PRV scores, we are able to identify significant incapacita-
tion effects particularly for recidivism. Being sentenced to prison reduces the likelihood
of committing a new crime by at least 14 percentage points relative to an offender sen-
tenced to probation. This effect is present 1, 3, and 5 years after sentence. After release,
we find null prison effects in all time frames. Hence, our results suggest that, at the
cutoff, the main effect of sentencing offenders to prison operates through incapacitation.
That is, our results indicate that individuals getting a prison sentence behave the same
as individuals receiving probation after release. One interpretation of these two results
taken together is that either criminogenic and crimino-suppressive mechanisms are both
null or that the two types of effects roughly cancel each other out. While we cannot
distinguish between the various mechanisms that might bring about those effects, our
results do suggest the importance of incapacitation effects. If it were to generalize, this
would also be an important conclusion from a policy perspective, as it suggests that
sentencing offenders on the margin between prison and probation to prison only reduces
their average future offending during the time they spend in prison.
Our IV strategy is less successful in identifying the effects of incarceration on em-
ployment, as the estimates are fairly imprecise. Nonetheless, the estimates suggest that
initially there is a strong negative effect on the probability of having a formal job in
the 4th quarter after sentence and in the 4th quarter after release (which is only signif-
icant at the 10% level). Longer term effects are not statistically significant. We find no
38
statistically significant effect in the 12th quarter and then a slight positive but still not
statistically significant effect in the 20th quarter.
One reading of our results is that they suggest that future research should pay closer
attention to sources of heterogeneity in the effect of incarceration in prison, as some
of our results suggest the potential for considerable effect heterogeneity in the effect
of prison. First, although our results differ from those of some prior studies (see for
example, Mueller-Smith, 2014, who finds positive effects of incarceration on recidivism
and negative effects on employment using a different identification strategy), this may
be because we are estimating effects for a different subpopulation of offenders, in our
case only individuals within a certain window of the cutoffs. Second, the reduced-form
analysis presented in this paper provides evidence that crossing the cutoff has significant
effects on the outcomes that vary in sign depending on the cutoff analyzed. This suggests
that individuals convicted of more or less severe crimes may be affected by prison differ-
ently. Finally, we found some suggestive evidence that the effect of prison on employment
starts out negative and seems to turn positive over time, suggesting that there is treat-
ment effect heterogeneity across different time frames for the outcome. Unfortunately,
our research design and setting are poorly positioned to examine effect heterogeneity
directly. Although we use multiple cutoffs as multiple instruments in