90
Migrant Self-Selection: Anthropometric Evidence from the Mass Migration of Italians to the United States, 1907–1925 * Yannay Spitzer [email protected] Brown University Ariell Zimran [email protected] Northwestern University August 8, 2014 Abstract Are migrants positively or negatively self-selected from within their populations of origin? We study this fundamental and persistent question of the economics of migration using data on one of the largest flows of free migration ever—that of Italians to the United States between 1907 and 1925. We exploit never-before-used stature data in the Ellis Island arrival records—from which we transcribed the heights and other personal information of a random sample of 50,000 Italian passengers—combined with Italian province-birth cohort height distributions and our own geo-matching of millions of Italian passengers to their places of origin in order to construct a novel data set for our analysis. Relying on the well-established relationship between population average stature and living standards, we quantify migrant self-selection by comparing the heights of migrants to the height distributions of their respective birth cohorts in their provinces of origin. Our analysis reveals opposite patterns of self-selection across and within Italian provinces. Italian migrants were shorter, on average, than all Italians of the same birth cohort, suggesting negative self-selection on the national level. However, when compared only to the distribution of stature in their own provinces of origin, we find that Italian passengers were, on average, taller, indicating positive self-selection on the local level. Moreover, we find that the degree of self-selection from a province and birth cohort was decreasing in its average stature, suggesting that less-developed province-cohorts, where liquidity constraints to migration were more likely to bind, provided relatively higher quality migrants. The findings of this research demonstrate the importance of distinguishing between self-selection from a country as a whole and self-selection from within a particular sub-national region. Comparisons of migrants to their national-level origins, which are the norm in the literature on migrant self-selection, may fail to capture a significant portion of the self-selection occurring within a group of potential migrants from a particular sub-national region. * The most recent version of this paper can be found at http://aez.econ.northwestern.edu/spitzer_zimran_italian_ stature.pdf. The results in this paper are preliminary and may be affected by ongoing research and data transcription. Please contact the authors before citing or circulating this paper. A previous version of this paper circulated under the title “Self- Selection of Immigrants on the Basis of Living Standards: Evidence from the Stature of Italian Immigrants at Ellis Island, 1907–1925.” 1

MigrantSelf-Selection: AnthropometricEvidencefromtheMass ... › eha › wp-content › uploads › 2014 › 05 › Zimran.pdfeconomy characterized by lower real wages (Hatton and

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

  • Migrant Self-Selection: Anthropometric Evidence from the MassMigration of Italians to the United States, 1907–1925∗

    Yannay [email protected]

    Brown University

    Ariell [email protected]

    Northwestern University

    August 8, 2014

    Abstract

    Are migrants positively or negatively self-selected from within their populations of origin? We studythis fundamental and persistent question of the economics of migration using data on one of the largestflows of free migration ever—that of Italians to the United States between 1907 and 1925. We exploitnever-before-used stature data in the Ellis Island arrival records—from which we transcribed the heightsand other personal information of a random sample of 50,000 Italian passengers—combined with Italianprovince-birth cohort height distributions and our own geo-matching of millions of Italian passengers totheir places of origin in order to construct a novel data set for our analysis. Relying on the well-establishedrelationship between population average stature and living standards, we quantify migrant self-selectionby comparing the heights of migrants to the height distributions of their respective birth cohorts intheir provinces of origin. Our analysis reveals opposite patterns of self-selection across and within Italianprovinces. Italian migrants were shorter, on average, than all Italians of the same birth cohort, suggestingnegative self-selection on the national level. However, when compared only to the distribution of staturein their own provinces of origin, we find that Italian passengers were, on average, taller, indicating positiveself-selection on the local level. Moreover, we find that the degree of self-selection from a province andbirth cohort was decreasing in its average stature, suggesting that less-developed province-cohorts, whereliquidity constraints to migration were more likely to bind, provided relatively higher quality migrants.The findings of this research demonstrate the importance of distinguishing between self-selection froma country as a whole and self-selection from within a particular sub-national region. Comparisons ofmigrants to their national-level origins, which are the norm in the literature on migrant self-selection,may fail to capture a significant portion of the self-selection occurring within a group of potential migrantsfrom a particular sub-national region.

    ∗The most recent version of this paper can be found at http://aez.econ.northwestern.edu/spitzer_zimran_italian_stature.pdf. The results in this paper are preliminary and may be affected by ongoing research and data transcription. Pleasecontact the authors before citing or circulating this paper. A previous version of this paper circulated under the title “Self-Selection of Immigrants on the Basis of Living Standards: Evidence from the Stature of Italian Immigrants at Ellis Island,1907–1925.”

    1

    mailto:[email protected]:[email protected]://aez.econ.northwestern.edu/spitzer_zimran_italian_stature.pdfhttp://aez.econ.northwestern.edu/spitzer_zimran_italian_stature.pdf

  • Acknowledgements We are indebted to Joel Mokyr, Joseph Ferrie, Igal Hendel, and Aviv Nevo forencouragement and guidance, and to the Northwestern University Economics Department’s Eisner Fund,the Northwestern University Center for Economic History and an Exploratory Travel and Data Award fromthe Economic History Association for financial support. We are grateful to Peg Zitko and the Statue ofLiberty-Ellis Island Foundation for providing the Ellis Island arrival records data, to Brian A’Hearn, FrancoPeracchi, and Giovanni Vecchi for sharing their computed moments of Italian stature distributions, andto Jordi Martí-Henneberg for sharing historical GIS files of Italy. We also thank Luigi Guiso, TimothyHatton, Seema Jayachandran, John Komlos, Lee Lockwood, Andrea Matranga, Paola Sapienza, MarianSmith, Richard Steckel, and Zachary Ward for helpful suggestions and insightful comments. Thanks are alsodue to Roy Mill for giving us access to the dEntry transcription system and for investing considerable time andenergy adjusting it to our needs; to Daniel Bird, Maureen Craig, Aanchal Jain, and Anand Krishnamurthyfor helpful discussions; to seminar participants at Northwestern University and conference participants atthe 2014 Warwick Economics PhD Conference and the Cliometrics Conference; and to Joshua Picache,Kris Angelo Belino, Abmelaine Pastores, Chermilyn Sarmiento, and Mary Rose Manlapaz for excellenttranscription. All errors are our own.

    2

  • 1 Introduction

    [A]lthough drawn from classes low in the economic scale, the new immigrants as a rule are thestrongest, the most enterprising, and the best of their class . . . .

    (The Dillingham Commission, US Congress, 1911, p. 24)

    Between 1892 and 1925, nearly four million Italians immigrated to the United States—the largest sin-

    gle flow during the Age of Mass Migration (Ferenczi and Wilcox, 1929, Tables 2–3, pp. 384–393). This

    phenomenon, part of a general contemporaneous trend of growth in migration to the United States from

    other southern and eastern European countries, sparked a debate over the policy of nearly total openness

    of the United States to immigration (Goldin, 1994). Public debate focused primarily on the “quality” of

    the southern and eastern European migrants.1 Groups favoring the restriction of immigration warned of a

    decline in the quality of immigrants, arguing that these immigrants, unlike those arriving en masse from

    northern and western Europe in prior decades, represented the poor, incapable, uneducated, unskilled, and

    criminal elements of their origin countries; that is, that they were negatively selected from within their

    environments of origin. In the late 1910s and early 1920s, after decades of agitation, such sentiment finally

    prevailed with the passage of sweeping immigration restrictions, culminating in the Immigration Act of 1924,

    which effectively ended unfettered large-scale immigration from Italy and other countries in the European

    periphery (Hatton and Williamson, 1998, ch. 9).

    Despite these allegations and the abundance of research, both modern and contemporary, that they

    precipitated (e.g., Gomellini and Ó Gráda, 2013; Hall, 1904; Stolz and Baten, 2012; US Congress, 1911), the

    question of whether migrants during the Age of Mass Migration were positively or negatively self-selected

    remains unresolved.2 Even in the modern context, determining the nature and causes of migrant self-selection

    remains at the forefront of research in the economics of migration (Borjas, 1987; Chiquiar and Hanson, 2005;

    Fernández-Huertas Moraga, 2013; McKenzie and Rapoport, 2010), and is crucial in understanding the effects

    of migration on the source and host economies (c.f., Biavaschi and Elsner, 2013). If, for example, migrants

    are positively self-selected from within their populations of origin, then emigration, by disproportionately

    leading to the exit of more productive individuals from the sending economy, may harm it (Bhagwati, 1976;

    Di Maria and Stryszowski, 2009; Docquier and Rapoport, 2012; Mattoo, Neagu, and Özden, 2008; Todaro,

    1We use the term “quality” here to refer to any traits that affect an individual’s productivity. Examples include education,skill, health, wealth, and intelligence. Proponents of immigration restriction in the early 20th century had an even broaderdefinition, arguing, for example, that these new immigrants were more likely to be involved in criminal activity, or lacked ahistory of self-governance that would be crucial to their assimilation in the United States.

    2Abramitzky, Boustan, and Eriksson (2012, 2013) also study the self-selection of migrants in the Age of Mass Migration,focusing on Norwegians.

    3

  • 1996). Conversely, the receiving economy may benefit from the influx of these productive individuals. If

    migrants are negatively self-selected, the opposite may occur.

    However, empirical answers regarding migrant self-selection remain elusive, primarily due to a number

    of data limitations that make direct comparisons between migrants and the population at risk for migra-

    tion difficult, if not impossible in some contexts. In particular, a lack of representative data on the source

    population often confounds efforts to quantify migrant self-selection. Even when comparison data are avail-

    able, they generally cannot be disaggregated to geographic levels below the country of origin, raising the

    possibility that the nature of self-selection from within source populations is obscured by composition effects

    across sub-national units. Moreover, most data on migrant quality are observed only after arrival in the host

    country, raising the possibility that they do not reflect the pre-migration characteristics of migrants. In most

    of the few cases in which these issues can be overcome, the measure typically used to compare migrants to

    stayers is occupation, which, although informative regarding individual skill and human capital, comprises

    only a rough measure of a migrant’s economic capability, reflecting only limited aspects of it.

    In the present research we study self-selection into migration using stature to measure migrant quality.

    This approach is grounded in a large body of research, which has established that the average stature of a

    large group is indicative of the group’s average economic capability—an amalgamation of many of its facets,

    such as skill (Komlos, 1990), education (Case, Paxson, and Islam, 2009), income, (Deaton, 2007; Persico,

    Postlewaite, and Silverman, 2004), wealth (Floud, Wachter, and Gregory, 1990), health (Fogel, 1986; Steckel,

    1995), childhood environment (Bailey, Hatton, and Inwood, 2014), and cognitive ability (Case and Paxson,

    2008), that all determine a prospective migrant’s contribution to his home economy, and his labor market

    outcomes in the host economy. Stature is thus, at an aggregate level, a proxy for economic capability and

    productivity in a broader sense than are other commonly used measures, such as occupation-implied skill.

    Moreover, adult stature is fixed for those in a relatively broad age range and, for such individuals, is unaffected

    by migration. The premise of this paper is that migrant self-selection can be quantified by comparing the

    average stature of migrants to that of the populations of origin. If, for example, the population of migrants is

    taller, on average, than the overall population in the sending economy, then it can be deduced that migrants

    are more economically productive than non-migrants on average.

    Applying this approach to the Italian migration to the United States enables a very reliable comparison

    of the migrant population to the source population, as the stature distribution for the source population

    of Italian adult males is known.3 Moreover, this distribution is known at a geographically disaggregated

    3In most countries, military height data are only available for a self-selected group of individuals choosing to join themilitary. In Italy, however, all males were required to be measured by the military. The resulting data were collected by

    4

  • level, enabling us to avoid obstacles stemming from the fact that the Italian migration was composed of

    individuals originating in many heterogeneous provinces, and to explore the relationship between provincial

    characteristics and different features of migration from each province. Studying a historical episode of

    migration also carries many advantages that are unavailable in the study of modern migration. This approach

    effectively avoids difficulties created by the fact that modern migratory flows are censored by restrictive

    immigration policies, and are thus not representative of the latent supply of those willing and able to migrate.

    Studying historical migration in which such barriers did not exist, allows scholars to cleanly identify migrant

    self-selection at the source, to learn the mechanisms that determine the nature of migrant self-selection, and

    to use these insights to make inferences regarding the effects of changes in migration policy on the quantity

    and the quality of migrants.

    In order to perform this analysis, we constructed a novel data set consisting of the stature, place of

    origin, and other personal information of Italian passengers from the Ellis Island arrival records database.

    First, we created a geolocation algorithm to assign each of the nearly five million passengers in the Ellis

    Island database to his province of origin based on his reported last place of residence. Next, we randomly

    sampled approximately 50,000 Italian passengers arriving between 1907 (when information on stature was

    first collected on manifests of immigrants arriving in the United States) and 1925 (when the restrictions

    of the Immigration Act of 1924 entered into force), and transcribed their stature and other information

    regarding the nature of their voyage to the United States. We then compared the heights of migrants to the

    distributions of Italian stature gleaned by A’Hearn, Peracchi, and Vecchi (2009) and A’Hearn and Vecchi

    (2011) from Italian military records covering nearly all Italian males at conscription age.

    This analysis reveals opposite patterns of self-selection across and within Italian provinces. Italians

    passing through Ellis Island were shorter, on average, than all Italians of the same birth cohort, providing

    evidence of negative self-selection on the national level. However, when compared only to the distribution

    of stature in their own provinces of origin, we find evidence that Italian passengers were, on average, taller,

    indicating positive self-selection on the local level. The difference between these two findings is driven by

    positive self-selection within southern provinces, which were the origins of a disproportionately large share of

    migrants, and in which the average stature was below the national average for Italy. Moreover, we find that

    immigrants from northern Italy tended to be negatively self-selected from within their provinces of origin—

    the opposite of their southern compatriots. Moreover, the degree of positive within-province self-selection of

    immigrants arriving in the United States after 1917 was far greater than that of immigrants arriving in the

    A’Hearn, Peracchi, and Vecchi (2009) and A’Hearn and Vecchi (2011), and made available to us.

    5

  • pre-1917 period.

    We further investigate what factors determine the degree of self-selection from within provinces, thus

    providing a test of three major theories of migrant self-selection—relative inequality, liquidity constraints,

    and network connections.4 We find that the degree of migrant self-selection was decreasing in the level of

    development of the province of origin (as measured by its average stature), indicating that immigrants from

    relatively less-developed environments were, on average, of higher quality relative to their provinces and

    birth cohorts of origin than those from relatively more developed environments. We also find evidence that

    migrants who were able to finance their own passage were more positively self-selected, on average. These

    results are consistent with theories that predict positive self-selection due to the need to overcome liquidity

    constraints to migration. We also find that individuals who migrated to join an immediate family member

    were, on average, shorter than those who did not. This finding is consistent with the notion that chain

    migration particularly helpful for lower quality migrants in overcoming liquidity constraints to migration.

    We do not find any robust and statistically significant evidence supporting theories that hold that the nature

    of migrant self-selection is determined by the relative inequality of the sending and receiving countries.

    Although taken from the report of the anti-immigration Dillingham Commission, the epigraph to this

    paper, like our results, demonstrates the importance of distinguishing between self-selection from a country

    as a whole and self-selection from within a particular sub-national region, and of conceding that the two levels

    of self-selection may be qualitatively different. Comparisons of migrants to their national-level origins, which

    are the norm in the literature on migrant self-selection (e.g., Chiquiar and Hanson, 2005; Stolz and Baten,

    2012), may fail to capture a significant portion of the self-selection occurring within a group of potential

    migrants from a particular sub-national region.

    The remainder of the paper proceeds as follows. Section 2 provides the relevant historical and economic

    background for this study. Section 3 discusses the data construction process, and provides summary statistics

    for the data set used in this study. Section 4 presents the main results, which are interpreted in section

    5. Section 6 evaluates various theories of migrant self-selection. Section 7 discusses possible threats to

    identification. Section 8 concludes.

    4These theories are discussed in more detail in section 2.2.

    6

  • 2 Background

    The issue of migrant self-selection, particularly from Italy in the early 20th century, but also in the context

    of modern migration, has been studied extensively by economists, modern and historical. In this section, we

    provide background on the mass migration of Italians in the early 20th century, as well as on the body of

    economic knowledge on the issue of migrant self-selection.

    2.1 Historical Background

    At the beginning of the 20th century, Italy lagged behind most other western European countries in terms

    of nearly every economic indicator. As shown in Figure 1, real wages were low, less than half their level

    in Britain (O’Rourke, 1997). Moreover, Italy’s industrial production lagged that of its neighbors (Ciccarelli

    and Fenoaltea, 2013), and malaria and other diseases were endemic, particularly in the south (Foerster,

    1919). As a result, living standards in Italy, measured by average stature, fell short of those of most other

    European countries, as depicted in Figure 2. These poor economic conditions spurred many Italians to leave

    their home to seek opportunity elsewhere. Such was the strength of this incentive that by the turn of the

    century, Italy had become the largest source of migrants to the United States, displacing such countries as

    Ireland, Great Britain, Sweden, and Norway. Moreover, as depicted in Figure 3, Italy led Europe in terms

    of relative migration, with the highest rates of emigration per capita of any European country in the period

    1900–1913.5

    In some ways, the Italian migration was typical of the Age of Mass Migration: migrants were mostly

    young, unskilled, and male; but in other ways, Italians were distinct from other migrants. First, they tended

    to distribute themselves between several destination countries, primarily the United States, Argentina, and

    Brazil. Between 1886 and 1895, nearly 75 percent of Italians traveling to the Americas went to Argentina or

    Brazil, with the remainder going to the United States. By the period 1906–1915, the United States became

    the lead destination for Italians, drawing more than twice the numbers of Brazil and Argentina combined.

    There was also considerable (mostly seasonal) migration to other European countries of a magnitude rivaling

    that of the flow to the United States (Hatton and Williamson, 1998, Table 6.1, p. 101). Second, roughly three-

    quarters of migrants were male (Hatton and Williamson, 1998, p. 102), a gender imbalance exceeding that

    of almost any other group. Finally, Italians were, more than any other group of migrants, likely to migrate

    temporarily rather than to remain abroad permanently (Gomellini and Ó Gráda, 2013). The canonical

    5If we include sub-national ethnic groups, however, Russian Jews were more likely to emigrate than Italians (see Spitzer,2013, 2014, for more details).

    7

  • example is the tendency of these so-called “birds of passage” to exploit the seasonal differences between Italy

    and South America, traveling between the two in order to participate in both harvests (Foerster, 1919; Hatton

    and Williamson, 1998). Return migration also occurred from the United States; as the annual nominal wage

    in the United States was nearly five times that in Italy, and the cost of round trip passage would consume

    only 20 percent of those earnings, many Italians would travel to the United States to work, and then return or

    remit the money to their families in Italy (Gomellini and Ó Gráda, 2013). It has been estimated (Bandiera,

    Rasul, and Viarengo, 2013, Table 4, p. 37) that as many as 80 percent of Italian migrants to the United

    States eventually returned to Italy.

    Discussing emigration from Italy as a whole obscures the considerable variation in emigration rates and

    the general patterns of migration across Italian provinces (Hatton and Williamson, 1998, p. 106). Whereas

    most Italian emigrants in the 1880s were from the north (Gomellini and Ó Gráda, 2013), the poorer south

    had taken the lead in terms of emigration rates by 1900 (Ferenczi and Wilcox, 1929, Table 10, pp. 432–443;

    Hatton and Williamson, 1998, Table 6.4, p. 107). The greater emigration rates from the south were driven

    primarily by the fact that the south was much poorer than the north. As a primarily rural agricultural

    economy characterized by lower real wages (Hatton and Williamson, 1998, pp. 115–116), southerners had

    a stronger push to emigrate than did northerners. In addition, the north’s relative proximity to major

    European labor markets caused many northerners to migrate within Europe rather than overseas. Thus,

    the mix of Italian immigrants to the United States was primarily southern, and therefore drawn from the

    relatively poorer portions of the country.

    Americans were aware that the bulk of Italian immigration to the United States increasingly originated

    in the poorer south, and many were displeased with the new growth of Italian immigration in the early

    20th century. Many of those opposed to a continued openness to immigration felt that malicious forces

    were at work in Europe to transfer the least desirable elements of the population of Europe to the United

    States (Commissioner-General of Immigration, 1903; Hall, 1904), citing the immigrants’ lack of skill, and

    (as perceived by anti-immigration activists) lack of mental and physical fortitude as evidence. Writing at

    the height of the migration, the Commissioner-General of Immigration (1903, p. 70) asserted that

    The great bulk of the present immigration proceeds from Italy, Austria, and Russia, and, further-more, from some of the most undesirable sources of population of those countries. No one wouldobject to the better classes of Italians, Austrians, and Russians coming here in large numbers;but the point is that such better element does not come.

    Notably, claims of such negative selection were not seriously disputed (c.f., Douglas, 1919). Even ad-

    8

  • vocates of continued openness to immigration accepted them, arguing that better measures ought to have

    been taken to prevent the dependent and criminal elements from entering the United States (Brandenburg,

    1904), that the United States’s tradition as a haven for immigrants was worth maintaining, and that the

    immigrants would eventually converge (even physically) to American standards through their time in the

    United States (Boas, 1911).

    Nativist concerns led to the formation of the Dillingham Commission (US Congress, 1911), which was

    charged with investigating the nature and effects of the mass immigration. After collection and analysis of

    considerable data, the commission enumerated in great detail the negative characteristics of the immigrants,

    ranging from their poor living conditions to their lack of education and skill, eventually concluding that

    immigration restriction was necessary in order to protect the “national character.” These restrictions culmi-

    nated in the literacy test of 1917 and finally in the quotas of the Immigration Act of 1924, which brought

    an end to mass immigration to the United States from the European periphery.

    2.2 The Economics of Migrant Self-Selection

    Theoretical foundations for the economic analysis of migrant self-selection are laid out by Borjas (1987). Ac-

    cording to his modification of the Roy (1951) model, the nature of self-selection into migration is determined

    by the relative returns to skill in the sending and receiving economies. If the returns to skill in the receiving

    country are higher relative to those in one source country than in another, emigrants from the latter are

    predicted to be more strongly positively self-selected than those from the former on the basis of skill. In

    most studies of migrant self-selection, the relative returns to skill are proxied by the relative inequality of

    the income distributions of each country.6 When focusing on relative inequality, positive self-selection of

    migrants is predicted to be induced when the income distribution of the sending country is less unequal

    than that of the destination country. Conversely, when the sending country’s income distribution is more

    unequal than that of the destination country, migrants are predicted to be negatively self-selected. Borjas

    (1987), for example, uses this framework to argue that the deteriorating performance of successive cohorts

    of immigrants in the latter half of the twentieth century (as measured by their earnings and integration into

    the American labor market) can be explained by the fact these immigrants have increasingly originated in

    very unequal countries, and are thus of lower quality.7

    This so-called relative inequality model has met with mixed empirical success. Chiquiar and Hanson6We are grateful to Timothy Hatton for pointing out this distinction.7Interestingly, similar arguments were made to explain the labor market performance of the “new” immigrants from the

    southern and eastern European periphery as compared to that of immigrants from northern and western Europe in the early20th century.

    9

  • (2005) find that immigrants from Mexico, in which the income distribution is very unequal, are not negatively

    self-selected on the basis of earnings, skill, or education. Other recent empirical studies have also found

    evidence of positive self-selection into migration, usually with respect to skill or education, regardless of the

    relative inequality of income distributions (Feliciano, 2005; Gould and Moav, 2010; Grogger and Hanson,

    2011). These findings are rationalized by the presence of migration costs and borrowing constraints that

    disproportionately inhibit migration by those in the lower tail of the income distribution (Chiquiar and

    Hanson, 2005; Chiswick, 1999; Mishara, 2007). Therefore, regardless of the nature of self-selection of those

    wishing to migrate, only those of higher quality are able to overcome liquidity constraints and actually do

    so, generating more positive self-selection. However, Ibarraran and Lubotsky (2007) and Fernández-Huertas

    Moraga (2011) dispute this finding. Several explanations have been offered to reconcile these results. Borger

    (2009), McKenzie and Rapoport (2010), Spitzer (2013), and Wegge (1998) argue that the direction of self-

    selection is indelibly tied to the strength of the potential migrant’s social network. Stronger social connections

    in the destination country enable individuals who would otherwise be unable to overcome liquidity constraints

    to migration to do so, resulting in a weakening of the distortive effects of migration costs and borrowing

    constraints on the Roy model effects. Belot and Hatton (2012) and Fernández-Huertas Moraga (2013)

    attempt to reconcile these disparate theories. Belot and Hatton (2012) show that once poverty constraints

    are accounted for, patterns of self-selection appear to correspond to the predictions of the relative inequality

    model. Fernández-Huertas Moraga (2013) finds that a combination of the three explanations is required

    to fully account for differences in the pattern of self-selection between urban Mexico (whence migrants are

    negatively self-selected) and rural Mexico (whence they are positively self-selected).

    A different explanation for the composition of migratory flows is given by development economists, who

    have recently examined the role of risk in the migration decision, which is highlighted by Harris and Todaro

    (1970). Bryan, Chowdhury, and Mobarak (2014) find evidence that risk aversion prevents rural-to-urban

    migration in developing countries. Those with greater wealth, who would be better able to bear the risk,

    would therefore be more likely to migrate, generating positive self-selection.

    Overall, despite a vast literature studying the issue of migrant self-selection in both modern and historical

    contexts, a consensus on its nature and its causes and mechanisms remains elusive. Findings of different

    directions of self-selection in different studies make external generalizations difficult.

    10

  • 2.2.1 Issues of Data Availability

    Empirical disagreement regarding the direction and causes of migrant self-selection can be partially attributed

    to data limitations that prevent or restrict comparisons between migrants and the population at risk for

    migration in the migrants’ economy of origin. In the absence of other data, many studies have relied on

    aggregate data of the place of origin to study migrant self-selection (Bohlin and Eurenius, 2010; Hatton and

    Williamson, 1998; Lowell, 1987; Runblom and Norman, 1976). This approach is generally used when micro

    data are unavailable, for example, when only aggregate statistics on the volume of migration between two

    countries are available. In this approach, self-selection into migration is studied by comparing migration

    patterns across regions. If, for example, the rate of migration from higher income areas is greater than

    that from other areas, then the conclusion is drawn that migrants are positively self-selected on the basis of

    income. The approach is confounded, however, if self-selection also occurs within regions. Returning to the

    previous example, migrants from areas with higher average income may be poorer than non-migrants from

    the same area and are thus properly understood to be negatively self-selected on the basis of income.

    Ideally, micro data would be used, pinning down the types of migrants and permitting the comparison of

    individuals within a specific (possibly sub-national) source population to one another. However, samples in

    which prospective migrants’ quality is observed prior to migration are rare (Akee, 2010). Instead, scholars

    using micro data to study modern migration are often forced to rely on data collected after the migration

    has taken place (Chiquiar and Hanson, 2005; McKenzie and Rapoport, 2010). Except for certain indicators,

    measures collected after the migrants have been in the receiving economy for some time are likely to be

    contaminated by the experience in the destination. For example, occupations of immigrants may change

    depending on the labor market conditions of the receiving country.8 Most studies in which such data are

    available focus only on very small migration flows, such as from Pacific island nations to the United States

    and New Zealand (Akee, 2010; McKenzie, Gibson, and Stillman, 2010), or from Finland to Sweden (Rooth

    and Saarela, 2007).

    Even if pre-migration data on migrants are available, a comparison between migrants and non-migrants

    requires data on the distribution of productive characteristics of the population of origin. Without data on

    the population of origin, it cannot be determined whether migrants are positively or negatively self-selected.

    For example, individuals with low education in an absolute sense may in fact be highly educated relative to

    8For example, Perlmann (2000) shows that the share of laborers and manufacturing workers among Jewish immigrantsduring the Age of Mass Migration was much higher than in the population of origin. Occupation-based self-selection is possible,but Perlmann (2000) argues that many migrants may simply have changed occupations on arrival. Ferrie (1997) also raises theissue that many immigrants tend to work in different occupations before and after migration.

    11

  • their population of origin, but this cannot be determined without data on the source population. Chiquiar

    and Hanson (2005) show that failing to take this issue into account can lead to spurious conclusions regarding

    the nature of self-selection.

    Fernández-Huertas Moraga’s (2013) study of Mexican migrants to the United States is one of the few

    that is able to overcome these constraints. He compares survey data on migrants to that on non-migrants

    from surveys conducted before migration occurs, and is able to compare the two routs on the basis of

    wages, unemployment rates, and labor market participation. However, Fernández-Huertas Moraga’s (2013)

    study, like nearly all studies of modern migration, suffer from a problem of dual selection. That is, two

    sample selection processes operate to determine the composition of modern migratory flows: the process

    that determines whether migrants find it optimal to migrate (which the literature on migrant self-selection

    is interested in understanding), and the selection caused by restrictive immigration policy; the latter process

    generally obscures the former, and comparisons between migrants and non-migrants do not reveal the nature

    of migrant self-selection.

    2.2.2 Advantages of Historical Data

    Historical data make it possible to overcome some of these hurdles. First, the problem of dual selection

    generally does not apply. Specifically, prior to the literacy restrictions imposed in 1917, migration to the

    United States from Europe was almost entirely unrestricted. Even after the literacy test was imposed, Goldin

    (1994) hypothesizes that migration from Europe to the United States was not significantly restricted until

    the quotas imposed in 1924. Thus, migrants in the Age of Mass Migration were selected only by the process

    about which we wish to learn: that which determines whether individuals find it optimal to migrate. It is

    therefore possible to identify cleanly the self-selection of migrants at the source. Second, historical data often

    provide access to micro data that are unavailable to researchers in modern contexts. For example, unlike

    most modern data, historical data are not subject to confidentiality restrictions. Moreover, much modern

    migration is undocumented; the lack of significant legal restrictions to migration in our study period made

    it unnecessary to enter the United States illicitly, ensuring that most migration was documented.

    Several studies have used historical data to overcome these data limitations. Abramitzky, Boustan,

    and Eriksson (2013) exploit the availability of tax data in Norway to study self-selection on the basis of

    wealth into migration to the United States during the Age of Mass Migration. Unfortunately, Abramitzky,

    Boustan, and Eriksson (2013) are forced, by issues of data availability, to rely primarily on a binary indicator

    of whether a household owned any taxable assets, and are, in their analysis of international migration, unable

    12

  • to further disaggregate wealth. The issue of coarseness also arises when occupational data are used, such as

    by Abramitzky, Boustan, and Eriksson (2012), Biavaschi and Elsner (2013), and Wegge (1999, 2002). Such

    data generally require that individuals be characterized as either skilled or unskilled, masking much useful

    variation in the quality of prospective migrants. It is also possible to rank occupations by median income

    (Abramitzky, Boustan, and Eriksson, 2012; Biavaschi and Elsner, 2013), but no variation within occupations

    is recovered. For instance, this approach cannot differentiate between poor and wealthy farmers, who would

    have had vastly different living standards. Indirect inference can also be made from the post-migration

    outcomes of immigrants and their children. For instance, Ferrie and Mokyr (1994) find higher rates of

    entrepreneurship among immigrants than natives, suggesting positive self-selection. Moreover, Abramitzky,

    Boustan, and Eriksson (2014) find that immigrants from some European countries in this period hold higher-

    paid jobs than natives on arrival, suggesting that they may also have been positively self-selected.

    In most historical literature, however, even coarse data on traditional economic indicators are generally

    unavailable.9 Instead, two alternative methods of measuring migrant quality relative to the population at

    risk for migration are common in the historical literature. Mokyr (1983) and Mokyr and Ó Gráda (1982),

    for instance, use age heaping, based on individual age reports, to infer the numeracy of Irish immigrants to

    the United States. Stolz and Baten (2012) perform a similar analysis, comparing migrants from a number

    of countries to census records.

    Finally, even when all of these constraints can be overcome, it is generally only possible to evaluate the

    self-selection of migrants on a national level; that is, migrants are generally classified only by their country of

    origin. Analysis of self-selection on the national level, however, may obscure self-selection at the local level

    as a result of composition effects across sub-national entities, leading to incorrect conclusions regarding the

    true nature of self-selection. Fernández-Huertas Moraga’s (2013) study of modern Mexican immigration to

    the United States is, in part, an exception to this restriction. While he does not distinguish between different

    geographic places of origin within Mexico in determining self-selection, he does distinguish between migrants

    from urban and rural areas in Mexico, finding that migrants from each sector are self-selected differently.

    Pooling the sample shows evidence of negative self-selection, obscuring differences in incentives and ability

    to migrate among different sectors of the population. As we show in the present research, Italy is a case in

    point, with migrants from North and South Italy exhibiting different patterns of self-selection.

    9Wegge (2002) also collects data on the wealth of migrants, but systematic misreporting of this figure due to restrictionson the expatriation of cash, together with the lack of comparable data for non-migrants, makes it difficult to draw conclusionsregarding self-selection with respect to wealth.

    13

  • 2.2.3 Stature as a Measure of Pre-Migration Living Standards

    The use of stature as a measure of economic capability and productivity is grounded in a large literature.

    Fogel (1986, 1994), Fogel, Engerman, and Trussell (1982), Komlos and Meermann (2007), and Steckel (1995)

    summarize the vast literature establishing a relationship between adult stature and the standard of living

    experienced by a population in youth. With the genetic variation in height across individuals averaging

    out in comparisons of large groups to one another (Eveleth and Tanner, 1976; Frisancho, 1993; Martorell

    and Habicht, 1986; Silventoinen, 2003; Steckel, 1995),10 the average stature of a population represents the

    difference between its gross nutrition in youth (principally, its caloric intake) and contemporaneous demands

    on nutrition, such as labor and disease. Thus, in addition to being correlated with traditional measures of

    the standard of living, such as income or GDP per capita, stature captures additional facets of welfare such

    as health and consumption (Steckel, 1995). The variation in stature is also informative about the degree

    of inequality in the population in the consumption of inputs to stature (such as food, health, and leisure)

    (Komlos, 1985, 1990; Komlos and Baten, 2004; Steckel, 1995)—a feature that Stolz and Baten (2012) exploit

    in order to measure inequality when other data are lacking.

    Stature is not only correlated with inputs to individual productivity. In essence, stature is a compos-

    ite measure of human economic capability—an amalgamation of all factors that ultimately determine an

    individual’s standard of living. Thus stature reflects overall quality and economic capability in two ways.

    Individuals facing better conditions in childhood (e.g., more food availability, less disease, less hard work)

    will both become taller as adults and will also develop superior cognitive skills (Case and Paxson, 2008).

    These individuals tend to become more educated than their peers (Case, Paxson, and Islam, 2009), and to

    earn higher wages (Lundborg, Nystedt, and Rooth, 2009; Persico, Postlewaite, and Silverman, 2004) and en-

    ter into higher-skilled work (Komlos, 1990). Height might also reflect unobserved ability through an indirect

    channel: if the provision of a better childhood environment, which would make children taller, is correlated

    with parents’ characteristics, such as ambition and resourcefulness, then taller children are more likely to

    have inherited such productive characteristics from their parents. Furthermore, for certain occupations,

    there are returns to strength, which is correlated with height (Bodenhorn, Guinnane, and Mroz, 2013).

    Stature data can therefore be used to address several of the shortcomings of previous studies of migrant

    self-selection by overcoming many of the data limitations that they have faced. While other measures

    of pre-migration welfare, such as occupation and wealth, have their own advantages,11 the resistance of

    10The lack of a genetic difference in adult heights is particularly true when the two groups are from the same place of origin,as is the case in the present research.

    11For example, occupational status is measured with less idiosyncratic noise, and thus can be used in cases in which only

    14

  • stature to contamination by post-migration events, its correlation with unobserved ability, education, health,

    consumption, and pre-migration welfare, and the availability of finely measured stature data make it an

    attractive tool for the study of migrant selectivity. Applying this approach specifically to Italian migration

    also makes it possible to study self-selection from sub-national regions due to the availability of geographically

    disaggregated and finely measured data on the stature of the Italian population of the time. The historical

    coverage of the data remove the dual selection issue.

    Stature has been used by several scholars to study migrant self-selection. Crimmins et al. (2005) examine

    the self-selection of Mexican migrants to the United States in the modern context. However, the lack of

    geographically disaggregated data and the confounds raised by the dual selection of migrants in modern

    data limit the generalizability of the conclusions. Kosack and Ward (2013) expand this approach, analyzing

    Mexican immigration to the United States in the early 20th century. Unfortunately, they are unable to

    compare the stature of migrants to that of a representative sample of Mexicans as no such sample is known

    to exist for their study period. Instead the average stature of migrants is compared to that of volunteer

    soldiers and passport applicants. As Bodenhorn, Guinnane, and Mroz (2013) point out, however, both of

    these comparison samples are likely to suffer from sample-select biases. Thus, it is impossible to determine

    whether the finding that Mexican migrants were taller than soldiers and shorter than passport applicants

    is an indication of the self-selection of migrants, of the comparison samples in question, or of some mixture

    thereof. Humphries and Leunig (2009) study the location choices of mid-nineteenth-century British seamen

    based on height. The scope of conclusions that can be drawn from this study are very limited in their

    generalizability to the self-selection of an entire population into international migration.

    Our study thus improves upon previous attempts to understand self-selection into migration on a number

    of fronts. First, we use an easily and finely measured variable that is known to reflect living standards and

    other facets of quality, and whose measurement is not affected by the decision to migrate. Second, we compare

    migrants to data on the population of origin that are virtually free of self-selection. Third, our comparison is

    disaggregated to the province-birth cohort level, enabling us to study self-selection within small population

    bins, as well as the variation in self-selection across time and space—all while remaining cognizant of the

    different origin populations of the migrants. Finally, our focus on the Age of Mass Migration allows us to

    cleanly ascribe observed self-selection to individuals’ migration decisions, rather than to a combination of

    these decisions and restrictive policies.

    small samples are possible. It is also directly informative on skill and human capital. In contrast, stature requires large samplesin order to eliminate idiosyncratic differences between individuals.

    15

  • 2.3 Self-Selection of Italian Migrants in the Age of Mass Migration

    Although there have been a number of attempts to determine the nature of self-selection of Italians migrating

    to the United States during the Age of Mass Migration, a clear answer has eluded researchers (Gomellini

    and Ó Gráda, 2013). In all cases, the difficulties in studying migrant self-selection discussed above apply.

    Arguments of negative self-selection are advanced by Betrán and Pons (2004), who find that skill premia

    were falling in Italy and rising in the United States during the Age of Mass Migration. These trends indicate

    that unskilled laborers were disproportionately overrepresented in emigration, leading to a relative scarcity

    of unskilled labor in Italy. Stolz and Baten (2012) also present evidence of negative self-selection, finding

    that age heaping among Italian migrants was greater than among the origin population, suggesting negative

    self-selection on the basis of numeracy. Giffoni and Gomellini (2013), studying the relationship between

    migration and school dropout rates, support this view, at least partially, arguing that they find no evidence

    of positive self-selection of Italy. Arguments for positive selection, however, are advanced by Gomellini and

    Ó Gráda (2013), who point out that south Italian immigrants were more likely to be literate than their origin

    populations. Notably, the latter study makes a comparison of migrants to their region of origin, while the

    former compares immigrants to the entire country.

    Anthropometric measures have also been used in this context. Danubio, Amicone, and Vargiu (2005)

    sample citizenship petitions filed by Italian immigrants in Massachusetts and find an average height greater

    than that reported by A’Hearn (2003) and Federico (2003), and computed by A’Hearn, Peracchi, and Vecchi

    (2009) and A’Hearn and Vecchi (2011) for the population of Italy. Gomellini and Ó Gráda (2013) interpret

    this result as suggesting positive selection into migration. The present research builds on this strategy by

    disaggregating the scope of analysis to the sub-national level of Italy and by using data collected prior to

    migration, thus eliminating the possibility of post-migration contamination of stature through continued

    growth.

    3 Data

    The data set used in this paper is novel in two ways. First, it makes use of the stature data in the Ellis

    Island arrival records, which we discuss in further detail below. Second, it links Italian migrants to their

    places of origin with a great deal of disaggregation. In this section we discuss the collection of our data in

    further detail, provide summary statistics, and test whether our sample is representative of the population

    of migrants.

    16

  • 3.1 Data Sources

    The primary data sources for our analysis are the province-birth cohort-level Italian stature distributions

    computed from military conscription records by A’Hearn, Peracchi, and Vecchi (2009) and A’Hearn and

    Vecchi (2011), in addition to the Ellis Island arrival records database. These two sources are discussed

    immediately below. We also supplement these data with population and literacy data from the Direzione

    Generale della Statistica e del Lavoro (1912) and the Ministero di Agricoltura, Industria e Commercio (1915,

    1925).

    3.1.1 Italian Stature Data

    Analysis of self-selection of any kind based on stature requires a comparison sample that is known to ac-

    curately represent the population at risk for migration, or at least to represent non-migrants as a whole,

    without further self-selection. We are fortunate that such data exist in the Italian case. As a comparison

    sample for our migrants, we use height data compiled as a part of the Italian military conscription process.12

    During the period in question, all Italian males, regardless of physical condition, were required to present

    themselves for a medical examination, during which their heights were measured and recorded (A’Hearn,

    Peracchi, and Vecchi, 2009). As these data are the product of nearly the full male population of Italy

    (Cole, 1995), they are representative of the population as a whole. In particular, these data (as corrected

    by A’Hearn, Peracchi, and Vecchi, 2009; A’Hearn and Vecchi, 2011) are unlikely to suffer from issues of

    self-selection that are problematic in the historical heights literature.

    We acquired two sets of data based on the conscription data, one from A’Hearn, Peracchi, and Vecchi

    (2009) and the other from A’Hearn and Vecchi (2011). The A’Hearn and Vecchi (2011) data contain the

    raw means and standard deviations of the height distributions of each province (except Caserta) and birth

    cohort from 1855–1910, as well as these values standardized to their age-20 values. We refer to the latter

    data as the “Unsmoothed Age 20” series. Examples of the time series of means and standard deviations of

    the unsmoothed age-20 distributions are presented in Figure 4. The distributions of age-20 stature may not

    be suitable for comparison to those of migrant stature due to the possibility of growth after age 20. Although

    Beard and Blaser (2002) and Frisancho (1993) show that modern populations reach terminal height by age

    20, the same need not be true of our study population. Indeed, a number of studies (A’Hearn, Peracchi, and

    Vecchi, 2009; Fogel, Engerman, and Trussell, 1982; Frisancho, 1993) discuss the potential for malnutrition to

    both reduce final adult height and to delay the onset of the adolescent growth spurt (AGS), leading growth

    12For a detailed description of the data and their origin and collection, see A’Hearn, Peracchi, and Vecchi (2009).

    17

  • to continue into the early 20s. Similarly, A’Hearn, Peracchi, and Vecchi (2009) report that the delayed AGS

    may be responsible for a decline in the standard deviation of height in a population with age.

    Unfortunately, the delayed AGS is a poorly quantified anthropometric phenomenon. We were not able

    to find any literature quantifying the effect of nutrition on the rate of growth over the lifespan, and thus

    we have only a limited understanding of the bias introduced by using the age-20 distributions as a basis for

    comparison. In particular, we do not know to what extent the bias (i.e., the continued growth after age

    20) depends on average height at age 20. What we do know, however, is that shorter cohorts are likely to

    continue growing further into their twenties. That is, the age 20 distributions may be an image of height

    that is earlier in the growth process for shorter populations than for taller populations. This issue would

    thus create a mechanical bias toward finding stronger positive self-selection among shorter populations.13

    We therefore take advantage of computations performed by A’Hearn, Peracchi, and Vecchi (2009). The

    primary computations of these authors resulted in the lines in Figure 4 labeled “Smoothed Age 20,” which

    represent the time-smoothed age-20-corrected means and standard deviations of stature. They also adjust

    these distributions for continued growth to age 22, with the results represented in the series labeled “Smoothed

    Age 22.” These distributions are based on changes in the timing of measurement over the lifespan by the

    Italian military,14 but are, for the most part, out-of-sample projections performed by A’Hearn, Peracchi, and

    Vecchi (2009). Nonetheless, the growth that these adjusted height distributions depict relative to the age 20

    distributions constitutes the most rigorous possible analysis of post-age-20 growth for the population under

    analysis. However, the smoothed age-22 distributions eliminate potentially valuable within-province variation

    over time. We therefore compute an unsmoothed age 22 distribution, labeled “Implied Age 22” in Figure

    4, by adjusting the unsmoothed age-20 means by the province-birth cohort-specific difference between the

    smoothed age-20 and smoothed age-22 means. We perform a similar operation on the standard deviations

    of the distributions, which are similarly smoothed by A’Hearn, Peracchi, and Vecchi (2009) and not by

    A’Hearn and Vecchi (2011). By performing this correction, we produce province-birth cohort-specific height

    distributions normalized to age 22, at which the risk of further growth would have become negligible even in

    malnourished populations. We therefore consider these distributions to be the best available representations

    13We have replicated the results of this paper using the age 20 distributions for comparison. The magnitude of self-selectionthat we find is much stronger than our main results in this paper.

    14Specifically, there were variations in the age of measurement by the Italian military induced by the military calling updifferent birth cohorts at different ages. A’Hearn, Peracchi, and Vecchi (2009) report that the vast majority of birth cohortsare measured at age 20, but that for institutional reasons, some were measured as early as age 18, and others as late as age22. Based on this variation, A’Hearn, Peracchi, and Vecchi (2009) compute the average stature at age 22 for each province andbirth cohort by extrapolating from the age 20 distributions that they observe using the differences in the stature observed incohorts measured at different ages.

    18

  • of the average adult height of each birth-cohort and province.15 The time trend in the average height of the

    Italian population is depicted in Figure 5.

    It may be the case, however, that some smoothing of these moments is necessary. There is likely very

    little sampling error in the moments, as they come from nearly the entire male population; but there may

    be error in the reporting of ages at Ellis Island that leads us to assign passengers to the wrong birth cohort,

    and thus to the wrong comparison distribution. Some smoothing of the moments over time may therefore be

    necessary. We therefore compute for each province a kernel regression of each moment against the birth year,

    thus providing a smoothed version of the moments of the distributions.16 The smoothed moments are also

    presented in Figure 4, and are labeled “Our Smoothed Age 22.” Comparison to these distributions produces

    results that are not appreciably different from those driven by comparisons to unsmoothed distributions

    except in a small number of cases noted below.

    3.1.2 Ship Manifests

    Our information on the stature and other personal characteristics of migrants is taken from the Ellis Island

    arrival records database, which includes information on nearly all passengers who passed through the Port of

    New York from 1897 to 1925.17 This database is composed of passenger manifests deposited at Ellis Island, of

    which Figure 6 presents an example. Some of the information on these manifests is already transcribed, while

    the rest is available in handwritten form on the scanned manifests. These manifests were completed upon

    embarkation by the steamship companies transporting the passengers to Ellis Island, and were primarily

    intended to fulfill two purposes. First, they were used to maintain statistics on the number of immigrants of

    each gender and nationality entering the United States. Second, they were part of an effort to ensure that

    immigrants who might become a public charge, who were ill, or were otherwise undesirable (for instance, by

    being anarchists or polygamists), were prevented from entering the United States (Bureau of Immigration

    and Naturalization, 1909). Steamship companies were therefore required to assert that they had examined

    all passengers, and to affirm that they did not violate any of these restrictions. Beginning in late 1906,

    with the passage of the Immigration Act of 1906 (US Congress, 1907), passenger manifests were required to

    include a physical description of the passenger, of which height was a component.

    15We have also produced all of the results presented below with the age 20 distributions as the point of comparison. Allresults are stronger with the age 20 distributions than with the age 22 distributions.

    16We compute our own smoother in order to provide a province-specific average over time. We do not use the smoothedmeans computed by A’Hearn, Peracchi, and Vecchi (2009) because they are not simply averages over time, but are insteadaffected by the temporal trend in other provinces.

    17The first five years during which Ellis Island was in operation (1892–1897) are only partially covered for two reasons. First,Ellis Island at this time operated in conjunction with the older Castle Garden facility, where some immigrants were processed.Second, an 1897 fire at Ellis Island destroyed many records that were stored there.

    19

  • Figure 8 presents the time series of arrivals according to both the Ellis Island data base and the official

    immigration statistics of the United States (Ferenczi and Wilcox, 1929, Tables 2 and 3, pp. 384–393). The

    former exceeds the latter for two reasons. First, it includes both immigrants and individuals entering tem-

    porarily, while the official immigration statistics only include people entering with the intention of remaining

    permanently. We also include in Figure 8 the Ellis Island statistics deflated by the number of individuals in

    our sample who report being first-time arrivals—a proxy for the number of actual immigrants. Second, the

    Ellis Island data include individuals who purchased passage but never embarked;18 the official immigration

    statistics include only actual arrivals. The official statistics may also include individuals not included in the

    Ellis Island data, as the Port of New York was not the only place of entry for Italian migrants,19 though it

    was the location of the bulk of arrivals.

    We acquired from the Statue of Liberty-Ellis Island Foundation (SOLEIF) a subset of this database,

    consisting of the transcribed information of the roughly 4.8 million individuals passing through Ellis Island

    in this period who either reported their ethnicity as Italian, north Italian or south Italian, or whose country

    of origin was Italy. We restricted the sample to those arriving in 1907 or later so as to consider only those

    whose heights would have been recorded under the new law.20 This restriction left approximately 2.8 million

    passengers in the sample. Next, we geocoded the passengers’ reported last place of residence using an

    algorithm outlined in Appendix A.21 As we discuss in Appendix A, a variety of tests and exercises show

    that this algorithm is remarkably accurate for the individuals who can be matched: the rate of false matches

    may be below five percent. Moreover, as shown in Figure 7, the correlation of average provincial heights

    of men aged 22–65 recorded at Ellis Island and average provincial heights reported by the Italian military

    (as adjusted by us and by A’Hearn and Vecchi, 2011) is 0.72. In section 3.3, we analyze whether there are

    differences between individuals who were matched, and those who were not. We formally explore the possible

    effects of incorrect geolocation on our results in section 7.2.

    We then sampled approximately 50,000 passengers arriving after 1907, for whom we transcribed infor-

    mation from the original manifests that was not already transcribed by the SOLEIF.22 The data that we18We thank Drew Keeling for pointing this issue out to us. Our present sample includes individuals who did not embark; we

    will adjust the sample in future transcription.19Secondary ports of arrival, such as Boston, New Orleans, and Philadelphia also received substantial migratory flows from

    Italy; but all of these together amounted to a small share of the total.20There were also a small number of passengers who reported a place of residence in Italy but an ethnicity other than Italian,

    north Italian, or south Italian. We also omit these individuals from consideration.21A possibly more appropriate indicator of an immigrant’s origin would be the place of birth; however, unlike the last place

    of residence, this information is not available in digital form. If internal migration was common in Italy at the time, there wouldbe differences between the two locations that could lead to incorrect assignment of individuals to comparison distributions. Infuture work, we will transcribe the place of birth of a sample of migrants and compare them to the last place of residence inorder to determine the extent of possible error generated by using the last place of residence instead of the place of birth inorder to select a comparison distribution for each individual.

    22We transcribed a simple random sample of households (identified by the ordering of individuals on the manifests and by a

    20

  • received in digital form (indicated by the dashed lines in Figure 6a) included the passenger’s name, marital

    status, age, date of arrival, ethnicity, nationality, and last place of residence. We transcribed the answers

    to four additional questions asked regarding the migrant (indicated by dashed thick solid lines in Figure

    6b): whether he had paid for his own passage, and if not, who had paid for the passage; whom he would be

    joining in the United States; whether he had ever been in the United States before; and his height.23

    3.2 Summary Statistics

    Figure 9 depicts the arrivals of Italian passengers in the entire 1907–1925 period, disaggregated by the

    province to which they were matched by our geolocation algorithm. A striking pattern is evident: first,

    southerners were much more likely to migrate to the United States than their northern counterparts. In

    particular, the rates of emigration in the regions of Sicily and Abruzzo are over 12 percent, while those in

    Emilia Romagna were nearer to two percent. Moreover, southerners represented a much larger proportion

    of all Italian passengers traveling to the United States than did northerners. Eleven provinces of southern

    Italy, and none in northern Italy, were the origins of more than 50,000 geolocated passengers each. Moreover,

    nearly all provinces from which fewer than 5,000 geolocated passengers originated are located in the north.

    In total, 82 percent of passengers in our geolocated sample are matched to a southern province.24

    We restrict our sample to individuals aged 22–65 who could be matched to a province of origin by our

    geolocation algorithm.25 We make this age restriction and retain it throughout the paper, as this is the range

    of ages over which we can be confident that terminal height has been achieved, but rapid shrinking has not

    begun.26 Moreover, we see a peculiar pattern in the age distribution of male migrants, which is illustrated in

    Figure 10. As is typical of the Age of Mass Migration, the density of the age distribution is greatest in the

    early twenties. There is, however, a large dip in the distribution between ages 18 and 21, a trend that we do

    not observe among Italian women, and which is not present, for instance, among Russian Jewish immigrants

    (Spitzer, 2013, 2014). We believe that this dip, which corresponds to the age of military service, may be

    common last name) and not of individuals. Thus, an individual traveling with one companion was twice as likely to be sampledas an individual traveling alone. Of all passengers between 1907 and 1925, nearly 75 percent traveled alone, and 94 percenttraveled in groups of three or less. All further discussions are therefore corrected for this sampling technique through the useof appropriate weights.

    23We are very grateful to Roy Mill for providing us with access to his dEntry transcription system, and for devotingconsiderable time and effort to making it compatible with the requirements of this project.

    24This figure falls to 81% when individuals from Caserta, for which we do not have population stature information, aredropped.

    25We examine whether our algorithm induced sample selection bias in section 3.3.26Cline et al. (1989) show that shrinking begins essentially as soon as final height is attained, but accelerates with age. In

    any event, changing the end point of our sample in terms of age will not have large effects on our results, as there are relativelyfew older immigrants as compared to younger ones. The distribution of ages in our sample is illustrated in Figure 10. In thefew cases in which our results are qualitatively affected by reducing the terminal age of our sample to 40 (which is a moreconventional terminal age Silventoinen, 2003), we describe the difference.

    21

  • attributable to the restrictions on legal emigration for males in this age range (Cole, 1995). We therefore

    suspect that migrants in this age range are self-selected differently from their fellow countrymen emigrating

    at a later age.27

    We present summary statistics in Table 1. Column (1) presents summary statistics for all men and

    women in our geolocated sample, restricting attention to the fields for which no transcription was necessary.

    Consistent with official statistics (Ferenczi and Wilcox, 1929), we see that the immigrants are overwhelmingly

    male—more than 75 percent of our sample. Moreover, approximately 70 percent of passengers in our sample

    reported being married. Approximately 85 percent are matched to a province in southern Italy, as defined

    by the Bureau of Immigration. Columns (2) and (3) present these statistics for males and females separately.

    Female passengers are, on average, older than male passengers, approximately as likely to be married, and

    very slightly less likely to be from southern Italy.

    In columns (4)–(9), we restrict attention to the sample of individuals for whom we transcribed additional

    information. Column (4) presents the information for all transcribed individuals, while column (5) presents

    it for females and column (6) presents it for males. The already-digitized information for each group is

    similar to that for the untranscribed sample. Based on our transcription, we classified any passenger listing

    any person whom they would be joining in the United States as having some connection, and any individual

    who reported joining an immediate family member in the United States (i.e., a sibling, parent, child, or

    spouse) as having an immediate family connection in the United States. Over 95 percent of male and female

    migrants report that they have some connection (“Any Conn.”), but males were far less likely to report that

    this connection was an immediate family member (i.e., a parent, sibling, child, or spouse), with only 32

    percent falling in this group (“Imm. Fam. Conn.”= 1) as compared to nearly 74 percent of women. Similar

    differences are apparent in the fraction of men and women reporting having been in the United States

    before (“Repeater”). More than 40 percent of men reported that they had been in the United States before,

    compared to only 16 percent of women. A gender difference is also apparent in whether the passenger had

    paid for himself, with 90 percent of men paying their own passage and only 66 percent of women doing so.

    Moreover, an unusual relationship exists between the heights of men and women, the distributions of which

    are presented graphically in Figure 11. In particular, female passengers were much taller relative to male

    passengers than is commonly the case in modern populations (Gaulin and Boster, 1985). This relationship

    is discussed in more detail in Appendix B. As discussed in this Appendix, however, we find no reason to

    believe, based on this relationship, that there are systematic issues of accuracy in our data. However, given

    27Comparisons to the age 20 distributions of province and birth cohort height indicate that those migrating at the ages of20 and 21 are negatively self-selected.

    22

  • that we do not have data on the distributions of stature for women in Italy, we exclude women from our

    analysis.

    In column (7), we eliminate from the sample any passenger who indicated that he had been in the United

    States before. We make this restriction primarily for two reasons. First, the process of self-selection into

    return migration is not well understood (though it has recently received scholarly attention: Abramitzky,

    Boustan, and Eriksson, 2012, 2014; Bandiera, Rasul, and Viarengo, 2013; Crimmins et al., 2005; Ward,

    2013). Distinguishing between first-time and return migration prevents our sample from being contaminated

    by some other form of self-selection (i.e., into return migration) and prevents us from counting the same

    passengers more than once. Second, these passengers may have arrived in the United States before completing

    their physical growth, and would thus have grown differently (Boas, 1911, 1920; Gravlee, Bernard, and

    Leonard, 2003; Kress, 2007; Sparks and Jantz, 2002, 2003). Therefore, our benchmark sample will be that

    summarized in column (7)—males aged 22–65 who reported never having been in the United States. The

    remaining migrants after this deletion are younger than the repeat passengers, less likely to be married or

    to have an immediate family connection, and very slightly shorter. All of the results discussed below are

    stronger when these repeat passengers are included.

    Next, we summarize the geographic distribution of male heights graphically. Figure 12 presents the

    average male heights of each province (based on A’Hearn, Peracchi, and Vecchi, 2009 and A’Hearn and

    Vecchi, 2011) weighted by our passenger counts, as well as the average heights of male passengers in our

    sample from each province. The average military heights exhibit a strong pattern, with the tallest provinces

    in the north, the shortest in the south, and the middling provinces in the center. We see a similar trend in

    the heights of migrants, with the tallest originating in the north, and the shortest in the south. Column (7)

    of table 1 also shows that the average heights of male migrants in our sample was 163.80 cm.28

    Finally, we study separately two samples that allow us to break down the analysis by time period. As

    is evident from Figure 8, World War I was a massive disruption to trans-Atlantic migration. Moreover,

    in 1917, the United States enacted the literacy test, requiring that all adults entering the United States

    demonstrate literacy. Both of these events fundamentally changed international migration, and there is

    reason to believe that post-1917 passengers might be substantially different from pre-1917 passengers, and

    that pre-War migrants may have differed from the post-War migrants. We therefore split the sample into

    pre-1917 (exclusive) and post-1917 (inclusive) subsamples, which are summarized in columns (8) and (9),

    28By contrast, the average American soldier (who may have been negatively self-selected) born in the 1860s was 171 cm tall(Zehetmayer, 2011, Figure 1, p. 320).

    23

  • respectively, of Table 1.29 Most striking is the large difference in stature between the two periods: passengers

    in the post-1917 are more than one centimeter taller on average. They are also more than eight percentage

    points more likely to have an immediate family connection, six percentage points less likely to be married,

    and four percentage points more likely to have paid for their own passage as compared to those arriving

    before 1917.

    3.3 Representativeness of the Geolocated Sample

    Before beginning our primary analysis, we examine whether our geolocation algorithm has produced a

    representative sample for our analysis. First, we estimate a number of regressions of the form

    yi = β0 + β1Gi + εi,

    where yi is some individual characteristic of interest, and Gi is an indicator equal to one if individual i is

    successfully matched to a province by our algorithm, and zero otherwise. The coefficient β1 tests whether

    there is a difference in the mean of each characteristic between the geolocated and non-geolocated groups.

    Table 2 presents estimates of β1 for a variety of individual characteristics of interest and for a variety of

    samples. One division of the data is based on the recorded ethnicity of passengers traveling from Italy.

    Beginning in 1903, the passenger manifests were required to include the ethnicity (“Race or People”) of

    immigrants (Perlmann, 2001; Weil, 2000). North Italians and south Italians were officially considered to be

    two separate ethnicities. The instructions for clerks completing the passenger manifests placed the dividing

    line between north and south Italy at the southern extreme of the basin of the River Po.30 Nevertheless,

    compliance with the official definitions of the ethnicities appears to have been lax, and some passengers were

    still recorded as simply Italian, without further disaggregation. We refer to these passengers as “General

    Italians.” Figure 13 also depicts the division of Italy into North and South by the Bureau of Immigration and

    Naturalization. This field provides information on the probable geographic origin of individuals independent

    29We could also split the sample into pre-1914 (inclusive) and post-1919 (inclusive) subsamples in order to omit the WorldWar I years, but given the fall in the quantity of migration during this period, there is essentially no difference between thisapproach and ours.

    30Specifically, the manifest defined north Italians as[t]he people who are native to the basin of the River Po in northern Italy (i.e., compartments of Piedmont,Lombardy, Venetia, and Emilia) and their descendants, whether residing in Italy, Switzerland, Austria-Hungary,or any other country . . . . Most of these people speak a Gallic dialect of the Italian language.

    South Italians were defined as[t]he people who are native to that portion of Italy south of the basin of the River Po (i.e., compartments ofLiguria, Tuscany, the Marches, Umbria, Rome, the Abruzzi and Molise, Campania, Apulia, Basilicata, Calabria,Sicily, and Sardinia) and their descendants . . . .

    24

  • of our algorithm. We use it in Appendix A in order to test the accuracy of our algorithm, and here in

    order to determine if our algorithm led to imbalances between matched and unmatched individuals within

    geographic regions.

    We begin in column (1) of Table 2 by studying all males aged 22–65 in our 1907–1925 group of passengers.

    Our analysis indicates that south Italians are significantly overrepresented in our geolocated sample (66.5

    percent of the geolocated sample, as opposed to 58.6 percent of the non-geolocated sample) while north

    Italians are slightly underrepresented and general Italians are significantly underrepresented. This under-

    representation of general Italians is likely due to the fact that Italians traveling through non-Italian ports

    were less likely to be assigned a north/south ethnicity, and their locations are likely to have been recorded

    with less accuracy due to a lack of familiarity by clerks in other countries with Italian geography and

    spelling.31 We also find that those in our geolocated sample are on average 0.08 years younger than the

    non-geolocated. In addition, the average birth year is 0.35 years later. There is no statistically significant

    difference in marriage rates between groups.

    As we are interested in observing differential self-selection patterns across provinces, we must verify that

    the sample is balanced at the provincial level, which can be approximated (without using our geolocation

    algorithm) by ethnicity. In columns (2)–(4) of Table 2, we break down the the sample used in column (1) by

    the ethnicity of migrants. There exist statistically significant differences in the probability of being married

    between the geolocated and non-geolocated groups for each ethnicity, but these are small, likely reflecting

    the large sample sizes as much as any actual differences. Similarly, differences in age and birth year exist,

    but are also small. Moreover, differences in age and birth year are not particularly troubling, as all of our

    analyses condition on birth year by comparing the height data of migrants to the averages of their birth

    cohort, and height is constant with age in our range.

    Next, we perform the same exercises restricting attention to the transcribed sample. Columns (5)–(8)

    report the difference in the means of various characteristics between transcribed males aged 22–65 who were

    matched to a province by our algorithm, and those who were not matched. Statistically and economically

    significant differences persist in matching rates across ethnicities. However, there are no statistically sig-

    nificant differences in age, birth year, or marital status between groups, even within ethnicities. We also

    compare matched and unmatched individuals on the basis of transcribed information. We find no statisti-

    cally significant differences between matched and unmatched individuals on the basis of transcribed data,

    except among northerners with respect to the measures of social network status and whether the passengers

    31For example, the modal departure port for ships characterizing all Italian passengers as simply Italian was Cherbourg,France, while the modal departure port for ships decomposing all Italian passengers by ethnicity was Naples.

    25

  • paid for their own passage. In particular, matched individuals are four percentage points less likely to have

    any connection in the United States, nine percentage points less likely to have an immediate family connec-

    tion, and four percentage points more likely to have paid for their own passage than unmatched individuals.

    These differences have no implications for our main results (which are based only on height). They could

    potentially have implications for our results regarding the mechanisms driving self-selection.

    The fact that our main results are based on stature makes it particularly important to test the repre-

    sentativeness of our sample on this front. In particular, there are two potential dangers that we face in

    terms of the balancedness of our sample. First, as we compare the heights of migrants to those of the Italian

    population, it is important to ensure that our sample of migrants is representative of all migrants, rather

    than being biased upward or downward in height by our algorithm. Although not statistically significant,

    Table 2 shows that matched north Italians were 0.375 centimeters taller than their unmatched fellows, while

    south Italians were 0.009 centimeters shorter. If these difference represent a small but non-spurious bias,

    they would bias our estimates against our baseline results of more positive self-selection in the south. Figure

    14 shows this result in greater detail: among passengers of all ethnicities, the probability of being matched

    is all but constant over height. Within the separate ethnicities, there is more noise among the rare heights,

    but essentially the same conclusion follows.

    Second, since we test whether, within a province, migrants are taller or shorter than their source popula-

    tion, it is important to ensure that, conditional on the province of origin and its mean height, our matched

    sample is not taller or shorter than the unmatched group. Finally, since we test whether the trends in

    self-selection differ across cohorts and provinces or different mean heights, we must ensure that there are no

    differences in the differences between the heights of the matched and unmatched individuals across cohorts

    of different average stature. However, determining the province to which an individual belongs requires a

    successful match, which we do not have for the matched individuals. We must therefore find some way

    of associating unmatched individuals with provinces and birth cohorts independently of the geolocation

    algorithm.

    To this end, we use the following procedure, which takes advantage of the fact that Italian surnames

    are useful indicators of geographic origins (Guglielmino and De Silvestri, 1995). First, for each surname,

    we determine the modal province to which individuals with that surname who could be geolocated were

    assigned. Then, for the purposes of this exercise only, we assign all individuals to the modal province for

    their surname. We then use their (known) birth year to assign them to a province and birth cohort, from

    which a mean height for each individual’s province and birth cohort is determined. The rationale behind

    26

  • this exercise is the following: family names can be used to group migrants into bins that are clustered across

    space. If geolocated individuals are randomly drawn from within each province and birth cohort, then we

    expect that the height distributions of matched and unmatched passengers would be the same within each

    surname bin. Mapping passengers to a province predicted by their surnames brings us as close as possible

    to comparing height distributions within provinces and birth cohorts and makes it possible to test whether

    the matched-unmatched gap changes systematically with the height of the province and birth cohort.

    We first use the results of this procedure to estimate the regression equation

    zijt = β0 + β1µjt + β2Gijt + β3Gijtµjt + εijt, (1)

    where zijt represents the standardized (by the surname-implied province-birth cohort mean and standard

    deviation) height of individual i, who is matched (by this procedure) to province j and birth cohort t, with

    mean height µjt (normalized to have mean zero), and where Gijt is an indicator equal to one if individual

    i was matched to a province by our geolocation algorithm and to zero otherwise. We present the results

    of this regression in Table 3. There are two coefficients of interest. First, β2 indicates whether there is, at

    the average, a systematic difference in standardized heights between the geolocated and the non-geolocated.

    While we find that the geolocated individuals are, on average, taller than the non-geolocated conditional

    on the mean height of their place of origin, this difference is statistically insignificant. This difference is

    somewhat concerning, as it would tend to spuriously generate our findings in section 4.3. However, it must

    be kept in mind that the unmatched comprise less than 15 percent of migrants, which would make any

    sample selection biases induced by our algorithm small. Second, β3 indicates whether there is a systematic

    difference in the difference between the matched and unmatched groups between provinces and cohorts of

    different average heights. We find that this coefficient is positive, but that again, it is not statistically

    significant. Moreover, the positive sign works against our results in section 4.4. Thus, the true differential

    patterns are stronger than those that we measure.

    We also seek to verify that these findings are not driven by the linearity assumptions of equation (1).

    We therefore regress non-parametrically, in Figure 15a, zijt on µjt for each of the two groups of Gijt, and

    present an estimate of the difference between the two curves in Figure 15b.32 Throughout the range of µjt,

    the confidence band includes zero. Moreover, the nature of the difference, as above, is such that it would

    work against our differential results, except at the upper extreme of mean heights, at which the data are

    32The confidence bands in the graph are 95 percent point wise confidence intervals. We thank Anand Krishnamurthy forhelpful discussions on this topic.

    27

  • sparse. Thus, on the whole, these balancing tests do not show any compelling evidence of differences in

    heights or differential differences in heights between the matched and unmatched groups, nor compelling

    reason to believe that our results will be driven by imbalances in our geolocation algorithm, However, the

    point estimates show that we cannot rule out sample-selection bias entirely, and we will thus discuss the

    potential consequences of sample-selection bias below.

    4 The Nature and Degree of Self-Selection

    We are now equipped to begin examining the nature and degree of self-selection of Italian migrants. We

    first lay out a formal framework for our analysis. We then study migrants as compared to all of Italy

    before disaggregating the analysis to compare these migrants to their provinces of origin. We then study

    geograp