The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants
Tyler Anbinder,
Dylan Connor,
Cormac Ó Gráda and
Simone Wegge
Additional contact information
Tyler Anbinder: George Washington University
Dylan Connor: Arizona State University
Simone Wegge: College of Staten Island and The Graduate Center—CUNY
CAGE Online Working Paper Series from Competitive Advantage in the Global Economy (CAGE)
Abstract:
Automated census linkage algorithms have become popular for generating longitudinal data on social mobility, especially for immigrants and their children. But what if these algorithms are particularly bad at tracking immigrants? Using nineteenth-century Irish immigrants as a test case, we examine the most popular of these algorithms—that created by Abramitzky, Boustan, Eriksson (ABE), and their collaborators. Our findings raise serious questions about the quality of automated census links. False positives range from about one-third to one-half of all links depending on the ABE variant used. These bad links lead to sizeable estimation errors when measuring Irish immigrant social mobility.
Date: 2021
New Economics Papers: this item is included in nep-his, nep-isf and nep-ure
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (3)
Downloads: (external link)
https://warwick.ac.uk/fac/soc/economics/research/c ... tions/wp568.2021.pdf
Related works:
Working Paper: The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants (2021) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cge:wacage:568
Access Statistics for this paper
More papers in CAGE Online Working Paper Series from Competitive Advantage in the Global Economy (CAGE) Contact information at EDIRC.
Bibliographic data for series maintained by Jane Snape ().