The problem of false positives in automated census linking: Nineteenth-century New York’s Irish immigrants as a case study
Cormac Ó Gráda, Tyler Anbinder, Dylan Connor and Simone A. Wegge
Historical Methods: A Journal of Quantitative and Interdisciplinary History, 2023, vol. 56, issue 4, 240-259
Abstract:
Automated census linkage algorithms have become popular for generating longitudinal data on social mobility, especially for immigrants and their children. But what if these algorithms are particularly bad at tracking immigrants? This study utilizes a database on nineteenth-century Irish immigrants, generated from the most widely used algorithms, created by Abramitzky, Boustan, and Eriksson (ABE). Our objective is to assess the extent to which different individuals are erroneously linked together across census years and the consequences of these “false positives” for calculating social mobility. Our findings raise serious questions about the quality of the matches generated by the “first generation” of automated census linkage algorithms. False positives range from about one-third to one-half of all links. These bad links lead to sizeable estimation errors when measuring Irish immigrant social and geographic mobility.
Date: 2023
Downloads: (external link)
http://hdl.handle.net/10.1080/01615440.2024.2312293 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:taf:vhimxx:v:56:y:2023:i:4:p:240-259
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/vhim20
DOI: 10.1080/01615440.2024.2312293
Historical Methods: A Journal of Quantitative and Interdisciplinary History is currently edited by J. David Hacker and Kenneth Sylvester
More articles in Historical Methods: A Journal of Quantitative and Interdisciplinary History from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst.