Automated Linking of Historical Data

Ran Abramitzky, Leah Boustan, Katherine Eriksson, James Feigenbaum and Santiago Perez

Journal of Economic Literature, 2021, vol. 59, issue 3, 865-918

Abstract: The recent digitization of complete count census data is an extraordinary opportunity for social scientists to create large longitudinal datasets by linking individuals from one census to another or from other sources to the census. We evaluate different automated methods for record linkage, performing a series of comparisons across methods and against hand linking. We have three main findings that lead us to conclude that automated methods perform well. First, a number of automated methods generate very low (less than 5 percent) false positive rates. The automated methods trace out a frontier illustrating the trade-off between the false positive rate and the (true) match rate. Relative to more conservative automated algorithms, humans tend to link more observations but at a cost of higher rates of false positives. Second, when human linkers and algorithms use the same linking variables, there is relatively little disagreement between them. Third, across a number of plausible analyses, coefficient estimates and parameters of interest are very similar when using linked samples based on each of the different automated methods. We provide code and Stata commands to implement the various automated methods.

JEL-codes: C81 C83 N01 N31 N32
Date: 2021
Citations: 44 (in EconPapers)

Downloads: (external link)
https://www.aeaweb.org/doi/10.1257/jel.20201599 (application/pdf)
https://www.aeaweb.org/journals/data/icpsr-unavailable
https://www.aeaweb.org/doi/10.1257/jel.20201599.ds (application/zip)
Access to full text is restricted to AEA members and institutional subscribers.

Related works:
Working Paper: Automated Linking of Historical Data (2019)

Persistent link: https://EconPapers.repec.org/RePEc:aea:jeclit:v:59:y:2021:i:3:p:865-918

Ordering information: This journal article can be ordered from
https://www.aeaweb.org/journals/subscriptions

DOI: 10.1257/jel.20201599

Journal of Economic Literature is currently edited by Steven Durlauf

Journal of Economic Literature is published by the American Economic Association.
Bibliographic data for series maintained by Michael P. Albert.

 
Page updated 2025-03-22
Handle: RePEc:aea:jeclit:v:59:y:2021:i:3:p:865-918