A Generalized Fellegi--Sunter Framework for Multiple Record Linkage With Application to Homicide Record Systems
Mauricio Sadinle and
Stephen E. Fienberg
Journal of the American Statistical Association, 2013, vol. 108, issue 502, 385-397
Abstract:
We present a probabilistic method for linking multiple datafiles. This task is not trivial in the absence of unique identifiers for the individuals recorded. This is a common scenario when linking census data to coverage measurement surveys for census coverage evaluation, and in general when multiple record systems need to be integrated for posterior analysis. Our method generalizes the Fellegi--Sunter theory for linking records from two datafiles and its modern implementations. The goal of multiple record linkage is to classify the record K -tuples coming from K datafiles according to the different matching patterns. Our method incorporates the transitivity of agreement in the computation of the data used to model matching probabilities. We use a mixture model to fit matching probabilities via maximum likelihood using the Expectation--Maximization algorithm. We present a method to decide the record K -tuples membership to the subsets of matching patterns and we prove its optimality. We apply our method to the integration of the three Colombian homicide record systems and perform a simulation study to explore the performance of the method under measurement error and different scenarios. The proposed method works well and opens new directions for future research.
Date: 2013
References: View complete reference list from CitEc
Citations: View citations in EconPapers (14)
Downloads: (external link)
http://hdl.handle.net/10.1080/01621459.2012.757231 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:jnlasa:v:108:y:2013:i:502:p:385-397
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/UASA20
DOI: 10.1080/01621459.2012.757231
Access Statistics for this article
Journal of the American Statistical Association is currently edited by Xuming He, Jun Liu, Joseph Ibrahim and Alyson Wilson
More articles in Journal of the American Statistical Association from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().