Regression analysis with linked data: problems and possible solutions
Andrea Tancredi () and
Brunero Liseo ()
Additional contact information
Andrea Tancredi: Università di Roma "La Sapienza", Italy
Brunero Liseo: Università di Roma "La Sapienza", Italy
Statistica, 2015, vol. 75, issue 1, 19-35
Abstract:
In this paper we have described and extended some recent proposals on a general Bayesian methodology for performing record linkage and making inference using the resulting matched units. In particular, we have framed the record linkage process into a formal statistical model which comprises both the matching variables and the other variables included at the inferential stage. This way, the researcher is able to account for the matching process uncertainty in inferential procedures based on probabilistically linked data, and at the same time, he/she is also able to generate a feedback propagation of the information between the working statistical model and the record linkage stage. We have argued that this feedback effect is both essential to eliminate potential biases that otherwise would characterize the resulting linked data inference, and able to improve record linkage performances. The practical implementation of the procedure is based on the use of standard Bayesian computational techniques, such as Markov Chain Monte Carlo algorithms. Although the methodology is quite general, we have restricted our analysis to the popular and important case of multiple linear regression set-up for expository convenience.
Keywords: Bayesian regression; Hit-miss model; Metropolis-Hastings algorithm; Record linkage (search for similar items in EconPapers)
Date: 2015
References: Add references at CitEc
Citations: View citations in EconPapers (2)
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bot:rivsta:v:75:y:2015:i:1:p:19-35
Access Statistics for this article
Statistica is currently edited by Department of Statistics, University of Bologna
More articles in Statistica from Department of Statistics, University of Bologna Contact information at EDIRC.
Bibliographic data for series maintained by Giovanna Galatà ().