Integration and imputation of survey data in R: the StatMatch package
Marcello D’Orazio
Additional contact information
Marcello D’Orazio: Italian National Institute of Statistics
Romanian Statistical Review, 2015, vol. 63, issue 2, 57-68
Abstract:
Statistical matching methods permit to integrate two or more data sources with the purpose of investigating the relationship between variables not jointly observed. Recently these methods received much attention as valid alternative to produce new statistical outputs.The paper provides an overview on the statistical matching methods implemented in the package StatMatch for the R environment, focusing on the most widespread methods and how they were improved. Particular attention is devoted to hot deck matching methods, strictly related to the ones developed for the imputation of missing values. The corresponding functions in StatMatch are very powerful and are flexible enough to be applied for imputing missing values in a survey. The paper tackles also the problem of matching data from complex sample surveys, a very important topic in National Statistical Institutes. Finally it is described the concept of uncertainty characterizing the statistical matching framework and how this alternative approach can be exploited for different purposes.
Keywords: hot deck imputation methods; statistical matching; uncertainty (search for similar items in EconPapers)
Date: 2015
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.revistadestatistica.ro/wp-content/uploads/2015/04/RRS2_2015_A06.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:rsr:journl:v:63:y:2015:i:2:p:57-68
Access Statistics for this article
More articles in Romanian Statistical Review from Romanian Statistical Review Contact information at EDIRC.
Bibliographic data for series maintained by Adrian Visoiu ().