High-Dimensional Imputation for the Social Sciences: A Comparison of State-of-The-Art Methods

Edoardo Costantini, Kyle M. Lang, Tim Reeskens and Klaas Sijtsma

Sociological Methods & Research, 2025, vol. 54, issue 2, 448-499

Abstract: Including a large number of predictors in the imputation model underlying a multiple imputation (MI) procedure is one of the most challenging tasks imputers face. A variety of high-dimensional MI techniques can help, but there has been limited research on their relative performance. In this study, we investigated a wide range of extant high-dimensional MI techniques that can handle a large number of predictors in the imputation models and general missing data patterns. We assessed the relative performance of seven high-dimensional MI methods with a Monte Carlo simulation study and a resampling study based on real survey data. The performance of the methods was defined by the degree to which they facilitate unbiased and confidence-valid estimates of the parameters of complete data analysis models. We found that using lasso penalty or forward selection to select the predictors used in the MI model and using principal component analysis to reduce the dimensionality of auxiliary data produce the best results.
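The two performance criteria named in the abstract, bias and confidence-interval coverage, can be illustrated with a small Monte Carlo sketch. This is not the authors' simulation design: it uses the plain sample mean on complete, normally distributed data as a stand-in estimator, and the function name `simulate_performance` and all parameter values are assumptions chosen for illustration only.

```python
import random
import statistics


def simulate_performance(true_mean=0.0, sd=1.0, n=100, reps=2000, seed=1):
    """Estimate bias and 95% CI coverage of the sample mean by Monte Carlo.

    Hypothetical stand-in: a real MI study would impute missing values,
    fit the analysis model, and pool estimates before this evaluation step.
    """
    rng = random.Random(seed)
    z = 1.96  # normal approximation to the 95% critical value
    estimates = []
    covered = 0
    for _ in range(reps):
        sample = [rng.gauss(true_mean, sd) for _ in range(n)]
        est = statistics.fmean(sample)
        se = statistics.stdev(sample) / n ** 0.5
        estimates.append(est)
        # The interval "covers" when it contains the true parameter value.
        if est - z * se <= true_mean <= est + z * se:
            covered += 1
    bias = statistics.fmean(estimates) - true_mean  # average estimate minus truth
    coverage = covered / reps  # share of intervals containing the truth
    return bias, coverage


bias, coverage = simulate_performance()
print(f"bias={bias:.4f}, coverage={coverage:.3f}")
```

Under these assumptions, a well-behaved estimator should show bias near zero and coverage near the nominal 0.95; an imputation method that distorts the data would typically move one or both away from those targets.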

Keywords: Multiple imputation; high-dimensionality; regularized regression; principal components; CART; random forest
Date: 2025
Downloads: (external link)
https://journals.sagepub.com/doi/10.1177/00491241231200194 (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:sae:somere:v:54:y:2025:i:2:p:448-499

DOI: 10.1177/00491241231200194

Page updated 2025-04-05
Handle: RePEc:sae:somere:v:54:y:2025:i:2:p:448-499