EconPapers    
Economics at your fingertips  
 

Multiple imputation with large proportions of missing data: How much is too much?

Jin Hyuk Lee and John Huber
Additional contact information
Jin Hyuk Lee: Texas A&M Health Science Center

United Kingdom Stata Users' Group Meetings 2011 from Stata Users Group

Abstract: Multiple imputation (MI) is known as an effective method for handling missing data. However, it is not clear that the method will be effective when the data contain a high percentage of missing observations on a variable. This study examines the effectiveness of MI in data with 10% to 80% missing observations using absolute bias and root mean squared error of MI measured under missing completely at random, missing at random, and not missing at random assumptions. Using both simulated data drawn from multivariate normal distribution and example data from the Predictive Study of Coronary Heart Disease, the bias and root mean squared error using MI are much smaller than of the results when complete case analysis is used. In addition, the bias of MI is consistent regardless of increasing imputation numbers (M) from M = 10 to M = 50. Moreover, compared to the regression method and predictive mean matching method, the Markov chain Monte Carlo method can also be used for continuous and univariate missing variables as an imputation mechanism. In conclusion, MI produces less-biased estimates, but when large proportions of data are missing, other things need to be considered such as the number of imputations, imputation mechanisms, and missing data mechanisms for proper imputation.

Date: 2011-09-26
References: Add references at CitEc
Citations: View citations in EconPapers (4)

Downloads: (external link)
http://repec.org/usug2011/UK11_Lee.pptx (application/x-ms-powerpoint)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:boc:usug11:23

Access Statistics for this paper

More papers in United Kingdom Stata Users' Group Meetings 2011 from Stata Users Group Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().

 
Page updated 2025-03-19
Handle: RePEc:boc:usug11:23