EconPapers    
Economics at your fingertips  
 

How to choose an approach to handling missing categorical data: (un)expected findings from a simulated statistical experiment

Svetlana Zhuchkova () and Aleksei Rotmistrov ()
Additional contact information
Svetlana Zhuchkova: HSE University
Aleksei Rotmistrov: HSE University

Quality & Quantity: International Journal of Methodology, 2022, vol. 56, issue 1, No 1, 22 pages

Abstract: Abstract The study is devoted to a comparison of three approaches to handling missing data of categorical variables: complete case analysis, multiple imputation (based on random forest), and the missing-indicator method. Focusing on OLS regression, we describe how the choice of the approach depends on the missingness mechanism, its proportion, and model specification. The results of a simulated statistical experiment show that each approach may lead to either almost unbiased or dramatically biased estimates. The choice of the appropriate approach should be primarily based on the missingness mechanism: one should choose CCA under MCAR, MI under MAR, and, again, CCA under MNAR. Although MIM produces almost unbiased estimates under MCAR and MNAR as well, it leads to inefficient regression coefficients—ones with too big standard errors and, consequently, incorrect p-values.

Keywords: Categorical data; Complete case analysis; Missing data; Missing indicator method; Multiple imputation; Random forest; Regression analysis; Statistical experiment (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
http://link.springer.com/10.1007/s11135-021-01114-w Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:qualqt:v:56:y:2022:i:1:d:10.1007_s11135-021-01114-w

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11135

DOI: 10.1007/s11135-021-01114-w

Access Statistics for this article

Quality & Quantity: International Journal of Methodology is currently edited by Vittorio Capecchi

More articles in Quality & Quantity: International Journal of Methodology from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:qualqt:v:56:y:2022:i:1:d:10.1007_s11135-021-01114-w