How to Deal with Missing Categorical Data: Test of a Simple Bayesian Method
Thomas Astebro and
Gongyue Chen
Additional contact information
Gongyue Chen: University of Waterloo [Waterloo]
Post-Print from HAL
Abstract:
The authors analyze the efficiency of six missing data techniques for categorical item nonresponse under the assumption that data are missing at random or missing completely at random. By efficiency, the authors mean a procedure that produces an unbiased estimate of true sample properties that is also easy to implement. The investigated techniques include listwise deletion, mode substitution, random imputation, two regression imputations, and a Bayesian model-based procedure. The authors analyze efficiency under six experimental conditions for a survey-based data set. They find that listwise deletion is efficient for the data analyzed. If data loss due to listwise deletion is an issue, the analysis points to the Bayesian method. Regression imputation is also efficient, but the result is conditioned on the specific data structure and may not hold in general. Additional problems arise when using regression imputation, making it less appropriate.
Keywords: missing data; categorical variable; Bayesian; imputation (search for similar items in EconPapers)
Date: 2003-07
References: Add references at CitEc
Citations: View citations in EconPapers (9)
Published in Organizational Research Methods, 2003, 6 (3), pp.309-327. ⟨10.1177/1094428103254672⟩
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-00477167
DOI: 10.1177/1094428103254672
Access Statistics for this paper
More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().