The Analysis of Multivariate Misclassified Data With Special Attention to Randomized Response Data
Ardo van den Hout and
Peter G. M. van der Heijden
Sociological Methods & Research, 2004, vol. 32, issue 3, 384-410
Abstract:
This article discusses log-linear analysis of misclassified categorical data when conditional misclassification probabilities are known. This kind of misclassification occurs when data are collected using a randomized response design. The authors describe the misclassification by a latent class model. Since a latent class model is a log-linear model with one or more categorical latent variables, it is possible to investigate relations between misclassified variables. Methods to fit log-linear models for the latent table are discussed, including an EM algorithm. Attention is given to problems with boundary solutions. The results can also be used in statistical disclosure control when the post-randomization method is applied to protect the privacy of respondents, in epidemiology when specificity and sensitivity are known, and in data mining when privacy is protected by intentional statistical perturbation. Examples are given using randomized response data from a research into social benefit fraud.
Date: 2004
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journals.sagepub.com/doi/10.1177/0049124103257440 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:sae:somere:v:32:y:2004:i:3:p:384-410
DOI: 10.1177/0049124103257440
Access Statistics for this article
More articles in Sociological Methods & Research
Bibliographic data for series maintained by SAGE Publications ().