Variable selection in clustering via Dirichlet process mixture models

Kim, Sinae; Tadesse, Mahlet G.; Vannucci, Marina

Variable selection in clustering via Dirichlet process mixture models

Sinae Kim, Mahlet G. Tadesse and Marina Vannucci

Biometrika, 2006, vol. 93, issue 4, 877-893

Abstract: The increased collection of high-dimensional data in various fields has raised a strong interest in clustering algorithms and variable selection procedures. In this paper, we propose a model-based method that addresses the two problems simultaneously. We introduce a latent binary vector to identify discriminating variables and use Dirichlet process mixture models to define the cluster structure. We update the variable selection index using a Metropolis algorithm and obtain inference on the cluster structure via a split-merge Markov chain Monte Carlo technique. We explore the performance of the methodology on simulated data and illustrate an application with a DNA microarray study. Copyright 2006, Oxford University Press.

Date: 2006
References: Add references at CitEc
Citations: View citations in EconPapers (21)

Downloads: (external link)
http://hdl.handle.net/10.1093/biomet/93.4.877 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:oup:biomet:v:93:y:2006:i:4:p:877-893

Ordering information: This journal article can be ordered from
https://academic.oup.com/journals

Access Statistics for this article

Biometrika is currently edited by Paul Fearnhead

More articles in Biometrika from Biometrika Trust Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, UK.
Bibliographic data for series maintained by Oxford University Press ().