Clustering for binary data and mixture models—choice of the model

Nadif, M.; Govaert, G.

Clustering for binary data and mixture models—choice of the model

M. Nadif and G. Govaert

Applied Stochastic Models and Data Analysis, 1997, vol. 13, issue 3‐4, 269-278

Abstract: When cluster analysis is based on mixture models, choosing an appropriate model is a difficult problem. Previous studies usually addressed a part of this problem by estimating the number of clusters and assuming the type of model to be known. Various criteria to be minimized have been proposed to measure a model's suitability by balancing model fit and model complexity. In this work, we extend the work of Govaert (1990) and Celeux and Govaert (1995) to the use of some of these information criteria in the detection of the type of Bernoulli mixture model while assuming that the number of clusters is known. We simulated samples with various underlying types of model and separations of components using Monte Carlo simulations. These simulations show the advantages and the weaknesses of the considered information criteria with a view to determining the type of model. In addition, they underline the importance of a judicious choice of model type in order to obtain a good clustering. © 1998 John Wiley & Sons, Ltd.

Date: 1997
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1002/(SICI)1099-0747(199709/12)13:3/43.0.CO;2-7

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wly:apsmda:v:13:y:1997:i:3-4:p:269-278

Access Statistics for this article

More articles in Applied Stochastic Models and Data Analysis from John Wiley & Sons
Bibliographic data for series maintained by Wiley Content Delivery ().