EconPapers    
Economics at your fingertips  
 

Issues of robustness and high dimensionality in cluster analysis

Kaye Basford (), Geoff McLachlan () and Richard Bean ()
Additional contact information
Kaye Basford: University of Queensland, School of Land and Food Sciences
Geoff McLachlan: University of Queensland, Department of Mathematics & Institute for Molecular Bioscience
Richard Bean: University of Queensland, Institute for Molecular Bioscience

A chapter in Compstat 2006 - Proceedings in Computational Statistics, 2006, pp 3-15 from Springer

Abstract: Abstract Finite mixture models are being increasingly used to model the distributions of a wide variety of random phenomena. While normal mixture models are often used to cluster data sets of continuous multivariate data, a more robust clustering can be obtained by considering the t mixture model-based approach. Mixtures of factor analyzers enable model-based density estimation to be undertaken for high-dimensional data where the number of observations n is very large relative to their dimension p. As the approach using the multivariate normal family of distributions is sensitive to outliers, it is more robust to adopt the multivariate t family for the component error and factor distributions. The computational aspects associated with robustness and high dimensionality in these approaches to cluster analysis are discussed and illustrated.

Keywords: Finite mixture models; normal components; mixtures of factor analyzers; t distributions; EM algorithm (search for similar items in EconPapers)
Date: 2006
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-7908-1709-6_1

Ordering information: This item can be ordered from
http://www.springer.com/9783790817096

DOI: 10.1007/978-3-7908-1709-6_1

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-06-01
Handle: RePEc:spr:sprchp:978-3-7908-1709-6_1