Determining the Number of Clusters Using Multivariate Ranks
Mohammed Baragilly () and
Biman Chakraborty ()
Additional contact information
Mohammed Baragilly: University of Birmingham, School of Mathematics
Biman Chakraborty: University of Birmingham, School of Mathematics
A chapter in Recent Advances in Robust Statistics: Theory and Applications, 2016, pp 17-33 from Springer
Abstract:
Abstract Determining number of clusters in a multivariate data has become one of the most important issues in very diversified areas of scientific disciplines. The forward search algorithm is a graphical approach that helps us in this task. The traditional forward search approach based on Mahalanobis distances has been introduced by Hadi (1992), Atkinson (1994), while Atkinson et al. (2004) used it as a clustering method. But like many other Mahalanobis distance-based methods, it cannot be correctly applied to asymmetric distributions and more generally, to distributions which depart from the elliptical symmetry assumption. We propose a new forward search methodology based on spatial ranks, where clusters are grown with one data point at a time sequentially, using spatial ranks with respect to the points already in the subsample. The algorithm starts from a randomly chosen initial subsample. We illustrate with simulated data that the proposed algorithm is robust to the choice of initial subsample and it performs well in different mixture multivariate distributions. We also propose a modified algorithm based on the volume of central rank regions. Our numerical examples show that it produces the best results under elliptic symmetry.
Keywords: Gaussian Mixture Model; Mahalanobis Distance; Subset Size; Forward Search; Clear Maximum (search for similar items in EconPapers)
Date: 2016
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-81-322-3643-6_2
Ordering information: This item can be ordered from
http://www.springer.com/9788132236436
DOI: 10.1007/978-81-322-3643-6_2
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().