A novel clustering method with maximum number of ordered centroids and stable clusters for optimal ranking in a univariate setting
Mariaelena Bottazzi Schenone (),
Elena Grimaccia () and
Maurizio Vichi ()
Additional contact information
Mariaelena Bottazzi Schenone: Sapienza University
Elena Grimaccia: ISTAT - Italian National Institute of Statistics
Maurizio Vichi: Sapienza University
Statistical Methods & Applications, 2025, vol. 34, issue 4, No 2, 607-637
Abstract:
Abstract This paper proposes an innovative method to determine the optimal ranking of a set of univariate units in the maximum number of clusters with sortable centroids. Units within the identified clusters are considered equivalent, while units between clusters show a significant difference in terms of the variable in study. By means of bootstrap estimates of clusters’ centroids, the proposed procedure allows to identify the optimal number of “well-separated” classes, adding on the deterministic results. Moreover, the bootstrap estimates of units’ membership matrices allow us to define an optimal ranking of these units within the identified clusters: the obtained clusters are ranked so that units within each cluster are represented by the rank of the cluster they belong to. Centroids and membership matrices are obtained by applying a specialized K-means clustering on one dimensional data. This methodology is particularly useful in a framework where the aim is to rank units in equivalence classes in a univariate setting. The performance of the presented methodology is evaluated through a simulation study and compared with some widely used techniques to choose the number of clusters and with Gaussian mixture models. Moreover, two real data applications provide insights on the rank of European cities according to their air pollution level and on the rank of National Basketball Association players in terms of their on-court performance. A graphic visualization of the obtained ranking allows to immediately appreciate both the resulting partition of units into equivalence classes and its stability measurement.
Keywords: One-dimensional data clustering; Ranking in equivalence classes; Optimal number of clusters; Bootstrap; K-means clustering (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10260-025-00803-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:stmapp:v:34:y:2025:i:4:d:10.1007_s10260-025-00803-2
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/10260/PS2
DOI: 10.1007/s10260-025-00803-2
Access Statistics for this article
Statistical Methods & Applications is currently edited by Tommaso Proietti
More articles in Statistical Methods & Applications from Springer, Società Italiana di Statistica
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().