
clues: An R Package for Nonparametric Clustering Based on Local Shrinking

Fang Chang, Weiliang Qiu, Ruben H. Zamar, Ross Lazarus and Xiaogang Wang

Journal of Statistical Software, 2010, vol. 33, issue 4

Abstract: Determining the optimal number of clusters is a persistent and controversial issue in cluster analysis. Most existing R packages for clustering require the user to specify the number of clusters in advance. However, if this subjectively chosen number is far from optimal, clustering may produce seriously misleading results. To address this problem, we develop the R package clues, which automates and evaluates the selection of an optimal number of clusters and is widely applicable in cluster analysis. Package clues uses two main procedures, shrinking and partitioning, to estimate an optimal number of clusters by maximizing an index function, either the CH index or the Silhouette index, rather than relying on a pre-specified guess. Five agreement indices (the Rand index, Hubert and Arabie's adjusted Rand index, Morey and Agresti's adjusted Rand index, the Fowlkes and Mallows index, and the Jaccard index), which measure the degree of agreement between any two partitions, are also provided in clues. In addition to numerical evidence, clues offers deeper insight into the partitioning process through trajectory plots.
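
As an illustration of what an agreement index between two partitions measures, the following is a minimal Python sketch of three of the unadjusted indices named in the abstract (Rand, Jaccard, and Fowlkes and Mallows), computed by counting object pairs. It is not the clues package's code or API: the function names (`pair_counts`, `rand_index`, `jaccard_index`, `fowlkes_mallows_index`) are hypothetical, and the formulas are the standard pair-counting definitions, assumed here rather than taken from the article.

```python
from itertools import combinations
from math import sqrt


def pair_counts(labels_a, labels_b):
    """Classify every pair of objects by whether each partition groups it together.

    Returns (both, only_a, only_b, neither), the number of pairs placed in the
    same cluster by both partitions, by A only, by B only, and by neither.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both partitions must label the same objects")
    both = only_a = only_b = neither = 0
    for i, j in combinations(range(len(labels_a)), 2):
        same_a = labels_a[i] == labels_a[j]
        same_b = labels_b[i] == labels_b[j]
        if same_a and same_b:
            both += 1
        elif same_a:
            only_a += 1
        elif same_b:
            only_b += 1
        else:
            neither += 1
    return both, only_a, only_b, neither


def rand_index(labels_a, labels_b):
    """Fraction of pairs on which the two partitions agree."""
    both, only_a, only_b, neither = pair_counts(labels_a, labels_b)
    total = both + only_a + only_b + neither
    if total == 0:
        raise ValueError("need at least two objects")
    return (both + neither) / total


def jaccard_index(labels_a, labels_b):
    """Like the Rand index, but ignoring pairs separated by both partitions."""
    both, only_a, only_b, _ = pair_counts(labels_a, labels_b)
    denom = both + only_a + only_b
    if denom == 0:
        raise ValueError("no pair is grouped together by either partition")
    return both / denom


def fowlkes_mallows_index(labels_a, labels_b):
    """Geometric mean of the two pairwise 'precision' ratios."""
    both, only_a, only_b, _ = pair_counts(labels_a, labels_b)
    denom = sqrt((both + only_a) * (both + only_b))
    if denom == 0:
        raise ValueError("at least one partition groups no pair together")
    return both / denom


if __name__ == "__main__":
    a = [0, 0, 1, 1]  # partition A of four objects
    b = [0, 0, 1, 2]  # partition B splits A's second cluster
    print(rand_index(a, b), jaccard_index(a, b), fowlkes_mallows_index(a, b))
```

All three indices depend only on which objects are grouped together, not on the cluster labels themselves, so relabelling the clusters of either partition leaves the values unchanged. The two adjusted Rand indices mentioned in the abstract additionally correct for chance agreement and are not covered by this sketch.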

Date: 2010-02-03

Downloads: (external link)
https://www.jstatsoft.org/index.php/jss/article/view/v033i04/v33i04.pdf
https://www.jstatsoft.org/index.php/jss/article/do ... 4/clues_0.5-0.tar.gz
https://www.jstatsoft.org/index.php/jss/article/do ... ile/v033i04/v33i04.R
https://www.jstatsoft.org/index.php/jss/article/do ... v033i04/WDBC.csv.zip


Persistent link: https://EconPapers.repec.org/RePEc:jss:jstsof:v:033:i04

DOI: 10.18637/jss.v033.i04


Journal of Statistical Software is currently edited by Bettina Grün, Edzer Pebesma and Achim Zeileis

More articles in Journal of Statistical Software from Foundation for Open Access Statistics
Bibliographic data for series maintained by Christopher F. Baum.

 
Page updated 2025-03-19
Handle: RePEc:jss:jstsof:v:033:i04