clValid: An R Package for Cluster Validation

Guy Brock, Vasyl Pihur, Susmita Datta and Somnath Datta

Journal of Statistical Software, 2008, vol. 25, issue 4

Abstract: The R package clValid contains functions for validating the results of a clustering analysis. Three main types of cluster validation measures are available: "internal", "stability", and "biological". The user can choose from nine clustering algorithms in existing R packages, including hierarchical, K-means, self-organizing maps (SOM), and model-based clustering. In addition, we provide a function to perform the self-organizing tree algorithm (SOTA) method of clustering. Any combination of validation measures and clustering methods can be requested in a single function call. This allows the user to evaluate several clustering algorithms simultaneously while varying the number of clusters, to help determine the most appropriate method and number of clusters for the dataset of interest. Additionally, the package can automatically use the biological information in the Gene Ontology (GO) database to calculate the biological validation measures, via the annotation packages available in Bioconductor. The function returns an object of S4 class "clValid", which has summary, plot, print, and additional methods that allow the user to display the optimal validation scores and extract clustering results.
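
The idea of scoring several clustering methods across a range of cluster counts can be illustrated with a small sketch. This is not the clValid API and not the authors' code: it is a pure-Python toy, assuming a hand-made 2-D dataset, two simple agglomerative linkages standing in for "several clustering algorithms", and the Dunn index as one example of an internal validation measure. The names `agglomerative`, `dunn_index`, `points`, and `results` are hypothetical.

```python
from itertools import combinations
from math import dist  # Python 3.8+


def agglomerative(points, k, linkage):
    """Merge the two closest clusters until k remain.

    `linkage` is `min` (single linkage) or `max` (complete linkage).
    Returns a list of clusters, each a list of point indices.
    """
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        # Pick the pair of clusters (a < b) with the smallest linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda ab: linkage(
                dist(points[i], points[j])
                for i in clusters[ab[0]]
                for j in clusters[ab[1]]
            ),
        )
        clusters[a] += clusters.pop(b)
    return clusters


def dunn_index(points, clusters):
    """Smallest between-cluster distance divided by largest cluster diameter."""
    separation = min(
        dist(points[i], points[j])
        for c1, c2 in combinations(clusters, 2)
        for i in c1
        for j in c2
    )
    diameter = max(
        dist(points[i], points[j])
        for c in clusters
        for i, j in combinations(c, 2)
    )
    return separation / diameter


# Toy data (an assumption for illustration): three well-separated groups.
points = [
    (0, 0), (0, 1), (1, 0),
    (10, 10), (10, 11), (11, 10),
    (20, 0), (20, 1), (21, 0),
]

# Score every (method, number of clusters) combination in one pass.
methods = {"single": min, "complete": max}
results = {
    (name, k): dunn_index(points, agglomerative(points, k, linkage))
    for name, linkage in methods.items()
    for k in range(2, 5)
}

# Higher Dunn index is better, so the best combination is the maximum.
best = max(results, key=results.get)
print(best, results[best])
```

On this toy data both linkages score highest at three clusters. In the package itself, by the abstract's description, the analogous comparison is requested in a single function call and the optimal scores are read from the returned "clValid" object.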

Date: 2008-03-18

Downloads: (external link)
https://www.jstatsoft.org/index.php/jss/article/view/v025i04/v25i04.pdf
https://www.jstatsoft.org/index.php/jss/article/do ... clValid_0.5-6.tar.gz
https://www.jstatsoft.org/index.php/jss/article/do ... v025i04/v25i04.R.zip

Persistent link: https://EconPapers.repec.org/RePEc:jss:jstsof:v:025:i04

DOI: 10.18637/jss.v025.i04

Journal of Statistical Software is currently edited by Bettina Grün, Edzer Pebesma and Achim Zeileis

More articles in Journal of Statistical Software from Foundation for Open Access Statistics
Bibliographic data for series maintained by Christopher F. Baum.

 
Page updated 2025-03-19
Handle: RePEc:jss:jstsof:v:025:i04