Estimating Number of Clusters Based on a General Similarity Matrix with Application to Microarray Data
Fallah Shafagh,
Tritchler David and
Beyene Joseph
Additional contact information
Fallah Shafagh: University of Toronto
Tritchler David: University Health Network, Toronto; University of Toronto; and SUNY at Buffalo
Beyene Joseph: Hospital for Sick Children Research Institute and University of Toronto
Statistical Applications in Genetics and Molecular Biology, 2008, vol. 7, issue 1, 25
Abstract:
Many clustering methods require that the number of clusters believed present in a given data set be specified a priori, and a number of methods for estimating the number of clusters have been developed. However, the selection of the number of clusters is well recognized as a difficult and open problem and there is a need for methods which can shed light on specific aspects of the data. This paper adopts a model for clustering based on a specific structure for a similarity matrix. Publicly available gene expression data sets are analyzed to illustrate the method and the performance of our method is assessed by simulation.
Keywords: cluster analysis; eigenanalysis; microarray; segmented regression; scree plot (search for similar items in EconPapers)
Date: 2008
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://doi.org/10.2202/1544-6115.1261 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:7:y:2008:i:1:n:24
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.2202/1544-6115.1261
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().