
Discriminant analysis-based cluster ensemble

Vasudha Bhatnagar, Sangeeta Ahuja and Sharanjit Kaur

International Journal of Data Mining, Modelling and Management, 2015, vol. 7, issue 2, 83-107

Abstract: Instability and non-robustness in K-means clustering are recognised as serious problems in both scientific and business applications, and they are accentuated by the presence of outliers in the data. Cluster ensemble techniques have recently been developed to combat these problems and to improve the overall quality of a clustering scheme. In this paper, we propose a cluster ensemble method based on discriminant analysis that obtains a robust clustering and reports noise to the user. The clustering schemes that make up the ensemble are generated by the partitional clustering algorithm K-means. The proposed algorithm operates in three phases. In the first phase, the input clustering schemes are reconciled by relabelling their clusters to correspond to an arbitrary reference scheme. This is accomplished using the Hungarian algorithm, a well-known optimisation approach. The second phase uses discriminant analysis to construct a label matrix from which the consensus partition is generated. In the final phase, the clustering scheme is refined to deliver a robust and stable clustering. Empirical evaluation shows that the proposed method significantly improves the quality of the resultant ensemble. Further, comparison with the cluster ensembles generated by package R for 20 public datasets demonstrates improved quality of the ensembles generated by the proposed algorithm.
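
The relabelling step of the first phase can be illustrated with a short sketch. This is not the authors' implementation: it assumes that cluster agreement is measured by the number of shared points (a contingency table) and that every scheme has the same number of clusters `k`. The function name `relabel_to_reference` is hypothetical; the assignment itself is solved with SciPy's `linear_sum_assignment`, which solves the same assignment problem as the Hungarian algorithm.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def relabel_to_reference(reference, labels, k):
    """Rename the clusters in `labels` to agree as far as possible with `reference`.

    Hypothetical helper: assumes both label vectors use integers 0..k-1.
    """
    reference = np.asarray(reference)
    labels = np.asarray(labels)

    # overlap[i, j] = number of points labelled i in `labels` and j in `reference`
    overlap = np.zeros((k, k), dtype=int)
    np.add.at(overlap, (labels, reference), 1)

    # The solver minimises total cost, so negate the overlap to maximise agreement.
    rows, cols = linear_sum_assignment(-overlap)

    # mapping[old_label] = new_label taken from the reference scheme
    mapping = np.empty(k, dtype=int)
    mapping[rows] = cols
    return mapping[labels]


# Two K-means runs that found the same groups but numbered them differently.
reference = [0, 0, 1, 1, 2, 2]
permuted = [2, 2, 0, 0, 1, 1]
aligned = relabel_to_reference(reference, permuted, k=3)
print(aligned.tolist())
```

Once every scheme in the ensemble has been aligned this way, the labels assigned to a point by different runs become directly comparable, which is what a later consensus step needs.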

Keywords: k-means clustering; cluster ensembles; discriminant analysis; consistency; optimisation
Date: 2015

Downloads: (external link)
http://www.inderscience.com/link.php?id=69248 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdmmm:v:7:y:2015:i:2:p:83-107

More articles in International Journal of Data Mining, Modelling and Management from Inderscience Enterprises Ltd
