EconPapers    
Economics at your fingertips  
 

On the estimation of optimal number of clusters for the induction of fuzzy decision trees

Swathi Jamjala Narayanan, Ilango Paramasivam and Rajen B. Bhatt

International Journal of Data Science, 2017, vol. 2, issue 3, 221-245

Abstract: Fuzzy decision tree (FDT) induction is a powerful methodology to extract human interpretable fuzzy classification rules. As far as our knowledge goes there is no recent comparative study of fuzzy cluster validity indices with an objective of using it for estimating the optimal number of clusters for each of the continuous attributes during the process of induction of FDT. In this paper, we study the performance of the FDT with optimal number of partitions for each node appearing in the FDT. By obtaining optimal number of fuzzy clusters, we capture the intrinsic structure of the attribute values during the formation of fuzzy partitions, which in turn improves the classification accuracy of FDT. Extensive computational experiments are conducted on FDT developed using Fuzzy ID3 and eight fuzzy cluster validity indices over 30 publicly available pattern classification datasets. Non-parametric statistical tests are conducted to test the null hypothesis.

Keywords: FDT; fuzzy decision tree; fuzzy ID3; fuzzy c-means; cluster analysis; cluster validity; non-parametric statistical test; optimal clusters; data science. (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.inderscience.com/link.php?id=86255 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdsci:v:2:y:2017:i:3:p:221-245

Access Statistics for this article

More articles in International Journal of Data Science from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().

 
Page updated 2025-03-19
Handle: RePEc:ids:ijdsci:v:2:y:2017:i:3:p:221-245