EconPapers    
Economics at your fingertips  
 

Rule induction with extension matrices

Xindong Wu

Journal of the American Society for Information Science, 1998, vol. 49, issue 5, 435-454

Abstract: This article presents a heuristic, attribute‐based, noise‐tolerant data mining program, HCV (Version 2.0), based on the newly‐developed extension matrix approach. By dividing the positive examples (PE) of a specific class in a given example set into intersecting groups and adopting a set of strategies to find a heuristic conjunctive formula in each group which covers all the group's positive examples and none of the negative examples (NE), the HCV induction algorithm adopted in the HCV (Version 2.0) software finds a description formula in the form of variable‐valued logic for PE against NE in low‐order polynomial time at induction time. In addition to the HCV induction algorithm, this article also outlines some of the techniques for noise handling and discretization of numerical domains developed and implemented in the HCV (Version 2.0) software, and provides a performance comparison of HCV (Version 2.0) with other data mining algorithms ID3, C4.5, C4.5rules, and NewID in noisy and continuous domains. The empirical comparison shows that the rules generated by HCV (Version 2.0) are more compact than the decision trees or rules produced by ID3‐like algorithms, and HCV's predicative accuracy is competitive with ID3‐like algorithms. © 1998 John Wiley & Sons, Inc.

Date: 1998
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1002/(SICI)1097-4571(19980415)49:53.0.CO;2-R

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:49:y:1998:i:5:p:435-454

Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1097-4571

Access Statistics for this article

More articles in Journal of the American Society for Information Science from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-19
Handle: RePEc:bla:jamest:v:49:y:1998:i:5:p:435-454