Active learning-based pedagogical rule extraction
Enric Junqué de Fortuny and
David Martens
Working Papers from University of Antwerp, Faculty of Business and Economics
Abstract:
Many of the state-of-the-art data mining techniques introduce non-linearities in their models to cope with complex data-relationships effectively. Although such techniques are consistently included among the top classification techniques in terms of predictive power, their lack of transparency renders them useless in any domain where comprehensibility is of importance. Rule-extraction algorithms remedy this by distilling comprehensible rulesets from complex models that explain how the classifications are made. The present article considers a new rule extraction technique, based on active learning. The technique generates artificial data points around training data with low confidence in the output score, after which these are labelled by the black-box model. The main novelty of the proposed method is that it uses a pedagogical approach without making any architectural assumptions of the underlying model. It can therefore be applied to any black-box technique. Furthermore, it can generate any rule format, depending on the chosen underlying rule induction technique. In a large-scale empirical study, we demonstrate the validity of our technique to extract trees and rules from Artificial Neural Networks, Support Vector Machines and Random Forests, on 25 datasets of varying size and dimensionality. Our results show that not only do the generated rules explain the black-box models well (thereby facilitating the acceptance of such models), the proposed algorithm also performs significantly better than traditional rule induction techniques in terms of accuracy as well as fidelity.
Keywords: Rule extraction; Active learning; Comprehensibility; Pedagogical (search for similar items in EconPapers)
Pages: 33 pages
Date: 2014-08
New Economics Papers: this item is included in nep-cmp
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://repository.uantwerpen.be/docman/irua/fbdff0/145284.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ant:wpaper:2014016
Access Statistics for this paper
More papers in Working Papers from University of Antwerp, Faculty of Business and Economics Contact information at EDIRC.
Bibliographic data for series maintained by Joeri Nys ().