evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R
Thomas Grubinger,
Achim Zeileis () and
Karl-Peter Pfeiffer
Journal of Statistical Software, 2014, vol. 061, issue i01
Abstract:
Commonly used classification and regression tree methods like the CART algorithm are recursive partitioning methods that build the model in a forward stepwise search. Although this approach is known to be an efficient heuristic, the results of recursive tree methods are only locally optimal, as splits are chosen to maximize homogeneity at the next step only. An alternative way to search over the parameter space of trees is to use global optimization methods like evolutionary algorithms. This paper describes the evtree package, which implements an evolutionary algorithm for learning globally optimal classification and regression trees in R. Computationally intensive tasks are fully computed in C++ while the partykit package is leveraged for representing the resulting trees in R, providing unified infrastructure for summaries, visualizations, and predictions. evtree is compared to the open-source CART implementation rpart, conditional inference trees (ctree), and the open-source C4.5 implementation J48. A benchmark study of predictive accuracy and complexity is carried out in which evtree achieved at least similar and most of the time better results compared to rpart, ctree, and J48. Furthermore, the usefulness of evtree in practice is illustrated in a textbook customer classification task.
Date: 2014-10-24
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (20)
Downloads: (external link)
https://www.jstatsoft.org/index.php/jss/article/view/v061i01/v61i01.pdf
https://www.jstatsoft.org/index.php/jss/article/do ... /evtree_1.0-0.tar.gz
https://www.jstatsoft.org/index.php/jss/article/do ... ile/v061i01/v61i01.R
https://www.jstatsoft.org/index.php/jss/article/do ... v61i01-benchmark.zip
Related works:
Working Paper: evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R (2011) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:jss:jstsof:v:061:i01
DOI: 10.18637/jss.v061.i01
Access Statistics for this article
Journal of Statistical Software is currently edited by Bettina Grün, Edzer Pebesma and Achim Zeileis
More articles in Journal of Statistical Software from Foundation for Open Access Statistics
Bibliographic data for series maintained by Christopher F. Baum ().