evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R
Thomas Grubinger (),
Achim Zeileis () and
Karl-Peter Pfeiffer ()
Working Papers from Faculty of Economics and Statistics, University of Innsbruck
Commonly used classification and regression tree methods like the CART algorithm are recursive partitioning methods that build the model in a forward stepwise search. Although this approach is known to be an efficient heuristic, the results of recursive tree methods are only locally optimal, as splits are chosen to maximize homogeneity at the next step only. An alternative way to search over the parameter space of trees is to use global optimization methods like evolutionary algorithms. This paper describes the "evtree" package, which implements an evolutionary algorithm for learning globally optimal classification and regression trees in R. Computationally intensive tasks are fully computed in C++ while the "partykit" (Hothorn and Zeileis 2011) package is leveraged for representing the resulting trees in R, providing unified infrastructure for summaries, visualizations, and predictions. "evtree" is compared to "rpart" (Therneau and Atkinson 1997), the open-source CART implementation, and conditional inference trees ("ctree", Hothorn, Hornik, and Zeileis 2006). The usefulness of "evtree" is illustrated in a textbook customer classification task and a benchmark study of predictive accuracy in which "evtree" achieved at least similar and most of the time better results compared to the recursive algorithms "rpart" and "ctree".
Keywords: machine learning; classification trees; regression trees; evolutionary algorithms; R (search for similar items in EconPapers)
JEL-codes: C14 C45 C87 (search for similar items in EconPapers)
New Economics Papers: this item is included in nep-cmp and nep-ore
References: View references in EconPapers View complete reference list from CitEc
Citations Track citations by RSS feed
Downloads: (external link)
Journal Article: evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R (2014)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: http://EconPapers.repec.org/RePEc:inn:wpaper:2011-20
Access Statistics for this paper
More papers in Working Papers from Faculty of Economics and Statistics, University of Innsbruck Contact information at EDIRC.
Series data maintained by Janette Walde ().