Genetically Engineered Decision Trees: Population Diversity Produces Smarter Trees
Zhiwei Fu (),
Bruce Golden (),
Shreevardhan Lele (),
S. Raghavan () and
Edward Wasil ()
Additional contact information
Zhiwei Fu: Fannie Mae, 4000 Wisconsin Avenue NW, Washington, DC 20016
Bruce Golden: Robert H.Smith School of Business, University of Maryland, College Park, Maryland 20742
Shreevardhan Lele: Robert H.Smith School of Business, University of Maryland, College Park, Maryland 20742
S. Raghavan: Robert H.Smith School of Business, University of Maryland, College Park, Maryland 20742
Edward Wasil: Kogod School of Business, American University, Washington, DC 20016
Operations Research, 2003, vol. 51, issue 6, 894-907
Abstract:
When considering a decision tree for the purpose of classification, accuracy is usually the sole performance measure used in the construction process. In this paper, we introduce the idea of combining a decision tree's expected value and variance in a new probabilistic measure for assessing the performance of a tree. We develop a genetic algorithm for constructing a tree using our new measure and conduct computational experiments that show the advantages of our approach. Further, we investigate the effect of introducing diversity into the population used by our genetic algorithm. We allow the genetic algorithm to simultaneously focus on two distinct probabilistic measures---one that is risk averse and one that is risk seeking. Our bivariate genetic algorithm for constructing a decision tree performs very well, scales up quite nicely to handle data sets with hundreds of thousands of points, and requires only a small percent of the data to generate a high-quality decision tree. We demonstrate the effectiveness of our algorithm on three large data sets.
Keywords: Statistics; data analysis: data mining; Marketing; estimation/statistical techniques: decision trees; Computers/computer science; artificial intelligence: genetic algorithms (search for similar items in EconPapers)
Date: 2003
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://dx.doi.org/10.1287/opre.51.6.894.24919 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:51:y:2003:i:6:p:894-907
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().