Computing and using the deviance with classification trees
Gilbert Ritschard ()
Additional contact information
Gilbert Ritschard: University of Geneva, Department of Econometrics
A chapter in Compstat 2006 - Proceedings in Computational Statistics, 2006, pp 55-66 from Springer
Abstract:
Abstract The reliability of induced classification trees is most often evaluated by means of the error rate. Whether computed on test data or through cross-validation, this error rate is suited for classification purposes. We claim that it is, however, a partial indicator only of the quality of the knowledge provided by trees and that there is a need for additional indicators. For example, the error rate is not representative of the quality of the description provided. In this paper we focus on this descriptive aspect. We consider the deviance as a goodness-of-fit statistic that attempts to measure how well the tree is at reproducing the conditional distribution of the response variable for each possible profile (rather than the individual response value for each case) and we discuss various statistical tests that can be derived from them. Special attention is devoted to computational aspects.
Keywords: Classification tree; Deviance; Goodness-of-fit; Chi-square statistics; BIC (search for similar items in EconPapers)
Date: 2006
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-7908-1709-6_5
Ordering information: This item can be ordered from
http://www.springer.com/9783790817096
DOI: 10.1007/978-3-7908-1709-6_5
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().