Depth-weighted Bayes classification
Ondrej Vencalek and
Computational Statistics & Data Analysis, 2018, vol. 123, issue C, 1-12
Two procedures for supervised classification are proposed. These are based on data depth and focus on the centre of each class. The classifiers add either a depth or a depth rank term to the objective function of the Bayes classifier. The cost of misclassifying a point depends not only on a class where it belongs, but also on its centrality with respect to this class. The classification of points that are more central is enforced while outliers are downweighted. The proposed objective function can also be used to evaluate the performance of other classifiers instead of the usual average misclassification rate. Use of the depth function increases robustness of the new procedures against the large inclusion of contaminated data that often impede the Bayes classifier. Properties of the new methods are investigated and compared with those of the Bayes classifier. Theoretical results are derived for elliptically symmetric distributions, while comparison for non-symmetric distributions is conducted by means of a simulation study. Comparisons are conducted for both theoretical classifiers and their empirical counterparts. The performance of the newly proposed classifiers is also compared to the performance of several standard methods in some real life situations.
Keywords: Bayes classifier; Data depth; Nonparametric; Rank; Supervised learning (search for similar items in EconPapers)
References: View references in EconPapers View complete reference list from CitEc
Citations Track citations by RSS feed
Downloads: (external link)
Full text for ScienceDirect subscribers only.
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:123:y:2018:i:c:p:1-12
Access Statistics for this article
Computational Statistics & Data Analysis is currently edited by S.P. Azen
More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Dana Niculescu ().