Properties of bagged nearest neighbour classifiers

Peter Hall and Richard J. Samworth

Journal of the Royal Statistical Society Series B, 2005, vol. 67, issue 3, 363-379

Abstract: It is shown that bagging, a computationally intensive method, asymptotically improves the performance of nearest neighbour classifiers provided that the resample size is less than 69% of the actual sample size, in the case of with-replacement bagging, or less than 50% of the sample size, for without-replacement bagging. However, for larger sampling fractions there is no asymptotic difference between the risk of the regular nearest neighbour classifier and its bagged version. In particular, neither achieves the large sample performance of the Bayes classifier. In contrast, when the sampling fractions converge to 0, but the resample sizes diverge to ∞, the bagged classifier converges to the optimal Bayes rule and its risk converges to the risk of the latter. These results are most readily seen when the two populations have well-defined densities, but they may also be derived in other cases, where densities exist in only a relative sense. Cross-validation can be used effectively to choose the sampling fraction. Numerical calculation is used to illustrate these theoretical properties.
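
Illustrative note (not part of the abstract): one informal way to read the 69% and 50% thresholds is to ask how likely it is that a query point's nearest neighbour in the full sample also appears in a single resample. If that probability exceeds 1/2, a majority vote over many resamples would tend to reproduce the ordinary nearest neighbour decision. The sketch below computes the inclusion probability for both resampling schemes. The heuristic reading and the function names are assumptions made here for illustration; they are not taken from the paper.

```python
import math


def inclusion_prob_with_replacement(n: int, m: int) -> float:
    """Chance that one fixed sample point appears at least once when
    m points are drawn with replacement from a sample of size n."""
    return 1.0 - (1.0 - 1.0 / n) ** m


def inclusion_prob_without_replacement(n: int, m: int) -> float:
    """Chance that one fixed sample point is among m points drawn
    without replacement from a sample of size n."""
    return m / n


def critical_fraction_with_replacement() -> float:
    """Large-n sampling fraction m/n at which the with-replacement
    inclusion probability equals 1/2, i.e. 1 - exp(-m/n) = 1/2."""
    return math.log(2.0)


if __name__ == "__main__":
    n = 10_000  # hypothetical sample size, chosen only for illustration
    print(f"critical fraction (with replacement): "
          f"{critical_fraction_with_replacement():.4f}")
    for fraction in (0.30, 0.50, 0.69, 0.70, 0.90):
        m = round(fraction * n)
        p_with = inclusion_prob_with_replacement(n, m)
        p_without = inclusion_prob_without_replacement(n, m)
        print(f"m/n={fraction:.2f}  with={p_with:.4f}  without={p_without:.4f}")
```

Under this reading, the with-replacement inclusion probability crosses 1/2 near m/n = log 2, which is about 0.69, and the without-replacement probability crosses 1/2 at m/n = 0.5. Both values agree with the percentages quoted in the abstract, though the paper's own argument may proceed differently.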

Date: 2005
Downloads: (external link)
https://doi.org/10.1111/j.1467-9868.2005.00506.x

Persistent link: https://EconPapers.repec.org/RePEc:bla:jorssb:v:67:y:2005:i:3:p:363-379

Journal of the Royal Statistical Society Series B is currently edited by P. Fryzlewicz and I. Van Keilegom

Page updated 2025-03-19
Handle: RePEc:bla:jorssb:v:67:y:2005:i:3:p:363-379