Validation of k-Nearest Neighbor Classifiers Using Inclusion and
Exclusion
Abstract
This paper presents a series of PAC error bounds for -nearest neighbors classifiers, with O() expected range in the difference between error bound and actual error rate, for each integer , where is the number of in-sample examples. The best previous expected bound range was O(). The result shows that -nn classifiers, in spite of their famously fractured decision boundaries, come arbitrarily close to having Gaussian-style O() expected differences between PAC (probably approximately correct) error bounds and actual expected out-of-sample error rates.
View on arXivComments on this paper
