158

Reducing Nearest Neighbor Training Sets Optimally and Exactly

Canadian Conference on Computational Geometry (CCCG), 2023
Abstract

In nearest-neighbor classification, a training set PP of points in Rd\mathbb{R}^d with given classification is used to classify every point in Rd\mathbb{R}^d: Every point gets the same classification as its nearest neighbor in PP. Recently, Eppstein [SOSA'22] developed an algorithm to detect the relevant training points, those points pPp\in P, such that PP and P{p}P\setminus\{p\} induce different classifications. We investigate the problem of finding the minimum cardinality reduced training set PPP'\subseteq P such that PP and PP' induce the same classification. We show that the set of relevant points is such a minimum cardinality reduced training set if PP is in general position. Furthermore, we show that finding a minimum cardinality reduced training set for possibly degenerate PP is in P for d=1d=1, and NP-complete for d2d\geq 2.

View on arXiv
Comments on this paper