218

Two-Sample Testing with a Graph-Based Total Variation Integral Probability Metric

Main:16 Pages
6 Figures
Bibliography:6 Pages
Appendix:20 Pages
Abstract

We consider a novel multivariate nonparametric two-sample testing problem where, under the alternative, distributions PP and QQ are separated in an integral probability metric over functions of bounded total variation (TV IPM). We propose a new test, the graph TV test, which uses a graph-based approximation to the TV IPM as its test statistic. We show that this test, computed with an ε\varepsilon-neighborhood graph and calibrated by permutation, is minimax rate-optimal for detecting alternatives separated in the TV IPM. As an important special case, we show that this implies the graph TV test is optimal for detecting spatially localized alternatives, whereas the χ2\chi^2 test is provably suboptimal. Our theory is supported with numerical experiments on simulated and real data.

View on arXiv
Comments on this paper