10
5

Gaussian Mean Testing Made Simple

Abstract

We study the following fundamental hypothesis testing problem, which we term Gaussian mean testing. Given i.i.d. samples from a distribution pp on Rd\mathbb{R}^d, the task is to distinguish, with high probability, between the following cases: (i) pp is the standard Gaussian distribution, N(0,Id)\mathcal{N}(0,I_d), and (ii) pp is a Gaussian N(μ,Σ)\mathcal{N}(\mu,\Sigma) for some unknown covariance Σ\Sigma and mean μRd\mu \in \mathbb{R}^d satisfying μ2ϵ\|\mu\|_2 \geq \epsilon. Recent work gave an algorithm for this testing problem with the optimal sample complexity of Θ(d/ϵ2)\Theta(\sqrt{d}/\epsilon^2). Both the previous algorithm and its analysis are quite complicated. Here we give an extremely simple algorithm for Gaussian mean testing with a one-page analysis. Our algorithm is sample optimal and runs in sample linear time.

View on arXiv
Comments on this paper