ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.01730
42
2
v1v2 (latest)

Nonasymptotic one-and two-sample tests in high dimension with unknown covariance structure

1 September 2021
Gilles Blanchard
Jean-Baptiste Fermanian
ArXiv (abs)PDFHTML
Abstract

Let X=(Xi)1≤i≤n\mathbf{X} = (X_i)_{1\leq i \leq n}X=(Xi​)1≤i≤n​ be an i.i.d. sample of square-integrable variables in Rd\mathbb{R}^dRd, with common expectation μ\muμ and covariance matrix Σ\SigmaΣ, both unknown. We consider the problem of testing if μ\muμ is η\etaη-close to zero, i.e. ∥μ∥≤η\|\mu\| \leq \eta ∥μ∥≤η against ∥μ∥≥(η+δ)\|\mu\| \geq (\eta + \delta)∥μ∥≥(η+δ); we also tackle the more general two-sample mean closeness testing problem. The aim of this paper is to obtain nonasymptotic upper and lower bounds on the minimal separation distance δ\deltaδ such that we can control both the Type I and Type II errors at a given level. The main technical tools are concentration inequalities, first for a suitable estimator of ∥μ∥2\|\mu\|^2∥μ∥2 used a test statistic, and secondly for estimating the operator and Frobenius norms of Σ\SigmaΣ coming into the quantiles of said test statistic. These properties are obtained for Gaussian and bounded distributions. A particular attention is given to the dependence in the pseudo-dimension d∗d_*d∗​ of the distribution, defined as d∗:=∥Σ∥22/∥Σ∥∞2d_* := \|\Sigma\|_2^2/\|\Sigma\|_\infty^2d∗​:=∥Σ∥22​/∥Σ∥∞2​. In particular, for η=0\eta=0η=0, the minimum separation distance is Θ(d∗14∥Σ∥∞/n){\Theta}(d_*^{\frac{1}{4}}\sqrt{\|\Sigma\|_\infty/n})Θ(d∗41​​∥Σ∥∞​/n​), in contrast with the minimax estimation distance for μ\muμ, which is Θ(de12∥Σ∥∞/n){\Theta}(d_e^{\frac{1}{2}}\sqrt{\|\Sigma\|_\infty/n})Θ(de21​​∥Σ∥∞​/n​) (where de:=∥Σ∥1/∥Σ∥∞d_e:=\|\Sigma\|_1/\|\Sigma\|_\inftyde​:=∥Σ∥1​/∥Σ∥∞​). This generalizes a phenomenon spelled out in particular by Baraud (2002).

View on arXiv
Comments on this paper