223

Normal approximation and concentration of spectral projectors of sample covariance

Abstract

Let X,X1,,XnX,X_1,\dots, X_n be i.i.d. Gaussian random variables in a separable Hilbert space H{\mathbb H} with zero mean and covariance operator Σ=E(XX),\Sigma={\mathbb E}(X\otimes X), and let Σ^:=n1j=1n(XjXj)\hat \Sigma:=n^{-1}\sum_{j=1}^n (X_j\otimes X_j) be the sample (empirical) covariance operator based on (X1,,Xn).(X_1,\dots, X_n). Denote by PrP_r the spectral projector of Σ\Sigma corresponding to its rr-th eigenvalue μr\mu_r and by P^r\hat P_r the empirical counterpart of Pr.P_r. The main goal of the paper is to obtain tight bounds on supxRP{P^rPr22EP^rPr22Var1/2(P^rPr22)x}Φ(x), \sup_{x\in {\mathbb R}} \left|{\mathbb P}\left\{\frac{\|\hat P_r-P_r\|_2^2-{\mathbb E}\|\hat P_r-P_r\|_2^2}{{\rm Var}^{1/2}(\|\hat P_r-P_r\|_2^2)}\leq x\right\}-\Phi(x)\right|, where 2\|\cdot\|_2 denotes the Hilbert--Schmidt norm and Φ\Phi is the standard normal distribution function. Such accuracy of normal approximation of the distribution of squared Hilbert--Schmidt error is characterized in terms of so called effective rank of Σ\Sigma defined as r(Σ)=tr(Σ)Σ,{\bf r}(\Sigma)=\frac{{\rm tr}(\Sigma)}{\|\Sigma\|_{\infty}}, where tr(Σ){\rm tr}(\Sigma) is the trace of Σ\Sigma and Σ\|\Sigma\|_{\infty} is its operator norm, as well as another parameter characterizing the size of Var(P^rPr22).{\rm Var}(\|\hat P_r-P_r\|_2^2). Other results include non-asymptotic bounds and asymptotic representations for the mean squared Hilbert--Schmidt norm error EP^rPr22{\mathbb E}\|\hat P_r-P_r\|_2^2 and the variance Var(P^rPr22),{\rm Var}(\|\hat P_r-P_r\|_2^2), and concentration inequalities for P^rPr22\|\hat P_r-P_r\|_2^2 around its expectation.

View on arXiv
Comments on this paper