46
142

Minimax Rates of Estimation for Sparse PCA in High Dimensions

Abstract

We study sparse principal components analysis in the high-dimensional setting, where pp (the number of variables) can be much larger than nn (the number of observations). We prove optimal, non-asymptotic lower and upper bounds on the minimax estimation error for the leading eigenvector when it belongs to an q\ell_q ball for q[0,1]q \in [0,1]. Our bounds are sharp in pp and nn for all q[0,1]q \in [0, 1] over a wide class of distributions. The upper bound is obtained by analyzing the performance of q\ell_q-constrained PCA. In particular, our results provide convergence rates for 1\ell_1-constrained PCA.

View on arXiv
Comments on this paper