Subexponential-Time Algorithms for Sparse PCA

We study the computational cost of recovering a unit-norm sparse principal component planted in a random matrix, in either the Wigner or Wishart spiked model (observing either with drawn from the Gaussian orthogonal ensemble, or independent samples from , respectively). Prior work has shown that when the signal-to-noise ratio ( or , respectively) is a small constant and the fraction of nonzero entries in the planted vector is , it is possible to recover in polynomial time if . While it is possible to recover in exponential time under the weaker condition , it is believed that polynomial-time recovery is impossible unless . We investigate the precise amount of time required for recovery in the "possible but hard" regime by exploring the power of subexponential-time algorithms, i.e., algorithms running in time for some constant . For any , we give a recovery algorithm with runtime roughly , demonstrating a smooth tradeoff between sparsity and runtime. Our family of algorithms interpolates smoothly between two existing algorithms: the polynomial-time diagonal thresholding algorithm and the -time exhaustive search algorithm. Furthermore, by analyzing the low-degree likelihood ratio, we give rigorous evidence suggesting that the tradeoff achieved by our algorithms is optimal.
View on arXiv