26
85

On the limitation of spectral methods: From the Gaussian hidden clique problem to rank one perturbations of Gaussian tensors

Abstract

We consider the following detection problem: given a realization of a symmetric matrix X{\mathbf{X}} of dimension nn, distinguish between the hypothesis that all upper triangular variables are i.i.d. Gaussians variables with mean 0 and variance 11 and the hypothesis where X{\mathbf{X}} is the sum of such matrix and an independent rank-one perturbation. This setup applies to the situation where under the alternative, there is a planted principal submatrix B{\mathbf{B}} of size LL for which all upper triangular variables are i.i.d. Gaussians with mean 11 and variance 11, whereas all other upper triangular elements of X{\mathbf{X}} not in B{\mathbf{B}} are i.i.d. Gaussians variables with mean 0 and variance 11. We refer to this as the `Gaussian hidden clique problem.' When L=(1+ϵ)nL=(1+\epsilon)\sqrt{n} (ϵ>0\epsilon>0), it is possible to solve this detection problem with probability 1on(1)1-o_n(1) by computing the spectrum of X{\mathbf{X}} and considering the largest eigenvalue of X{\mathbf{X}}. We prove that this condition is tight in the following sense: when L<(1ϵ)nL<(1-\epsilon)\sqrt{n} no algorithm that examines only the eigenvalues of X{\mathbf{X}} can detect the existence of a hidden Gaussian clique, with error probability vanishing as nn\to\infty. We prove this result as an immediate consequence of a more general result on rank-one perturbations of kk-dimensional Gaussian tensors. In this context we establish a lower bound on the critical signal-to-noise ratio below which a rank-one signal cannot be detected.

View on arXiv
Comments on this paper