87
22

Recovering a Hidden Community Beyond the Spectral Limit in O(ElogV)O(|E| \log^*|V|) Time

Abstract

The stochastic block model for one community with parameters n,K,p,n, K, p, and qq is considered: KK out of nn vertices are in the community; two vertices are connected by an edge with probability pp if they are both in the community and with probability qq otherwise, where p>q>0p > q > 0 and p/qp/q is assumed to be bounded. An estimator based on observation of the graph G=(V,E)G=(V,E) is said to achieve weak recovery if the mean number of misclassified vertices is o(K)o(K) as nn \to \infty. A critical role is played by the effective signal-to-noise ratio λ=K2(pq)2/((nK)q).\lambda=K^2(p-q)^2/((n-K)q). In the regime K=Θ(n)K=\Theta(n), a na\"{i}ve degree-thresholding algorithm achieves weak recovery in O(E)O(|E|) time if λ\lambda \to \infty, which coincides with the information theoretic possibility of weak recovery. The main focus of the paper is on weak recovery in the sublinear regime K=o(n)K=o(n) and np=no(1).np = n^{o(1)}. It is shown that weak recovery is provided by a belief propagation algorithm running for log(n)+O(1)\log^\ast(n)+O(1) iterations, if λ>1/e,\lambda > 1/e, with the total time complexity O(Elogn)O(|E| \log^*n). Conversely, no local algorithm with radius tt of interaction satisfying t=o(lognlog(2+np))t = o(\frac{\log n}{\log(2+np)}) can asymptotically outperform trivial random guessing if λ1/e.\lambda \leq 1/e. By analyzing a linear message-passing algorithm that corresponds to applying power iteration to the non-backtracking matrix of the graph, we provide evidence to suggest that spectral methods fail to provide weak recovery if λ1.\lambda \leq 1.

View on arXiv
Comments on this paper