87
22

Recovering a Hidden Community Beyond the Spectral Limit in O(ElogV)O(|E| \log^*|V|) Time

Abstract

Community detection is considered for a stochastic block model graph of nn vertices, with KK vertices in the planted community, edge probability pp for pairs of vertices both in the community, and edge probability qq for other pairs of vertices. The main focus of the paper is on recovery of the community based on the graph GG, with o(K)o(K) misclassified vertices on average, in the sublinear regime n1o(1)Ko(n).n^{1-o(1)} \leq K \leq o(n). It is shown that such recovery is attainable by a belief propagation algorithm running for logn+O(1)\log^\ast n+O(1) iterations, if λ=K2(pq)2/((nK)q)\lambda =K^2(p-q)^2/((n-K)q), the signal-to-noise ratio, exceeds 1/e,1/e, with the total time complexity O(Elogn)O(|E| \log^*n). Conversely, if λ1/e\lambda \leq 1/e, no local algorithm can asymptotically outperform trivial random guessing. By analyzing a linear message-passing algorithm that corresponds to applying power iteration to the non-backtracking matrix of the graph, we provide evidence to suggest that spectral methods fail to recovery the community if λ1.\lambda \leq 1. In addition, the belief propagation algorithm can be combined with a linear-time voting procedure to achieve the information limit of exact recovery (correctly classify all vertices with high probability) for all Knlogn(ρBP+o(1)),K \ge \frac{n}{\log n} \left( \rho_{\rm BP} +o(1) \right), where ρBP\rho_{\rm BP} is a function of p/qp/q.

View on arXiv
Comments on this paper