Detection in the stochastic block model with multiple clusters: proof of the achievability conjectures, acyclic BP, and the information-computation gap

In a paper that initiated the modern study of the stochastic block model, Decelle et al., backed up Mossel et al., made a fascinating conjecture: Denote by the number of balanced communities, the probability of connecting inside communities and across, and set ; for any , it is possible to detect communities efficiently whenever (the KS threshold), whereas for , it is possible to detect communities information-theoretically for some . Massouli\'e, Mossel et al.\ and Bordenave et al.\ succeeded in proving that the KS threshold is efficiently achievable for , while Mossel et al.\ proved that it cannot be crossed information-theoretically for . The above conjecture remained open for . This paper proves this conjecture. For the efficient part, an acyclic belief propagation (ABP) algorithm is developed and proved to detect communities for any down the KS threshold in time . Achieving this requires showing optimality of BP in the presence of cycles and random initialization, a challenge in the realm of graphical models. The paper also connects ABP to a power iteration method on a -nonbacktracking operator, formalizing the message passing and spectral interplay. Further, it shows that the model can be learned efficiently down the KS threshold, implying that ABP improves upon the state-of-the-art both in terms of complexity and universality. For the information-theoretic (IT) part, a non-efficient algorithm sampling a typical clustering is shown to break down the KS threshold at . The emerging gap is shown to be large in some cases; if , the KS threshold reads whereas the IT bound reads . This thus makes the SBM a good study case for information-computation gaps. The results extend to general SBMs.
View on arXiv