140

Rate-optimal community detection near the KS threshold via node-robust algorithms

Main:2 Pages
Appendix:81 Pages
Abstract

We study community detection in the \emph{symmetric kk-stochastic block model}, where nn nodes are evenly partitioned into kk clusters with intra- and inter-cluster connection probabilities pp and qq, respectively.Our main result is a polynomial-time algorithm that achieves the minimax-optimal misclassification rate\begin{equation*}\exp \Bigl(-\bigl(1 \pm o(1)\bigr) \tfrac{C}{k}\Bigr),\quad \text{where } C = (\sqrt{pn} - \sqrt{qn})^2,\end{equation*}whenever CKk2logkC \ge K\,k^2\,\log k for some universal constant KK, matching the Kesten--Stigum (KS) threshold up to a logk\log k factor.Notably, this rate holds even when an adversary corrupts an ηexp((1±o(1))Ck)\eta \le \exp\bigl(- (1 \pm o(1)) \tfrac{C}{k}\bigr) fraction of the nodes.To the best of our knowledge, the minimax rate was previously only attainable either via computationally inefficient procedures [ZZ15] or via polynomial-time algorithms that require strictly stronger assumptions such as CKk3C \ge K k^3 [GMZZ17].In the node-robust setting, the best known algorithm requires the substantially stronger condition CKk102C \ge K k^{102} [LM22].Our results close this gap by providing the first polynomial-time algorithm that achieves the minimax rate near the KS threshold in both settings.Our work has two key technical contributions:(1) we robustify majority voting via the Sum-of-Squares framework,(2) we develop a novel graph bisection algorithm via robust majority voting, which allows us to significantly improve the misclassification rate to 1/poly(k)1/\mathrm{poly}(k) for the initial estimation near the KS threshold.

View on arXiv
Comments on this paper