Semidefinite Programs for Exact Recovery of a Hidden Community
We study a semidefinite programming (SDP) relaxation of the maximum likelihood estimation for exactly recovering a hidden community of cardinality from an symmetric data matrix , where for distinct indices , if are both in the community and otherwise, for two known probability distributions and . We identify a sufficient condition and a necessary condition for the success of SDP for the general model. For both the Bernoulli case ( and with ) and the Gaussian case ( and with ), which correspond to the problem of planted dense subgraph recovery and submatrix localization respectively, the general results lead to the following findings: (1) If , SDP attains the information-theoretic recovery limits with sharp constants; (2) If , SDP is order-wise optimal, but strictly suboptimal by a constant factor; (3) If and , SDP is order-wise suboptimal. A key ingredient in the proof of the necessary condition is a construction of a primal feasible solution based on random perturbation of the true cluster matrix.
View on arXiv