Submatrix localization via message passing

The principal submatrix localization problem deals with recovering a principal submatrix of elevated mean in a large symmetric matrix subject to additive standard Gaussian noise. This problem serves as a prototypical example for community detection, in which the community corresponds to the support of the submatrix. The main result of this paper is that in the regime , the support of the submatrix can be weakly recovered (with misclassification errors on average) by an optimized message passing algorithm if , the signal-to-noise ratio, exceeds . This extends a result by Deshpande and Montanari previously obtained for In addition, the algorithm can be extended to provide exact recovery whenever information-theoretically possible and achieve the information limit of exact recovery as long as . The total running time of the algorithm is . Another version of the submatrix localization problem, known as noisy biclustering, aims to recover a submatrix of elevated mean in a large Gaussian matrix. The optimized message passing algorithm and its analysis are adapted to the bicluster problem assuming and A sharp information-theoretic condition for the weak recovery of both clusters is also identified.
View on arXiv