20
24

Improved Global Guarantees for the Nonconvex Burer--Monteiro Factorization via Rank Overparameterization

Abstract

We consider minimizing a twice-differentiable, LL-smooth, and μ\mu-strongly convex objective ϕ\phi over an n×nn\times n positive semidefinite matrix M0M\succeq0, under the assumption that the minimizer MM^{\star} has low rank rnr^{\star}\ll n. Following the Burer--Monteiro approach, we instead minimize the nonconvex objective f(X)=ϕ(XXT)f(X)=\phi(XX^{T}) over a factor matrix XX of size n×rn\times r. This substantially reduces the number of variables from O(n2)O(n^{2}) to as few as O(n)O(n) and also enforces positive semidefiniteness for free, but at the cost of giving up the convexity of the original problem. In this paper, we prove that if the search rank rrr\ge r^{\star} is overparameterized by a \emph{constant factor} with respect to the true rank rr^{\star}, namely as in r>14(L/μ1)2rr>\frac{1}{4}(L/\mu-1)^{2}r^{\star}, then despite nonconvexity, local optimization is guaranteed to globally converge from any initial point to the global optimum. This significantly improves upon a previous rank overparameterization threshold of rnr\ge n, which we show is sharp in the absence of smoothness and strong convexity, but would increase the number of variables back up to O(n2)O(n^{2}). Conversely, without rank overparameterization, we prove that such a global guarantee is possible if and only if ϕ\phi is almost perfectly conditioned, with a condition number of L/μ<3L/\mu<3. Therefore, we conclude that a small amount of overparameterization can lead to large improvements in theoretical guarantees for the nonconvex Burer--Monteiro factorization.

View on arXiv
Comments on this paper