We consider using gradient descent to minimize the nonconvex function over an factor matrix , in which is an underlying smooth convex cost function defined over matrices. While only a second-order stationary point can be provably found in reasonable time, if is additionally rank deficient, then its rank deficiency certifies it as being globally optimal. This way of certifying global optimality necessarily requires the search rank of the current iterate to be overparameterized with respect to the rank of the global minimizer . Unfortunately, overparameterization significantly slows down the convergence of gradient descent, from a linear rate with to a sublinear rate when , even when is strongly convex. In this paper, we propose an inexpensive preconditioner that restores the convergence rate of gradient descent back to linear in the overparameterized case, while also making it agnostic to possible ill-conditioning in the global minimizer .
View on arXiv@article{zhang2025_2206.03345, title={ Preconditioned Gradient Descent for Overparameterized Nonconvex Burer--Monteiro Factorization with Global Optimality Certification }, author={ Gavin Zhang and Salar Fattahi and Richard Y. Zhang }, journal={arXiv preprint arXiv:2206.03345}, year={ 2025 } }