Gradient Descent Converges to Minimizers: The Case of Non-Isolated Critical Points

2 May 2016

Abstract

We prove that the set of initial conditions so that gradient descent converges to strict saddle points has (Lebesgue) measure zero, even for non-isolated critical points, answering an open question in [Lee, Simchowitz, Jordan, Recht, COLT2016].

View on arXiv

Comments on this paper