279

Gradient Descent Converges to Minimizers: The Case of Non-Isolated Critical Points

Abstract

We prove that the set of initial conditions so that gradient descent converges to strict saddle points has (Lebesgue) measure zero, even for non-isolated critical points, answering an open question in [Lee, Simchowitz, Jordan, Recht, COLT2016].

View on arXiv
Comments on this paper