Gradient Descent Converges to Minimizers: The Case of Non-Isolated
Critical Points
Abstract
We prove that the set of initial conditions so that gradient descent converges to strict saddle points has (Lebesgue) measure zero, even for non-isolated critical points, answering an open question in [Lee, Simchowitz, Jordan, Recht, COLT2016].
View on arXivComments on this paper
