Path Length Bounds for Gradient Descent and Flow

Abstract
We derive bounds on the path length of gradient descent (GD) and gradient flow (GF) curves for various classes of smooth convex and nonconvex functions. Among other results, we prove that: (a) if the iterates are linearly convergent with factor , then is at most ; (b) under the Polyak-Kurdyka-Lojasiewicz (PKL) condition, is at most , where is the condition number, and at least ; (c) for quadratics, is and in some cases can be independent of ; (d) assuming just convexity, can be at most ; (e) for separable quasiconvex functions, is . Thus, we advance current understanding of the properties of GD and GF curves beyond rates of convergence. We expect our techniques to facilitate future studies for other algorithms.
View on arXivComments on this paper