
Convergence and Convergence Rate of Stochastic Gradient Search in the Case of Multiple and Non-Isolated Extrema

Abstract

The asymptotic behavior of stochastic gradient algorithms is studied. Relying on results from differential geometry (the Łojasiewicz gradient inequality), the single limit-point convergence of the algorithm iterates is demonstrated and relatively tight bounds on the convergence rate are derived. In sharp contrast to the existing asymptotic results, the new results presented here do not require the objective function to have an isolated minimum, nor to be strongly convex in an open vicinity of that minimum. On the contrary, they allow the objective function to have multiple and non-isolated minima. They also offer new insights into the asymptotic properties of several classes of recursive algorithms that are routinely used in machine learning, statistics, engineering and operations research.
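
For readers unfamiliar with the setting, the recursion in question is the standard stochastic gradient scheme, and the key analytical tool is the Łojasiewicz gradient inequality. The sketch below uses illustrative notation (iterates \theta_n, step sizes \gamma_n, gradient noise \xi_n) assumed here for exposition, not taken from the paper itself:

% Stochastic gradient recursion (standard form; step-size conditions are the
% typical ones for this kind of analysis, assumed here for illustration):
\[
  \theta_{n+1} = \theta_n - \gamma_n \bigl( \nabla f(\theta_n) + \xi_n \bigr),
  \qquad \gamma_n > 0, \quad \sum_n \gamma_n = \infty, \quad \sum_n \gamma_n^2 < \infty .
\]
% Łojasiewicz gradient inequality (one common statement): if f is real-analytic
% and a is a critical point of f, then there exist \delta > 0, C > 0 and an
% exponent \mu \in [1/2, 1) such that
\[
  \lvert f(\theta) - f(a) \rvert^{\mu} \le C \, \lVert \nabla f(\theta) \rVert
  \quad \text{for all } \lVert \theta - a \rVert < \delta .
\]

The inequality quantifies how slowly f can flatten out near a critical point, which is what makes it possible to bound the convergence rate without assuming the minimum is isolated or that the objective is strongly convex in its vicinity.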
