Central limit theorems for stochastic gradient descent with averaging for stable manifolds

Abstract

In this article we establish new central limit theorems for Ruppert-Polyak averaged stochastic gradient descent schemes. Compared to previous work we do not assume that convergence occurs to an isolated attractor but instead allow convergence to a stable manifold. On the stable manifold the target function is constant and the oscillations in the tangential direction may be significantly larger than the ones in the normal direction. As we show, one still recovers a central limit theorem with the same rates as in the case of isolated attractors. Here we consider step-sizes $\gamma_n = n^{-\gamma}$ with $\gamma \in (\frac{3}{4}, 1)$, typically.
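
To make the setting concrete, the following is a minimal sketch of Ruppert-Polyak averaging with step-sizes $\gamma_n = n^{-\gamma}$ on a toy problem. Everything specific in it is an assumption made for illustration and is not taken from the paper: the target function $f(x, y) = y^2/2$ (whose minimizers form the line $\{y = 0\}$, so that $x$ is the tangential and $y$ the normal coordinate), the additive Gaussian gradient noise, the parameter values, and the hypothetical function name `averaged_sgd`.

```python
import random


def averaged_sgd(num_steps=20000, gamma=0.8, noise_std=1.0, seed=0):
    """Ruppert-Polyak averaged SGD on the toy target f(x, y) = y**2 / 2.

    Illustrative assumption, not the paper's setting: the minimizers
    form the line {y = 0}; x is tangential to it, y is normal to it.
    """
    rng = random.Random(seed)
    x, y = 0.0, 1.0            # arbitrary starting point
    x_bar, y_bar = 0.0, 0.0    # running averages of the iterates
    for n in range(1, num_steps + 1):
        step = n ** (-gamma)   # step-size gamma_n = n^{-gamma}
        # Noisy gradient: exact gradient (0, y) plus Gaussian noise.
        grad_x = rng.gauss(0.0, noise_std)
        grad_y = y + rng.gauss(0.0, noise_std)
        x -= step * grad_x
        y -= step * grad_y
        # Incremental update of the average of the first n iterates.
        x_bar += (x - x_bar) / n
        y_bar += (y - y_bar) / n
    return (x, y), (x_bar, y_bar)


(last_x, last_y), (avg_x, avg_y) = averaged_sgd()
print(f"last iterate:     x = {last_x:+.4f}, y = {last_y:+.4f}")
print(f"averaged iterate: x = {avg_x:+.4f}, y = {avg_y:+.4f}")
```

In this toy example the gradient exerts no pull in the tangential coordinate, so `x` moves only through accumulated noise, while the average of the normal coordinate concentrates near zero. The sketch only illustrates the averaging scheme and the split into tangential and normal directions; it does not reproduce the paper's assumptions or results.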
