93
v1v2 (latest)

Modeling AdaGrad, RMSProp, and Adam with Integro-Differential Equations

Main:45 Pages
18 Figures
Bibliography:2 Pages
1 Tables
Abstract

In this paper, we propose a continuous-time formulation for the AdaGrad, RMSProp, and Adam optimization algorithms by modeling them as first-order integro-differential equations. We perform numerical simulations of these equations, along with stability and convergence analyses, to demonstrate their validity as accurate approximations of the original algorithms. Our results indicate a strong agreement between the behavior of the continuous-time models and the discrete implementations, thus providing a new perspective on the theoretical understanding of adaptive optimization methods.

View on arXiv
Comments on this paper