385
v1v2v3 (latest)

Convergence analysis of controlled particle systems arising in deep learning: from finite to infinite sample size

Main:42 Pages
Bibliography:4 Pages
Abstract

This paper deals with a class of neural SDEs and studies the limiting behavior of the associated sampled optimal control problems as the sample size grows to infinity. The neural SDEs with NN samples can be linked to the NN-particle systems with centralized control. We analyze the Hamilton-Jacobi-Bellman equation corresponding to the NN-particle system and establish regularity results which are uniform in NN. The uniform regularity estimates are obtained by the stochastic maximum principle and the analysis of a backward stochastic Riccati equation. Using these uniform regularity results, we show the convergence of the minima of the objective functionals and optimal parameters of the neural SDEs as the sample size NN tends to infinity. The limiting objects can be identified with suitable functions defined on the Wasserstein space of Borel probability measures. Furthermore, quantitative convergence rates are also obtained.

View on arXiv
Comments on this paper