Higher Order Generalization Error for First Order Discretization of Langevin Diffusion

Abstract
We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an $\epsilon$ tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run $\Omega(\epsilon^{-1} \log(\epsilon^{-1}))$ iterations with $\Omega(\epsilon^{-1})$ samples. In this article, we show that with additional smoothness assumptions, even first order methods can achieve arbitrarily better runtime complexity. More precisely, for each $N > 0$, we provide a sufficient smoothness condition on the loss function such that a first order discretization can reach $\epsilon$ expected generalization error given $\Omega(\epsilon^{-1/N} \log(\epsilon^{-1}))$ iterations with $\Omega(\epsilon^{-1})$ samples.
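For readers unfamiliar with SGLD, the following is a minimal sketch of a first order (Euler-Maruyama) discretization of Langevin diffusion with minibatch gradients. It is an illustration only, not the scheme analyzed in the paper: the function names (`sgld_step`, `run_sgld`, `quadratic_grad`), the inverse temperature parameter `inv_temp`, and the toy quadratic loss are all assumptions introduced here.

```python
import math
import random


def sgld_step(theta, grad_fn, batch, step_size, inv_temp, rng):
    """One step: theta - step_size * grad + sqrt(2 * step_size / inv_temp) * N(0, I)."""
    grad = grad_fn(theta, batch)
    # Noise scale of the Euler-Maruyama discretization; zero when inv_temp is infinite.
    noise_scale = math.sqrt(2.0 * step_size / inv_temp)
    return [
        t - step_size * g + noise_scale * rng.gauss(0.0, 1.0)
        for t, g in zip(theta, grad)
    ]


def quadratic_grad(theta, batch):
    """Gradient of the (hypothetical) loss: mean over the batch of 0.5 * ||theta - z||^2."""
    n = len(batch)
    return [sum(t - z[i] for z in batch) / n for i, t in enumerate(theta)]


def run_sgld(data, theta0, n_iters, batch_size, step_size, inv_temp, seed=0):
    """Run n_iters SGLD steps, drawing a fresh minibatch without replacement each step."""
    rng = random.Random(seed)
    theta = list(theta0)
    for _ in range(n_iters):
        batch = rng.sample(data, batch_size)
        theta = sgld_step(theta, quadratic_grad, batch, step_size, inv_temp, rng)
    return theta


if __name__ == "__main__":
    samples = [[1.0], [3.0], [2.5], [1.5]]
    print(run_sgld(samples, [0.0], n_iters=500, batch_size=2, step_size=0.05, inv_temp=50.0))
```

In this sketch, the number of calls to `sgld_step` plays the role of the iteration count in the abstract, and `len(data)` plays the role of the sample count. Setting `inv_temp` to `math.inf` removes the injected noise and reduces the update to plain (stochastic) gradient descent.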