282

Constrained Langevin Algorithms with L-mixing External Random Variables

Neural Information Processing Systems (NeurIPS), 2022
Abstract

Langevin algorithms are gradient descent methods augmented with additive noise, and are widely used in Markov Chain Monte Carlo (MCMC) sampling, optimization, and learning. In recent years, the non-asymptotic analysis of Langevin algorithms for non-convex optimization learning has been extensively explored. For constrained problems with non-convex losses over compact convex domain in the case of IID data variables, Langevin algorithm achieves a deviation of O(T1/4(logT)1/2)O(T^{-1/4} (\log T)^{1/2}) from its target distribution [22]. In this paper, we obtain a deviation of O(T1/2logT)O(T^{-1/2} \log T) in 11-Wasserstein distance for non-convex losses with LL-mixing data variables and polyhedral constraints (which are not necessarily bounded). This deviation indicates that our convergence rate is faster than those in the previous works on constrained Langevin algorithms for non-convex optimization.

View on arXiv
Comments on this paper