73
167

Sharp Convergence Rates for Langevin Dynamics in the Nonconvex Setting

Abstract

We study the problem of sampling from a distribution where the negative logarithm of the target density is LL-smooth everywhere and mm-strongly convex outside a ball of radius RR, but potentially non-convex inside this ball. We study both overdamped and underdamped Langevin MCMC and prove upper bounds on the time required to obtain a sample from a distribution that is within ϵ\epsilon of the target distribution in 11-Wasserstein distance. For the first-order method (overdamped Langevin MCMC), the time complexity is O~(ecLR2dϵ2)\tilde{\mathcal{O}}\left(e^{cLR^2}\frac{d}{\epsilon^2}\right), where dd is the dimension of the underlying space. For the second-order method (underdamped Langevin MCMC), the time complexity is O~(ecLR2dϵ)\tilde{\mathcal{O}}\left(e^{cLR^2}\frac{\sqrt{d}}{\epsilon}\right) for some explicit positive constant cc. Surprisingly, the convergence rate is only polynomial in the dimension dd and the target accuracy ϵ\epsilon. It is however exponential in the problem parameter LR2LR^2, which is a measure of non-logconcavity of the target distribution.

View on arXiv
Comments on this paper