80
19

Parameter-free Stochastic Optimization of Variationally Coherent Functions

Abstract

We design and analyze an algorithm for first-order stochastic optimization of a large class of functions on Rd\mathbb{R}^d. In particular, we consider the \emph{variationally coherent} functions which can be convex or non-convex. The iterates of our algorithm on variationally coherent functions converge almost surely to the global minimizer x\boldsymbol{x}^*. Additionally, the very same algorithm with the same hyperparameters, after TT iterations guarantees on convex functions that the expected suboptimality gap is bounded by O~(xx0T1/2+ϵ)\widetilde{O}(\|\boldsymbol{x}^* - \boldsymbol{x}_0\| T^{-1/2+\epsilon}) for any ϵ>0\epsilon>0. It is the first algorithm to achieve both these properties at the same time. Also, the rate for convex functions essentially matches the performance of parameter-free algorithms. Our algorithm is an instance of the Follow The Regularized Leader algorithm with the added twist of using \emph{rescaled gradients} and time-varying linearithmic regularizers.

View on arXiv
Comments on this paper