29
3
v1v2v3 (latest)

Stochastic regularized majorization-minimization with weakly convex and multi-convex surrogates

Abstract

Stochastic majorization-minimization (SMM) is a class of stochastic optimization algorithms that proceed by sampling new data points and minimizing a recursive average of surrogate functions of an objective function. The surrogates are required to be strongly convex and convergence rate analysis for the general non-convex setting was not available. In this paper, we propose an extension of SMM where surrogates are allowed to be only weakly convex or block multi-convex, and the averaged surrogates are approximately minimized with proximal regularization or block-minimized within diminishing radii, respectively. For the general nonconvex constrained setting with non-i.i.d. data samples, we show that the first-order optimality gap of the proposed algorithm decays at the rate O((logn)1+ϵ/n1/2)O((\log n)^{1+\epsilon}/n^{1/2}) for the empirical loss and O((logn)1+ϵ/n1/4)O((\log n)^{1+\epsilon}/n^{1/4}) for the expected loss, where nn denotes the number of data samples processed. Under some additional assumption, the latter convergence rate can be improved to O((logn)1+ϵ/n1/2)O((\log n)^{1+\epsilon}/n^{1/2}). As a corollary, we obtain the first convergence rate bounds for various optimization methods under general nonconvex dependent data setting: Double-averaging projected gradient descent and its generalizations, proximal point empirical risk minimization, and online matrix/tensor decomposition algorithms. We also provide experimental validation of our results.

View on arXiv
Comments on this paper