
Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion

Abstract

We present new algorithms for optimizing non-smooth, non-convex stochastic objectives based on a novel analysis technique. This improves the current best-known complexity for finding a $(\delta,\epsilon)$-stationary point from $O(\epsilon^{-4}\delta^{-1})$ stochastic gradient queries to $O(\epsilon^{-3}\delta^{-1})$, which we also show to be optimal. Our primary technique is a reduction from non-smooth non-convex optimization to online learning, after which our results follow from standard regret bounds in online learning. For deterministic and second-order smooth objectives, applying more advanced optimistic online learning techniques enables a new complexity of $O(\epsilon^{-1.5}\delta^{-0.5})$. Our techniques also recover all optimal or best-known results for finding $\epsilon$-stationary points of smooth or second-order smooth objectives in both stochastic and deterministic settings.
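The abstract describes the conversion only at a high level. As a rough illustration of the reduction, the sketch below instantiates the online learner with projected online gradient descent over increments of bounded norm, and queries the stochastic gradient at a uniformly random point on the segment between consecutive iterates. The names `grad_oracle`, `eta`, and `D` are illustrative placeholders, not the paper's exact parameter choices or pseudocode.

```python
import numpy as np

def online_to_nonconvex(grad_oracle, x0, T, D, eta, seed=0):
    """Illustrative sketch of an online-to-non-convex conversion.

    An online learner (here: projected online gradient descent over the
    ball of radius D) proposes increments Delta_t. The iterate moves by
    Delta_t, and a stochastic gradient is queried at a random point on
    the segment between the old and new iterates; that gradient defines
    the linear loss <g_t, .> fed back to the online learner.
    This is a hypothetical instantiation, not the authors' code.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float).copy()
    delta = np.zeros_like(x)          # online learner's first play
    queries = []
    for _ in range(T):
        x_prev = x.copy()
        x = x + delta                 # move by the learner's increment
        s = rng.uniform()             # random point on [x_prev, x]
        w = x_prev + s * delta
        g = grad_oracle(w)            # one stochastic gradient query
        queries.append((w, g))
        delta = delta - eta * g       # OGD step on the linear loss <g, .>
        norm = np.linalg.norm(delta)
        if norm > D:                  # project back onto ||Delta|| <= D
            delta *= D / norm
    return queries

# Example usage on a toy non-smooth objective f(x) = ||x||_1:
points = online_to_nonconvex(grad_oracle=np.sign,
                             x0=np.ones(5), T=100, D=0.1, eta=0.05)
```

Because the feedback losses are linear, any online learner with a standard regret bound can be plugged in; the regret bound then translates directly into a bound on the gradient norms at the queried points, which is the source of the improved complexity.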
