Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions

Abstract

We analyze stochastic conditional gradient methods for constrained optimization problems arising in over-parametrized machine learning. We show that one can leverage the interpolation-like conditions satisfied by such models to obtain improved oracle complexities. Specifically, when the objective function is convex, we show that the conditional gradient method requires $\mathcal{O}(\epsilon^{-2})$ calls to the stochastic gradient oracle to find an $\epsilon$-optimal solution. Furthermore, by including a gradient sliding step, we show that the number of calls reduces to $\mathcal{O}(\epsilon^{-1.5})$.
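For intuition, below is a minimal Python sketch of a generic stochastic conditional gradient (Frank-Wolfe) iteration over an $\ell_1$-ball constraint, run on a noiseless least-squares problem where interpolation holds. The function names, step-size rule, and toy data are illustrative assumptions, not the paper's exact variance-reduced or gradient-sliding variant.

```python
import numpy as np

def lmo_l1_ball(grad, radius=1.0):
    """Linear minimization oracle over the l1 ball: argmin_{||s||_1 <= r} <grad, s>."""
    i = np.argmax(np.abs(grad))
    s = np.zeros_like(grad)
    s[i] = -radius * np.sign(grad[i])
    return s

def stochastic_frank_wolfe(stoch_grad, x0, num_iters=1000, radius=1.0, seed=0):
    """Generic stochastic conditional gradient loop (illustrative sketch).

    stoch_grad(x, rng) should return an unbiased estimate of the gradient at x,
    e.g. the gradient on a sampled mini-batch.
    """
    rng = np.random.default_rng(seed)
    x = x0.copy()
    for t in range(num_iters):
        g = stoch_grad(x, rng)               # call to the stochastic gradient oracle
        s = lmo_l1_ball(g, radius)           # call to the linear minimization oracle
        gamma = 2.0 / (t + 2)                # standard open-loop step size
        x = (1 - gamma) * x + gamma * s      # convex combination keeps x feasible
    return x

# Toy example: noiseless least squares, so the model interpolates the data.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((200, 50))
    x_star = np.zeros(50)
    x_star[:5] = 0.1                          # sparse solution inside the l1 ball
    b = A @ x_star                            # no noise => zero optimal loss

    def sg(x, rng, batch=10):
        idx = rng.integers(0, A.shape[0], size=batch)
        Ab, bb = A[idx], b[idx]
        return Ab.T @ (Ab @ x - bb) / batch   # mini-batch gradient estimate

    x_hat = stochastic_frank_wolfe(sg, np.zeros(50), num_iters=2000)
    print("final loss:", 0.5 * np.mean((A @ x_hat - b) ** 2))
```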
