Accelerating SGD with momentum for over-parameterized learning
Chaoyue Liu, M. Belkin
arXiv:1810.13395 (v5, latest), 31 October 2018
ODL

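For context on the cited paper's topic, the short sketch below runs plain mini-batch SGD with heavy-ball momentum on a noiseless least-squares problem, i.e. an interpolation setting of the kind the title refers to. It is only an illustrative baseline, not the paper's accelerated method; the toy data, step size, and momentum value are placeholder choices.

# Illustrative sketch only: mini-batch SGD with heavy-ball momentum on a
# noiseless (interpolating) least-squares problem. Hyperparameters and data
# are placeholders, not values from the paper.
import numpy as np

rng = np.random.default_rng(0)
n, d = 50, 200                        # more parameters than samples
X = rng.standard_normal((n, d))
w_star = rng.standard_normal(d)
y = X @ w_star                        # noiseless targets: an exact fit exists

def stochastic_grad(w, batch):
    """Mini-batch gradient of the averaged squared-error loss."""
    xb, yb = X[batch], y[batch]
    return xb.T @ (xb @ w - yb) / len(batch)

w = np.zeros(d)
v = np.zeros(d)                       # momentum buffer
lr, beta, batch_size = 0.01, 0.9, 10

for step in range(3000):
    batch = rng.choice(n, size=batch_size, replace=False)
    v = beta * v + stochastic_grad(w, batch)   # heavy-ball momentum update
    w = w - lr * v

print("final training loss:", 0.5 * np.mean((X @ w - y) ** 2))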

Papers citing "Accelerating SGD with momentum for over-parameterized learning"

12 citing papers shown.

Nesterov acceleration in benignly non-convex landscapes
International Conference on Learning Representations (ICLR), 2024
Kanan Gupta, Stephan Wojtowytsch
10 Oct 2024

Accelerated stochastic approximation with state-dependent noise
Mathematical Programming (Math. Program.), 2023
Sasila Ilandarideva, A. Juditsky, Guanghui Lan, Tianjiao Li
04 Jul 2023

Adaptive Step-Size Methods for Compressed SGD
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Adarsh M. Subramaniam, A. Magesh, Venugopal V. Veeravalli
20 Jul 2022

Nesterov Accelerated Shuffling Gradient Method for Convex Optimization
International Conference on Machine Learning (ICML), 2022
Trang H. Tran, K. Scheinberg, Lam M. Nguyen
07 Feb 2022

An Even More Optimal Stochastic Optimization Algorithm: Minibatching and Interpolation Learning
Neural Information Processing Systems (NeurIPS), 2021
Blake E. Woodworth, Nathan Srebro
04 Jun 2021

A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints
Journal of Machine Learning Research (JMLR), 2020
Guodong Zhang, Xuchao Bao, Laurent Lessard, Roger C. Grosse
23 Sep 2020

On the Generalization Benefit of Noise in Stochastic Gradient Descent
Samuel L. Smith, Erich Elsen, Soham De
MLT
26 Jun 2020

DEED: A General Quantization Scheme for Communication Efficiency in Bits
Tian-Chun Ye, Peijun Xiao, Tian Ding
FedML, MQ
19 Jun 2020

Optimization for deep learning: theory and algorithms
Tian Ding
ODL
19 Dec 2019

Student Specialization in Deep ReLU Networks With Finite Width and Input Dimension
Yuandong Tian
MLT
30 Sep 2019

Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates
Neural Information Processing Systems (NeurIPS), 2019
Sharan Vaswani, Aaron Mishkin, I. Laradji, Mark Schmidt, Gauthier Gidel, Damien Scieur
ODL
24 May 2019

Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron
Sharan Vaswani, Francis R. Bach, Mark Schmidt
16 Oct 2018