Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent

SIAM Journal on Imaging Sciences (SIIMS), 2020
24 February 2020
Bao Wang
T. Nguyen
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
    ODL

Papers citing "Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent"

22 / 22 papers shown
Adaptive Memory Momentum via a Model-Based Framework for Deep Learning Optimization
Kristi Topollai
A. Choromańska
ODL
417
1
0
06 Oct 2025
PnP-CM: Consistency Models as Plug-and-Play Priors for Inverse Problems
Merve Gülle
Junno Yun
Yasar Utku Alçalar
Mehmet Akçakaya
MedIm
378
3
0
25 Sep 2025
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
Neural Information Processing Systems (NeurIPS), 2024
R. Teo
Tan M. Nguyen
MoE
245
6
0
18 Oct 2024
Resetting the Optimizer in Deep RL: An Empirical Study
Neural Information Processing Systems (NeurIPS), 2023
Kavosh Asadi
Rasool Fakoor
Shoham Sabach
ODL
323
32
0
30 Jun 2023
AdaSAM: Boosting Sharpness-Aware Minimization with Adaptive Learning Rate and Momentum for Training Deep Neural Networks
Neural Networks (Neural Netw.), 2023
Hao Sun
Li Shen
Qihuang Zhong
Liang Ding
Shi-Yong Chen
Jingwei Sun
Jing Li
Guangzhong Sun
Dacheng Tao
210
47
0
01 Mar 2023
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
Mathematical and Scientific Machine Learning (MSML), 2022
T. Nguyen
Richard G. Baraniuk
Robert M. Kirby
Stanley J. Osher
Bao Wang
372
10
0
01 Aug 2022
Last-iterate convergence analysis of stochastic momentum methods for neural networks
Neurocomputing, 2022
Dongpo Xu
Jinlan Liu
Yinghua Lu
Jun Kong
Danilo Mandic
101
13
0
30 May 2022
An Adaptive Gradient Method with Energy and Momentum
Annals of Applied Mathematics (AAM), 2022
Hailiang Liu
Xuping Tian
ODL
226
10
0
23 Mar 2022
Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs
Journal of Scientific Computing (J. Sci. Comput.), 2022
Justin Baker
E. Cherkaev
A. Narayan
Bao Wang
AI4CE
417
8
0
24 Feb 2022
Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization
Tao Sun
Huaming Ling
Zuoqiang Shi
Dongsheng Li
Bao Wang
ODL
211
13
0
18 Oct 2021
Improving Transformers with Probabilistic Attention Keys
Tam Nguyen
T. Nguyen
Dung D. Le
Duy Khuong Nguyen
Viet-Anh Tran
Richard G. Baraniuk
Nhat Ho
Stanley J. Osher
261
38
0
16 Oct 2021
How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Bao Wang
Hedi Xia
T. Nguyen
Stanley Osher
AI4CE
247
13
0
13 Oct 2021
AIR-Net: Adaptive and Implicit Regularization Neural Network for Matrix Completion
SIAM Journal on Imaging Sciences (SIAM J. Imaging Sci.), 2021
Zhemin Li
Tao Sun
Hongxia Wang
Bao Wang
255
10
0
12 Oct 2021
Heavy Ball Neural Ordinary Differential Equations
Neural Information Processing Systems (NeurIPS), 2021
Hedi Xia
Vai Suliafu
H. Ji
T. Nguyen
Andrea L. Bertozzi
Stanley J. Osher
Bao Wang
227
70
0
10 Oct 2021
Accelerated Componentwise Gradient Boosting using Efficient Data Representation and Momentum-based Optimization
Daniel Schalk
J. Herbinger
David Rügamer
249
4
0
07 Oct 2021
Accelerated Gradient Descent Learning over Multiple Access Fading Channels
IEEE Journal on Selected Areas in Communications (JSAC), 2021
Raz Paul
Yuval Friedman
Kobi Cohen
322
34
0
26 Jul 2021
Momentum-inspired Low-Rank Coordinate Descent for Diagonally Constrained SDPs
Junhyung Lyle Kim
Jose Antonio Lara Benitez
Taha Toghani
Cameron R. Wolfe
Zhiwei Zhang
Anastasios Kyrillidis
229
1
0
16 Jun 2021
Convolutional Neural Network(CNN/ConvNet) in Stock Price Movement Prediction
Kunal Bhardwaj
164
5
0
03 Jun 2021
Stability and Generalization of the Decentralized Stochastic Gradient Descent
Tao Sun
Dongsheng Li
Bao Wang
297
0
0
02 Feb 2021
Stochastic Gradient Descent with Nonlinear Conjugate Gradient-Style Adaptive Momentum
Bao Wang
Qiang Ye
ODL
226
16
0
03 Dec 2020
SMG: A Shuffling Gradient-Based Method with Momentum
International Conference on Machine Learning (ICML), 2020
Trang H. Tran
Lam M. Nguyen
Quoc Tran-Dinh
437
25
0
24 Nov 2020
Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Robin M. Schmidt
Frank Schneider
Philipp Hennig
ODL
928
195
0
03 Jul 2020