ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.00195
  4. Cited By
Weight Prediction Boosts the Convergence of AdamW

Weight Prediction Boosts the Convergence of AdamW

1 February 2023
Lei Guan
ArXivPDFHTML

Papers citing "Weight Prediction Boosts the Convergence of AdamW"

7 / 7 papers shown
Title
Optimizing Large Language Models for ESG Activity Detection in Financial Texts
Optimizing Large Language Models for ESG Activity Detection in Financial Texts
Mattia Birti
Francesco Osborne
Andrea Maurino
44
0
0
28 Feb 2025
One Step Learning, One Step Review
One Step Learning, One Step Review
Xiaolong Huang
Qiankun Li
Xueran Li
Xuesong Gao
33
1
0
19 Jan 2024
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
Lei Guan
Dongsheng Li
Jiye Liang
Wenjian Wang
Wenjian Wang
Xicheng Lu
35
1
0
01 Dec 2023
AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment
  on AdamW Basis
AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Lei Guan
ODL
34
3
0
05 Sep 2023
XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
Lei Guan
Dongsheng Li
Yanqi Shi
Jian Meng
ODL
38
2
0
26 May 2023
Are Transformers More Robust Than CNNs?
Are Transformers More Robust Than CNNs?
Yutong Bai
Jieru Mei
Alan Yuille
Cihang Xie
ViT
AAML
192
257
0
10 Nov 2021
Densely Connected Convolutional Networks
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
PINN
3DV
309
36,371
0
25 Aug 2016
1