Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.18240
Cited By
XGrad: Boosting Gradient-Based Optimizers With Weight Prediction
26 May 2023
Lei Guan
Dongsheng Li
Yanqi Shi
Jian Meng
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XGrad: Boosting Gradient-Based Optimizers With Weight Prediction"
5 / 5 papers shown
Title
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
Lei Guan
Dongsheng Li
Jiye Liang
Wenjian Wang
Wenjian Wang
Xicheng Lu
20
1
0
01 Dec 2023
AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis
Lei Guan
ODL
16
3
0
05 Sep 2023
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,950
0
20 Apr 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
PINN
3DV
249
36,362
0
25 Aug 2016
1