2403.04206
GRAWA: Gradient-based Weighted Averaging for Distributed Training of Deep Learning Models
7 March 2024
Tolga Dimlioglu
A. Choromańska
Papers citing "GRAWA: Gradient-based Weighted Averaging for Distributed Training of Deep Learning Models"
3 / 3 papers shown
Title
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,886
0
15 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
PINN
3DV
247
36,356
0
25 Aug 2016
The Loss Surfaces of Multilayer Networks
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
175
1,185
0
30 Nov 2014