1910.08222
Improving the convergence of SGD through adaptive batch sizes
18 October 2019
Scott Sievert
Zachary B. Charles
ODL
Papers citing
"Improving the convergence of SGD through adaptive batch sizes"
2 / 2 papers shown
Title
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
310
2,896
0
15 Sep 2016
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark Schmidt
139
1,205
0
16 Aug 2016