Improving the convergence of SGD through adaptive batch sizes

18 October 2019

Papers citing "Improving the convergence of SGD through adaptive batch sizes"

2 / 2 papers shown

Title
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima N. Keskar Dheevatsa Mudigere J. Nocedal M. Smelyanskiy P. T. P. Tang ODL 310 2,896 0 15 Sep 2016
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition Hamed Karimi J. Nutini Mark Schmidt 139 1,205 0 16 Aug 2016