v1v2 (latest)

On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima

15 September 2016

Papers citing "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"

4 / 1,554 papers shown

Title
Distributed Training of Deep Neural Networks: Theoretical and Practical Limits of Parallel Scalability J. Keuper Franz-Josef Pfreundt GNN 119 98 0 22 Sep 2016
Spectral Methods for Correlated Topic Models Forough Arabshahi Anima Anandkumar OOD 45 2 0 30 May 2016
Parallelizing Word2Vec in Shared and Distributed Memory Shihao Ji N. Satish Sheng Li Pradeep Dubey VLM MoE 51 72 0 15 Apr 2016
Revisiting Distributed Synchronous SGD Jianmin Chen Xinghao Pan R. Monga Samy Bengio Rafal Jozefowicz 91 801 0 04 Apr 2016