Papers
Communities
Events
Blog
Pricing
Search
Open menu
All Papers
Title
Home
Papers
2001.02312
Cited By
Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well
7 January 2020
Vipul Gupta
S. Serrano
D. DeCoste
MoMe
Papers citing
"Stochastic Weight Averaging in Parallel: Large-Batch Training that Generalizes Well"
2 / 2 papers shown
Title
There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average
Ben Athiwaratkun
Marc Finzi
Pavel Izmailov
A. Wilson
35
232
0
14 Jun 2018
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
65
2696
0
15 Sep 2016
1