Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.00574
Cited By
Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms
1 March 2024
Toki Tahmid Inan
Mingrui Liu
Amarda Shehu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms"
2 / 2 papers shown
Title
The Dynamics of Sharpness-Aware Minimization: Bouncing Across Ravines and Drifting Towards Wide Minima
Peter L. Bartlett
Philip M. Long
Olivier Bousquet
63
34
0
04 Oct 2022
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,878
0
15 Sep 2016
1