arXiv:2305.18490
SANE: The phases of gradient descent through Sharpness Adjusted Number of Effective parameters
29 May 2023
Lawrence Wang
Stephen J. Roberts
Papers citing "SANE: The phases of gradient descent through Sharpness Adjusted Number of Effective parameters"
2 / 2 papers shown
Title
The large learning rate phase of deep learning: the catapult mechanism
Aitor Lewkowycz
Yasaman Bahri
Ethan Dyer
Jascha Narain Sohl-Dickstein
Guy Gur-Ari
ODL
156
233
0
04 Mar 2020
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
273
2,886
0
15 Sep 2016