2006.06821
To Each Optimizer a Norm, To Each Norm its Generalization
11 June 2020
Sharan Vaswani, Reza Babanezhad, Jose Gallego, Aaron Mishkin, Simon Lacoste-Julien, Nicolas Le Roux
Papers citing "To Each Optimizer a Norm, To Each Norm its Generalization"
3 / 3 papers shown
Importance Tempering: Group Robustness for Overparameterized Models
Yiping Lu, Wenlong Ji, Zachary Izzo, Lexing Ying
37
7
0
19 Sep 2022
Whitening and second order optimization both make information in the dataset unusable during training, and can reduce or prevent generalization
Neha S. Wadia, Daniel Duckworth, S. Schoenholz, Ethan Dyer, Jascha Narain Sohl-Dickstein
19
13
0
17 Aug 2020
When Does Preconditioning Help or Hurt Generalization?
S. Amari, Jimmy Ba, Roger C. Grosse, Xuechen Li, Atsushi Nitanda, Taiji Suzuki, Denny Wu, Ji Xu
26
32
0
18 Jun 2020