Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2006.06821
Cited By
To Each Optimizer a Norm, To Each Norm its Generalization
11 June 2020
Sharan Vaswani
Reza Babanezhad
Jose Gallego
Aaron Mishkin
Damien Scieur
Nicolas Le Roux
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"To Each Optimizer a Norm, To Each Norm its Generalization"
5 / 5 papers shown
Importance Tempering: Group Robustness for Overparameterized Models
Yiping Lu
Wenlong Ji
Zachary Izzo
Lexing Ying
317
7
0
19 Sep 2022
An Unconstrained Layer-Peeled Perspective on Neural Collapse
Wenlong Ji
Yiping Lu
Yiliang Zhang
Zhun Deng
Weijie J. Su
570
98
0
06 Oct 2021
Which Minimizer Does My Neural Network Converge To?
Manuel Nonnenmacher
David Reeb
Ingo Steinwart
ODL
254
6
0
04 Nov 2020
Whitening and second order optimization both make information in the dataset unusable during training, and can reduce or prevent generalization
Neha S. Wadia
Daniel Duckworth
S. Schoenholz
Ethan Dyer
Jascha Narain Sohl-Dickstein
492
18
0
17 Aug 2020
Implicit Regularization via Neural Feature Alignment
A. Baratin
Thomas George
César Laurent
R. Devon Hjelm
Guillaume Lajoie
Pascal Vincent
Damien Scieur
171
7
0
03 Aug 2020
1
Page 1 of 1