2312.03885
Gathering and Exploiting Higher-Order Information when Training Large Structured Models
6 December 2023
Pierre Wolinski
ODL
Papers citing "Gathering and Exploiting Higher-Order Information when Training Large Structured Models"
11 / 11 papers shown
Sharpness-Aware Minimization for Efficiently Improving Generalization
Pierre Foret
Ariel Kleiner
H. Mobahi
Behnam Neyshabur
AAML
370
1,474
0
03 Oct 2020
Similarity of Neural Network Representations Revisited
Simon Kornblith
Mohammad Norouzi
Honglak Lee
Geoffrey E. Hinton
261
1,523
0
01 May 2019
Are All Layers Created Equal?
Chiyuan Zhang
Samy Bengio
Y. Singer
198
147
0
06 Feb 2019
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Arthur Jacot
Franck Gabriel
Clément Hongler
559
3,359
0
20 Jun 2018
Block Mean Approximation for Efficient Second Order Optimization
Yao Lu
Mehrtash Harandi
Leonid Sigal
Razvan Pascanu
ODL
77
4
0
16 Apr 2018
Empirical Analysis of the Hessian of Over-Parametrized Neural Networks
Levent Sagun
Utku Evci
V. U. Güney
Yann N. Dauphin
Léon Bottou
191
423
0
14 Jun 2017
Sharp Minima Can Generalize For Deep Nets
Laurent Dinh
Razvan Pascanu
Samy Bengio
Yoshua Bengio
ODL
257
792
0
15 Mar 2017
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
Djork-Arné Clevert
Thomas Unterthiner
Sepp Hochreiter
411
5,644
0
23 Nov 2015
Optimizing Neural Networks with Kronecker-factored Approximate Curvature
James Martens
Roger C. Grosse
ODL
400
1,064
0
19 Mar 2015
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
2.5K
102,452
0
04 Sep 2014
Riemannian metrics for neural networks I: feedforward networks
Yann Ollivier
243
105
0
04 Mar 2013