Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.08947
Cited By
Exploring Generalization in Deep Learning
27 June 2017
Behnam Neyshabur
Srinadh Bhojanapalli
David A. McAllester
Nathan Srebro
FAtt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring Generalization in Deep Learning"
50 / 305 papers shown
Title
An Adaptive Policy to Employ Sharpness-Aware Minimization
Weisen Jiang
Hansi Yang
Yu Zhang
James T. Kwok
AAML
81
31
0
28 Apr 2023
Fundamental Tradeoffs in Learning with Prior Information
Anirudha Majumdar
32
0
0
26 Apr 2023
Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation
Tianli Zhang
Mengqi Xue
Jiangtao Zhang
Haofei Zhang
Yu Wang
Lechao Cheng
Jie Song
Mingli Song
28
5
0
26 Mar 2023
Randomized Adversarial Training via Taylor Expansion
Gao Jin
Xinping Yi
Dengyu Wu
Ronghui Mu
Xiaowei Huang
AAML
41
34
0
19 Mar 2023
Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap
Weiyang Liu
L. Yu
Adrian Weller
Bernhard Schölkopf
37
17
0
11 Mar 2023
Provable Pathways: Learning Multiple Tasks over Multiple Paths
Yingcong Li
Samet Oymak
MoE
26
4
0
08 Mar 2023
DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks
Samyak Jain
Sravanti Addepalli
P. Sahu
Priyam Dey
R. Venkatesh Babu
MoMe
OOD
43
20
0
28 Feb 2023
SAM operates far from home: eigenvalue regularization as a dynamical phenomenon
Atish Agarwala
Yann N. Dauphin
21
20
0
17 Feb 2023
Exploring the Effect of Multi-step Ascent in Sharpness-Aware Minimization
Hoki Kim
Jinseong Park
Yujin Choi
Woojin Lee
Jaewook Lee
20
9
0
27 Jan 2023
Understanding the Spectral Bias of Coordinate Based MLPs Via Training Dynamics
J. Lazzari
Xiuwen Liu
24
3
0
14 Jan 2023
Learning Latent Representations to Co-Adapt to Humans
Sagar Parekh
Dylan P. Losey
18
12
0
19 Dec 2022
Adversarial Weight Perturbation Improves Generalization in Graph Neural Networks
Yihan Wu
Aleksandar Bojchevski
Heng Huang
AAML
34
30
0
09 Dec 2022
Challenging the Universal Representation of Deep Models for 3D Point Cloud Registration
David Bojanić
Kristijan Bartol
J. Forest
Stefan Gumhold
Tomislav Petković
Tomislav Pribanić
3DPC
33
0
0
29 Nov 2022
Improving Robust Generalization by Direct PAC-Bayesian Bound Minimization
Zifa Wang
Nan Ding
Tomer Levinboim
Xi Chen
Radu Soricut
AAML
35
5
0
22 Nov 2022
Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States
Ziqiao Wang
Yongyi Mao
27
10
0
19 Nov 2022
REPAIR: REnormalizing Permuted Activations for Interpolation Repair
Keller Jordan
Hanie Sedghi
O. Saukh
R. Entezari
Behnam Neyshabur
MoMe
46
94
0
15 Nov 2022
Quantifying the Impact of Label Noise on Federated Learning
Shuqi Ke
Chao Huang
Xin Liu
FedML
28
7
0
15 Nov 2022
How Does Sharpness-Aware Minimization Minimize Sharpness?
Kaiyue Wen
Tengyu Ma
Zhiyuan Li
AAML
23
47
0
10 Nov 2022
Do highly over-parameterized neural networks generalize since bad solutions are rare?
Julius Martinetz
T. Martinetz
22
1
0
07 Nov 2022
Instance-Dependent Generalization Bounds via Optimal Transport
Songyan Hou
Parnian Kassraie
Anastasis Kratsios
Andreas Krause
Jonas Rothfuss
22
6
0
02 Nov 2022
Improving Lipschitz-Constrained Neural Networks by Learning Activation Functions
Stanislas Ducotterd
Alexis Goujon
Pakshal Bohra
Dimitris Perdios
Sebastian Neumayer
M. Unser
35
12
0
28 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
Hong Liu
Sang Michael Xie
Zhiyuan Li
Tengyu Ma
AI4CE
40
49
0
25 Oct 2022
Sufficient Invariant Learning for Distribution Shift
Taero Kim
Sungjun Lim
Kyungwoo Song
OOD
31
2
0
24 Oct 2022
Wasserstein Barycenter-based Model Fusion and Linear Mode Connectivity of Neural Networks
A. K. Akash
Sixu Li
Nicolas García Trillos
31
12
0
13 Oct 2022
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
Peng Mi
Li Shen
Tianhe Ren
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
Dacheng Tao
AAML
27
69
0
11 Oct 2022
Second-order regression models exhibit progressive sharpening to the edge of stability
Atish Agarwala
Fabian Pedregosa
Jeffrey Pennington
25
26
0
10 Oct 2022
Novice Type Error Diagnosis with Natural Language Models
Chuqin Geng
Haolin Ye
Yixuan Li
Tianyu Han
B. Pientka
X. Si
25
3
0
07 Oct 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
26
4
0
30 Sep 2022
Quantile-constrained Wasserstein projections for robust interpretability of numerical and machine learning models
Marouane Il Idrissi
Nicolas Bousquet
Fabrice Gamboa
Bertrand Iooss
Jean-Michel Loubes
35
2
0
23 Sep 2022
Test-Time Adaptation with Principal Component Analysis
Thomas Cordier
Victor Bouvier
Gilles Hénaff
C´eline Hudelot
TTA
25
1
0
13 Sep 2022
Instance-Dependent Noisy Label Learning via Graphical Modelling
Arpit Garg
Cuong C. Nguyen
Rafael Felix
Thanh-Toan Do
G. Carneiro
NoLa
34
27
0
02 Sep 2022
On the Implicit Bias in Deep-Learning Algorithms
Gal Vardi
FedML
AI4CE
34
72
0
26 Aug 2022
An Impartial Take to the CNN vs Transformer Robustness Contest
Francesco Pinto
Philip H. S. Torr
P. Dokania
UQCV
AAML
27
48
0
22 Jul 2022
On the Strong Correlation Between Model Invariance and Generalization
Weijian Deng
Stephen Gould
Liang Zheng
OOD
32
16
0
14 Jul 2022
Lipschitz Continuity Retained Binary Neural Network
Yuzhang Shang
Dan Xu
Bin Duan
Ziliang Zong
Liqiang Nie
Yan Yan
16
19
0
13 Jul 2022
PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners
Anthony Sicilia
Katherine Atwell
Malihe Alikhani
Seong Jae Hwang
BDL
51
9
0
12 Jul 2022
A Deep Learning Approach for the solution of Probability Density Evolution of Stochastic Systems
S. Pourtakdoust
Amir H. Khodabakhsh
33
12
0
05 Jul 2022
Aug-NeRF: Training Stronger Neural Radiance Fields with Triple-Level Physically-Grounded Augmentations
Tianlong Chen
Peihao Wang
Zhiwen Fan
Zhangyang Wang
36
55
0
04 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
30
36
0
03 Jul 2022
Integral Probability Metrics PAC-Bayes Bounds
Ron Amit
Baruch Epstein
Shay Moran
Ron Meir
24
18
0
01 Jul 2022
Federated Latent Class Regression for Hierarchical Data
Bin Yang
T. Carette
Masanobu Jimbo
Shinya Maruyama
FedML
15
0
0
22 Jun 2022
Gradient-Based Adversarial and Out-of-Distribution Detection
Jinsol Lee
Mohit Prabhushankar
Ghassan AlRegib
UQCV
32
13
0
16 Jun 2022
Reconstructing Training Data from Trained Neural Networks
Niv Haim
Gal Vardi
Gilad Yehudai
Ohad Shamir
Michal Irani
40
132
0
15 Jun 2022
Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction
Kaifeng Lyu
Zhiyuan Li
Sanjeev Arora
FAtt
37
69
0
14 Jun 2022
Towards Understanding Sharpness-Aware Minimization
Maksym Andriushchenko
Nicolas Flammarion
AAML
26
133
0
13 Jun 2022
Intrinsic dimensionality and generalization properties of the
R
\mathcal{R}
R
-norm inductive bias
Navid Ardeshir
Daniel J. Hsu
Clayton Sanford
CML
AI4CE
18
6
0
10 Jun 2022
Meet You Halfway: Explaining Deep Learning Mysteries
Oriel BenShmuel
AAML
FedML
FAtt
OOD
22
0
0
09 Jun 2022
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power
Binghui Li
Jikai Jin
Han Zhong
J. Hopcroft
Liwei Wang
OOD
82
27
0
27 May 2022
On the Effective Number of Linear Regions in Shallow Univariate ReLU Networks: Convergence Guarantees and Implicit Bias
Itay Safran
Gal Vardi
Jason D. Lee
MLT
56
23
0
18 May 2022
Discovering and Explaining the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions
Fang Wu
Siyuan Li
Lirong Wu
Dragomir R. Radev
Stan Z. Li
27
2
0
15 May 2022
Previous
1
2
3
4
5
6
7
Next