Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.03530
Cited By
Understanding deep learning requires rethinking generalization
10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding deep learning requires rethinking generalization"
50 / 1,028 papers shown
Title
Linear Stability Hypothesis and Rank Stratification for Nonlinear Models
Yaoyu Zhang
Zhongwang Zhang
Leyang Zhang
Zhiwei Bai
Tao Luo
Z. Xu
27
7
0
21 Nov 2022
Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States
Ziqiao Wang
Yongyi Mao
30
10
0
19 Nov 2022
Why Deep Learning Generalizes
Benjamin L. Badger
TDI
AI4CE
20
3
0
17 Nov 2022
On the Sample Complexity of Two-Layer Networks: Lipschitz vs. Element-Wise Lipschitz Activation
Amit Daniely
Elad Granot
MLT
32
1
0
17 Nov 2022
REPAIR: REnormalizing Permuted Activations for Interpolation Repair
Keller Jordan
Hanie Sedghi
O. Saukh
R. Entezari
Behnam Neyshabur
MoMe
46
94
0
15 Nov 2022
Robust Training of Graph Neural Networks via Noise Governance
Siyi Qian
Haochao Ying
Renjun Hu
Jingbo Zhou
Jintai Chen
Danny Chen
Jian Wu
NoLa
33
34
0
12 Nov 2022
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction
Xuming Hu
Shiao Meng
Chenwei Zhang
Xiangli Yang
Lijie Wen
Irwin King
Philip S. Yu
52
0
0
11 Nov 2022
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators
Aditya Manglik
Minesh Patel
Haiyu Mao
Behzad Salami
Jisung Park
Lois Orosa
O. Mutlu
20
1
0
10 Nov 2022
How Does Sharpness-Aware Minimization Minimize Sharpness?
Kaiyue Wen
Tengyu Ma
Zhiyuan Li
AAML
23
47
0
10 Nov 2022
Do highly over-parameterized neural networks generalize since bad solutions are rare?
Julius Martinetz
T. Martinetz
30
1
0
07 Nov 2022
Biased Self-supervised learning for ASR
Florian Kreyssig
Yangyang Shi
Jinxi Guo
Leda Sari
Abdel-rahman Mohamed
P. Woodland
SSL
30
2
0
04 Nov 2022
Private Semi-supervised Knowledge Transfer for Deep Learning from Noisy Labels
Qiuchen Zhang
Jing Ma
Jian Lou
Li Xiong
Xiaoqian Jiang
NoLa
21
0
0
03 Nov 2022
Instance-Dependent Generalization Bounds via Optimal Transport
Songyan Hou
Parnian Kassraie
Anastasis Kratsios
Andreas Krause
Jonas Rothfuss
22
6
0
02 Nov 2022
Discriminative Speaker Representation via Contrastive Learning with Class-Aware Attention in Angular Space
Zhe Li
Man-Wai Mak
Helen M. Meng
34
9
0
29 Oct 2022
A Functional-Space Mean-Field Theory of Partially-Trained Three-Layer Neural Networks
Zhengdao Chen
Eric Vanden-Eijnden
Joan Bruna
MLT
27
5
0
28 Oct 2022
Noise Injection Node Regularization for Robust Learning
N. Levi
I. Bloch
M. Freytsis
T. Volansky
AI4CE
32
2
0
27 Oct 2022
Bridging the visual gap in VLN via semantically richer instructions
Joaquín Ossandón
Benjamín Earle
Alvaro Soto
37
0
0
27 Oct 2022
The Curious Case of Benign Memorization
Sotiris Anagnostidis
Gregor Bachmann
Lorenzo Noci
Thomas Hofmann
AAML
54
8
0
25 Oct 2022
Noise Injection as a Probe of Deep Learning Dynamics
Noam Levi
I. Bloch
M. Freytsis
T. Volansky
42
2
0
24 Oct 2022
A PAC-Bayesian Generalization Bound for Equivariant Networks
Arash Behboodi
Gabriele Cesa
Taco S. Cohen
58
17
0
24 Oct 2022
Revisiting Sparse Convolutional Model for Visual Recognition
Xili Dai
Mingyang Li
Pengyuan Zhai
Shengbang Tong
Xingjian Gao
Shao-Lun Huang
Zhihui Zhu
Chong You
Yi Ma
FAtt
35
27
0
24 Oct 2022
A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models
Lijia Zhou
Frederic Koehler
Pragya Sur
Danica J. Sutherland
Nathan Srebro
83
9
0
21 Oct 2022
Optimisation & Generalisation in Networks of Neurons
Jeremy Bernstein
AI4CE
24
2
0
18 Oct 2022
Dimensionality of datasets in object detection networks
Ajay Chawda
A. Vierling
Karsten Berns
3DPC
10
0
0
13 Oct 2022
SGD with Large Step Sizes Learns Sparse Features
Maksym Andriushchenko
Aditya Varre
Loucas Pillaud-Vivien
Nicolas Flammarion
45
56
0
11 Oct 2022
Rediscovery of Numerical Lüscher's Formula from the Neural Network
Yu Lu
Yijia Wang
YingChun Chen
Jia-Jun Wu
23
1
0
05 Oct 2022
Block-wise Training of Residual Networks via the Minimizing Movement Scheme
Skander Karkar
Ibrahim Ayed
Emmanuel de Bézenac
Patrick Gallinari
33
1
0
03 Oct 2022
The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels
Daniel Shwartz
Uri Stern
D. Weinshall
NoLa
36
2
0
02 Oct 2022
On the Impossible Safety of Large AI Models
El-Mahdi El-Mhamdi
Sadegh Farhadkhani
R. Guerraoui
Nirupam Gupta
L. Hoang
Rafael Pinot
Sébastien Rouault
John Stephan
37
31
0
30 Sep 2022
Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel
Sungyub Kim
Si-hun Park
Kyungsu Kim
Eunho Yang
BDL
32
4
0
30 Sep 2022
On the Robustness of Random Forest Against Untargeted Data Poisoning: An Ensemble-Based Approach
M. Anisetti
C. Ardagna
Alessandro Balestrucci
Nicola Bena
Ernesto Damiani
C. Yeun
AAML
OOD
34
10
0
28 Sep 2022
Why neural networks find simple solutions: the many regularizers of geometric complexity
Benoit Dherin
Michael Munn
M. Rosca
David Barrett
57
31
0
27 Sep 2022
Deep Double Descent via Smooth Interpolation
Matteo Gamba
Erik Englesson
Mårten Björkman
Hossein Azizpour
63
11
0
21 Sep 2022
Deep Linear Networks can Benignly Overfit when Shallow Ones Do
Niladri S. Chatterji
Philip M. Long
23
8
0
19 Sep 2022
Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold
Can Yaras
Peng Wang
Zhihui Zhu
Laura Balzano
Qing Qu
25
42
0
19 Sep 2022
Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty
Thomas George
Guillaume Lajoie
A. Baratin
34
5
0
19 Sep 2022
Generalization Bounds for Deep Transfer Learning Using Majority Predictor Accuracy
Cuong N.Nguyen
L. Ho
Vu C. Dinh
Tal Hassner
Cuong V.Nguyen
19
4
0
13 Sep 2022
Black-Box Audits for Group Distribution Shifts
Marc Juárez
Samuel Yeom
Matt Fredrikson
MLAU
27
4
0
08 Sep 2022
Data-Driven Target Localization Using Adaptive Radar Processing and Convolutional Neural Networks
Shyam Venkatasubramanian
S. Gogineni
Bosung Kang
Ali Pezeshki
M. Rangaswamy
Vahid Tarokh
44
3
0
07 Sep 2022
Generalisation under gradient descent via deterministic PAC-Bayes
Eugenio Clerico
Tyler Farghly
George Deligiannidis
Benjamin Guedj
Arnaud Doucet
33
4
0
06 Sep 2022
Data Provenance via Differential Auditing
Xin Mu
Ming Pang
Feida Zhu
19
1
0
04 Sep 2022
Instance-Dependent Noisy Label Learning via Graphical Modelling
Arpit Garg
Cuong C. Nguyen
Rafael Felix
Thanh-Toan Do
G. Carneiro
NoLa
39
27
0
02 Sep 2022
PanorAMS: Automatic Annotation for Detecting Objects in Urban Context
Inske Groenen
S. Rudinac
M. Worring
21
4
0
30 Aug 2022
Learning from Noisy Labels with Coarse-to-Fine Sample Credibility Modeling
Boshen Zhang
Yuxi Li
Yuanpeng Tu
Jinlong Peng
Yabiao Wang
Cunlin Wu
Yanghua Xiao
Cairong Zhao
NoLa
38
6
0
23 Aug 2022
Intersection of Parallels as an Early Stopping Criterion
Ali Vardasbi
Maarten de Rijke
Mostafa Dehghani
MoMe
41
5
0
19 Aug 2022
Universal Solutions of Feedforward ReLU Networks for Interpolations
Changcun Huang
20
2
0
16 Aug 2022
Do Quantum Circuit Born Machines Generalize?
Kaitlin Gili
Mohamed Hibat-Allah
M. Mauri
C. Ballance
A. Perdomo-Ortiz
33
29
0
27 Jul 2022
Learning from Data with Noisy Labels Using Temporal Self-Ensemble
Jun Ho Lee
J. Baik
Taebaek Hwang
J. Choi
NoLa
28
1
0
21 Jul 2022
Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting
Neil Rohit Mallinar
James B. Simon
Amirhesam Abedsoltan
Parthe Pandit
M. Belkin
Preetum Nakkiran
26
37
0
14 Jul 2022
PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners
Anthony Sicilia
Katherine Atwell
Malihe Alikhani
Seong Jae Hwang
BDL
56
9
0
12 Jul 2022
Previous
1
2
3
4
5
...
19
20
21
Next