Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.03530
Cited By
Understanding deep learning requires rethinking generalization
10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Understanding deep learning requires rethinking generalization"
50 / 1,110 papers shown
Title
A comprehensive study on the prediction reliability of graph neural networks for virtual screening
Soojung Yang
K. Lee
Seongok Ryu
26
7
0
17 Mar 2020
What Information Does a ResNet Compress?
L. N. Darlow
Amos Storkey
SSL
32
11
0
13 Mar 2020
Analyzing Visual Representations in Embodied Navigation Tasks
Erik Wijmans
Julian Straub
Dhruv Batra
Irfan Essa
Judy Hoffman
Ari S. Morcos
21
2
0
12 Mar 2020
SASL: Saliency-Adaptive Sparsity Learning for Neural Network Acceleration
Jun Shi
Jianfeng Xu
K. Tasaka
Zhibo Chen
8
25
0
12 Mar 2020
A Mean-field Analysis of Deep ResNet and Beyond: Towards Provable Optimization Via Overparameterization From Depth
Yiping Lu
Chao Ma
Yulong Lu
Jianfeng Lu
Lexing Ying
MLT
44
78
0
11 Mar 2020
SuperMix: Supervising the Mixing Data Augmentation
Ali Dabouei
Sobhan Soleymani
Fariborz Taherkhani
Nasser M. Nasrabadi
24
98
0
10 Mar 2020
AL2: Progressive Activation Loss for Learning General Representations in Classification Neural Networks
Majed El Helou
Frederike Dumbgen
Sabine Süsstrunk
CLL
AI4CE
30
2
0
07 Mar 2020
The Variational InfoMax Learning Objective
Vincenzo Crescimanna
Bruce P. Graham
23
0
0
07 Mar 2020
Combating noisy labels by agreement: A joint training method with co-regularization
Hongxin Wei
Lei Feng
Xiangyu Chen
Bo An
NoLa
319
501
0
05 Mar 2020
Analyzing Accuracy Loss in Randomized Smoothing Defenses
Yue Gao
Harrison Rosenberg
Kassem Fawaz
S. Jha
Justin Hsu
AAML
24
6
0
03 Mar 2020
Towards Noise-resistant Object Detection with Noisy Annotations
Junnan Li
Caiming Xiong
R. Socher
Guosheng Lin
ObjD
NoLa
70
29
0
03 Mar 2020
Iterative Averaging in the Quest for Best Test Error
Diego Granziol
Xingchen Wan
Samuel Albanie
Stephen J. Roberts
29
3
0
02 Mar 2020
Double Trouble in Double Descent : Bias and Variance(s) in the Lazy Regime
Stéphane dÁscoli
Maria Refinetti
Giulio Biroli
Florent Krzakala
100
152
0
02 Mar 2020
Out-of-Distribution Generalization via Risk Extrapolation (REx)
David M. Krueger
Ethan Caballero
J. Jacobsen
Amy Zhang
Jonathan Binas
Dinghuai Zhang
Rémi Le Priol
Aaron Courville
OOD
215
910
0
02 Mar 2020
Do CNNs Encode Data Augmentations?
Eddie Q. Yan
Yanping Huang
OOD
23
5
0
29 Feb 2020
Overfitting in adversarially robust deep learning
Leslie Rice
Eric Wong
Zico Kolter
47
788
0
26 Feb 2020
Predicting Neural Network Accuracy from Weights
Thomas Unterthiner
Daniel Keysers
Sylvain Gelly
Olivier Bousquet
Ilya O. Tolstikhin
30
101
0
26 Feb 2020
Understanding Self-Training for Gradual Domain Adaptation
Ananya Kumar
Tengyu Ma
Percy Liang
CLL
TTA
28
228
0
26 Feb 2020
Generalized Product Quantization Network for Semi-supervised Image Retrieval
Young Kyun Jang
N. Cho
26
38
0
26 Feb 2020
Convex Geometry and Duality of Over-parameterized Neural Networks
Tolga Ergen
Mert Pilanci
MLT
49
54
0
25 Feb 2020
On Feature Normalization and Data Augmentation
Boyi Li
Felix Wu
Ser-Nam Lim
Serge J. Belongie
Kilian Q. Weinberger
26
134
0
25 Feb 2020
Understanding and Mitigating the Tradeoff Between Robustness and Accuracy
Aditi Raghunathan
Sang Michael Xie
Fanny Yang
John C. Duchi
Percy Liang
AAML
51
224
0
25 Feb 2020
Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization
S. Chatterjee
ODL
OOD
13
51
0
25 Feb 2020
Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence
Nicolas Loizou
Sharan Vaswani
I. Laradji
Simon Lacoste-Julien
29
182
0
24 Feb 2020
The Early Phase of Neural Network Training
Jonathan Frankle
D. Schwab
Ari S. Morcos
23
173
0
24 Feb 2020
An Optimization and Generalization Analysis for Max-Pooling Networks
Alon Brutzkus
Amir Globerson
MLT
AI4CE
16
4
0
22 Feb 2020
Generalisation error in learning with random features and the hidden manifold model
Federica Gerace
Bruno Loureiro
Florent Krzakala
M. Mézard
Lenka Zdeborová
30
166
0
21 Feb 2020
Bayesian Deep Learning and a Probabilistic Perspective of Generalization
A. Wilson
Pavel Izmailov
UQCV
BDL
OOD
24
642
0
20 Feb 2020
Implicit Regularization of Random Feature Models
Arthur Jacot
Berfin Simsek
Francesco Spadaro
Clément Hongler
Franck Gabriel
36
82
0
19 Feb 2020
Identifying Critical Neurons in ANN Architectures using Mixed Integer Programming
M. Elaraby
Guy Wolf
Margarida Carvalho
26
5
0
17 Feb 2020
Learning Not to Learn in the Presence of Noisy Labels
Liu Ziyin
Blair Chen
Ru Wang
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Masahito Ueda
NoLa
26
18
0
16 Feb 2020
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks
Carlos Aspillaga
Andrés Carvallo
Vladimir Araujo
ELM
47
31
0
14 Feb 2020
Self-Distillation Amplifies Regularization in Hilbert Space
H. Mobahi
Mehrdad Farajtabar
Peter L. Bartlett
43
229
0
13 Feb 2020
The Conditional Entropy Bottleneck
Ian S. Fischer
OOD
29
117
0
13 Feb 2020
Topologically Densified Distributions
Christoph Hofer
Florian Graf
Marc Niethammer
Roland Kwitt
27
15
0
12 Feb 2020
Object Detection as a Positive-Unlabeled Problem
Yuewei Yang
Kevin J. Liang
Lawrence Carin
30
38
0
11 Feb 2020
Distribution Approximation and Statistical Estimation Guarantees of Generative Adversarial Networks
Minshuo Chen
Wenjing Liao
H. Zha
Tuo Zhao
28
15
0
10 Feb 2020
A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima
Zeke Xie
Issei Sato
Masashi Sugiyama
ODL
30
17
0
10 Feb 2020
Semi-Supervised Class Discovery
Jeremy Nixon
J. Liu
David Berthelot
20
2
0
10 Feb 2020
Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural Networks
Blake Bordelon
Abdulkadir Canatar
Cengiz Pehlevan
149
201
0
07 Feb 2020
A Precise High-Dimensional Asymptotic Theory for Boosting and Minimum-
ℓ
1
\ell_1
ℓ
1
-Norm Interpolated Classifiers
Tengyuan Liang
Pragya Sur
50
68
0
05 Feb 2020
TDEFSI: Theory Guided Deep Learning Based Epidemic Forecasting with Synthetic Information
Lijing Wang
Jiangzhuo Chen
Madhav Marathe
AI4TS
31
19
0
28 Jan 2020
QActor: On-line Active Learning for Noisy Labeled Stream Data
Taraneh Younesian
Zilong Zhao
Amirmasoud Ghiassi
Robert Birke
L. Chen
23
5
0
28 Jan 2020
How Much Position Information Do Convolutional Neural Networks Encode?
Md. Amirul Islam
Sen Jia
Neil D. B. Bruce
SSL
205
344
0
22 Jan 2020
Optimized Generic Feature Learning for Few-shot Classification across Domains
Tonmoy Saikia
Thomas Brox
Cordelia Schmid
VLM
30
48
0
22 Jan 2020
Memory capacity of neural networks with threshold and ReLU activations
Roman Vershynin
33
21
0
20 Jan 2020
Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network
Jungkyu Lee
Taeryun Won
Tae Kwan Lee
Hyemin Lee
Geonmo Gu
K. Hong
34
57
0
17 Jan 2020
Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study
Jinlan Fu
Pengfei Liu
Qi Zhang
Xuanjing Huang
AI4CE
35
73
0
12 Jan 2020
Confidence Scores Make Instance-dependent Label-noise Learning Possible
Antonin Berthon
Bo Han
Gang Niu
Tongliang Liu
Masashi Sugiyama
NoLa
49
105
0
11 Jan 2020
Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning
Han-Jia Ye
Hong-You Chen
De-Chuan Zhan
Wei-Lun Chao
39
99
0
06 Jan 2020
Previous
1
2
3
...
13
14
15
...
21
22
23
Next