1611.03530
Understanding deep learning requires rethinking generalization
10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
Papers citing
"Understanding deep learning requires rethinking generalization"
50 / 1,190 papers shown
Title
Learning with Noisy Labels for Sentence-level Sentiment Classification
Hao Wang
Bing-Quan Liu
Chaozhuo Li
Yan Yang
Tianrui Li
NoLa
34
26
0
31 Aug 2019
Image Captioning with Sparse Recurrent Neural Network
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
34
6
0
28 Aug 2019
Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation
Nima Tajbakhsh
Laura Jeyaseelan
Q. Li
J. Chiang
Zhihao Wu
Xiaowei Ding
46
756
0
27 Aug 2019
Transferability and Hardness of Supervised Classification Tasks
Anh Tran
Cuong V Nguyen
Tal Hassner
139
164
0
21 Aug 2019
Symmetric Cross Entropy for Robust Learning with Noisy Labels
Yisen Wang
Xingjun Ma
Zaiyi Chen
Yuan Luo
Jinfeng Yi
James Bailey
NoLa
39
883
0
16 Aug 2019
Needles in Haystacks: On Classifying Tiny Objects in Large Images
Nick Pawlowski
Suvrat Bhooshan
Nicolas Ballas
F. Ciompi
Ben Glocker
M. Drozdzal
27
22
0
16 Aug 2019
Regularizing CNN Transfer Learning with Randomised Regression
Yang Zhong
A. Maki
29
13
0
16 Aug 2019
The generalization error of random features regression: Precise asymptotics and double descent curve
Song Mei
Andrea Montanari
62
629
0
14 Aug 2019
Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy
Qing Yu
Kiyoharu Aizawa
OODD
19
166
0
14 Aug 2019
Adaptive Ensemble of Classifiers with Regularization for Imbalanced Data Classification
Chen Wang
Qin Yu
Kai Zhou
D. Hui
Xiaofeng Gong
Ruisen Luo
34
22
0
09 Aug 2019
Convergence Rates of Variational Inference in Sparse Deep Learning
Badr-Eddine Chérief-Abdellatif
BDL
21
38
0
09 Aug 2019
Visualizing the PHATE of Neural Networks
Scott A. Gigante
Adam S. Charles
Smita Krishnaswamy
Gal Mishne
38
37
0
07 Aug 2019
How Does Learning Rate Decay Help Modern Neural Networks?
Kaichao You
Mingsheng Long
Jianmin Wang
Michael I. Jordan
39
4
0
05 Aug 2019
Nonparametric Regression on Low-Dimensional Manifolds using Deep ReLU Networks : Function Approximation and Statistical Recovery
Minshuo Chen
Haoming Jiang
Wenjing Liao
T. Zhao
13
90
0
05 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
21
66
0
02 Aug 2019
Self-training with progressive augmentation for unsupervised cross-domain person re-identification
Xinyu Zhang
Jiewei Cao
Chunhua Shen
Mingyu You
LRM
41
223
0
31 Jul 2019
Pick-and-Learn: Automatic Quality Evaluation for Noisy-Labeled Image Segmentation
Haidong Zhu
Jialin Shi
Ji Wu
NoLa
27
65
0
27 Jul 2019
A Frobenius norm regularization method for convolutional kernels to avoid unstable gradient problem
Pei-Chang Guo
38
5
0
25 Jul 2019
Hessian based analysis of SGD for Deep Nets: Dynamics and Generalization
Xinyan Li
Qilong Gu
Yingxue Zhou
Tiancong Chen
A. Banerjee
ODL
47
51
0
24 Jul 2019
Sparsely Activated Networks
Paschalis A. Bizopoulos
D. Koutsouris
19
11
0
12 Jul 2019
Residual Entropy
B. Rowe
28
7
0
08 Jul 2019
AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates
Ning Liu
Xiaolong Ma
Zhiyuan Xu
Yanzhi Wang
Jian Tang
Jieping Ye
43
185
0
06 Jul 2019
Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape
Johanni Brea
Berfin Simsek
Bernd Illing
W. Gerstner
38
55
0
05 Jul 2019
Invariant Risk Minimization
Martín Arjovsky
Léon Bottou
Ishaan Gulrajani
David Lopez-Paz
OOD
116
2,177
0
05 Jul 2019
Deep Gamblers: Learning to Abstain with Portfolio Theory
Liu Ziyin
Zhikang T. Wang
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Masahito Ueda
40
111
0
29 Jun 2019
High-Dimensional Optimization in Adaptive Random Subspaces
Jonathan Lacotte
Mert Pilanci
Marco Pavone
31
16
0
27 Jun 2019
Benign Overfitting in Linear Regression
Peter L. Bartlett
Philip M. Long
Gábor Lugosi
Alexander Tsigler
MLT
24
764
0
26 Jun 2019
Invariance-inducing regularization using worst-case transformations suffices to boost accuracy and spatial robustness
Fanny Yang
Zuowen Wang
C. Heinze-Deml
30
42
0
26 Jun 2019
Further advantages of data augmentation on convolutional neural networks
Alex Hernández-García
Peter König
22
107
0
26 Jun 2019
Importance Estimation for Neural Network Pruning
Pavlo Molchanov
Arun Mallya
Stephen Tyree
I. Frosio
Jan Kautz
3DPC
42
861
0
25 Jun 2019
On the Noisy Gradient Descent that Generalizes as SGD
Jingfeng Wu
Wenqing Hu
Haoyi Xiong
Jun Huan
Vladimir Braverman
Zhanxing Zhu
MLT
29
10
0
18 Jun 2019
Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias
Stéphane d'Ascoli
Levent Sagun
Joan Bruna
Giulio Biroli
25
36
0
16 Jun 2019
Empirical study of extreme overfitting points of neural networks
D. Merkulov
Ivan Oseledets
3DPC
24
7
0
14 Jun 2019
Effectiveness of Distillation Attack and Countermeasure on Neural Network Watermarking
Ziqi Yang
Hung Dang
E. Chang
AAML
27
34
0
14 Jun 2019
Gradient Descent Maximizes the Margin of Homogeneous Neural Networks
Kaifeng Lyu
Jian Li
52
326
0
13 Jun 2019
Generalization Guarantees for Neural Networks via Harnessing the Low-rank Structure of the Jacobian
Samet Oymak
Zalan Fabian
Mingchen Li
Mahdi Soltanolkotabi
MLT
28
89
0
12 Jun 2019
Does Learning Require Memorization? A Short Tale about a Long Tail
Vitaly Feldman
TDI
63
487
0
12 Jun 2019
Parameterized Structured Pruning for Deep Neural Networks
Günther Schindler
Wolfgang Roth
Franz Pernkopf
Holger Froening
26
6
0
12 Jun 2019
Characterizing the implicit bias via a primal-dual analysis
Ziwei Ji
Matus Telgarsky
18
19
0
11 Jun 2019
Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization
Navid Azizan
Sahin Lale
B. Hassibi
30
71
0
10 Jun 2019
Learning to Segment Skin Lesions from Noisy Annotations
Z. Mirikharaji
Yiqi Yan
Ghassan Hamarneh
38
77
0
10 Jun 2019
Understanding overfitting peaks in generalization error: Analytical risk curves for $l_2$ and $l_1$ penalized interpolation
P. Mitra
26
50
0
09 Jun 2019
The Implicit Bias of AdaGrad on Separable Data
Qian Qian
Xiaoyuan Qian
37
23
0
09 Jun 2019
Audio tagging with noisy labels and minimal supervision
Eduardo Fonseca
Manoj Plakal
F. Font
D. Ellis
Xavier Serra
25
92
0
07 Jun 2019
Disentangling neural mechanisms for perceptual grouping
Junkyung Kim
Drew Linsley
Kalpit C. Thakkar
Thomas Serre
OCL
43
54
0
04 Jun 2019
Dimensionality compression and expansion in Deep Neural Networks
Stefano Recanatesi
M. Farrell
Madhu S. Advani
Timothy Moore
Guillaume Lajoie
E. Shea-Brown
31
73
0
02 Jun 2019
The Principle of Unchanged Optimality in Reinforcement Learning Generalization
A. Irpan
Xingyou Song
OffRL
41
7
0
02 Jun 2019
Are Anchor Points Really Indispensable in Label-Noise Learning?
Xiaobo Xia
Tongliang Liu
N. Wang
Bo Han
Chen Gong
Gang Niu
Masashi Sugiyama
NoLa
38
373
0
01 Jun 2019
Implicit Regularization in Deep Matrix Factorization
Sanjeev Arora
Nadav Cohen
Wei Hu
Yuping Luo
AI4CE
52
493
0
31 May 2019
On Network Design Spaces for Visual Recognition
Ilija Radosavovic
Justin Johnson
Saining Xie
Wan-Yen Lo
Piotr Dollár
27
135
0
30 May 2019