ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1611.03530
  4. Cited By
Understanding deep learning requires rethinking generalization

Understanding deep learning requires rethinking generalization

10 November 2016
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
    HAI
ArXivPDFHTML

Papers citing "Understanding deep learning requires rethinking generalization"

50 / 1,059 papers shown
Title
Non-Vacuous Generalization Bounds at the ImageNet Scale: A PAC-Bayesian
  Compression Approach
Non-Vacuous Generalization Bounds at the ImageNet Scale: A PAC-Bayesian Compression Approach
Wenda Zhou
Victor Veitch
Morgane Austern
Ryan P. Adams
Peter Orbanz
44
211
0
16 Apr 2018
Data-Dependent Coresets for Compressing Neural Networks with
  Applications to Generalization Bounds
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
Cenk Baykal
Lucas Liebenwein
Igor Gilitschenski
Dan Feldman
Daniela Rus
25
79
0
15 Apr 2018
Analysis on the Nonlinear Dynamics of Deep Neural Networks: Topological
  Entropy and Chaos
Analysis on the Nonlinear Dynamics of Deep Neural Networks: Topological Entropy and Chaos
Husheng Li
25
11
0
03 Apr 2018
Joint Optimization Framework for Learning with Noisy Labels
Joint Optimization Framework for Learning with Noisy Labels
Daiki Tanaka
Daiki Ikami
T. Yamasaki
Kiyoharu Aizawa
NoLa
39
704
0
30 Mar 2018
Learning to Reweight Examples for Robust Deep Learning
Learning to Reweight Examples for Robust Deep Learning
Mengye Ren
Wenyuan Zeng
Binh Yang
R. Urtasun
OOD
NoLa
69
1,411
0
24 Mar 2018
Technical Report: When Does Machine Learning FAIL? Generalized
  Transferability for Evasion and Poisoning Attacks
Technical Report: When Does Machine Learning FAIL? Generalized Transferability for Evasion and Poisoning Attacks
Octavian Suciu
R. Marginean
Yigitcan Kaya
Hal Daumé
Tudor Dumitras
AAML
40
286
0
19 Mar 2018
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
Marco Baity-Jesi
Levent Sagun
Mario Geiger
S. Spigler
Gerard Ben Arous
C. Cammarota
Yann LeCun
M. Wyart
Giulio Biroli
AI4CE
42
113
0
19 Mar 2018
On the importance of single directions for generalization
On the importance of single directions for generalization
Ari S. Morcos
David Barrett
Neil C. Rabinowitz
M. Botvinick
27
329
0
19 Mar 2018
Deep Component Analysis via Alternating Direction Neural Networks
Deep Component Analysis via Alternating Direction Neural Networks
Calvin Murdock
Ming-Fang Chang
Simon Lucey
BDL
27
20
0
16 Mar 2018
A Kernel Theory of Modern Data Augmentation
A Kernel Theory of Modern Data Augmentation
Tri Dao
Albert Gu
Alexander J. Ratner
Virginia Smith
Christopher De Sa
Christopher Ré
27
190
0
16 Mar 2018
How to Start Training: The Effect of Initialization and Architecture
How to Start Training: The Effect of Initialization and Architecture
Boris Hanin
David Rolnick
19
253
0
05 Mar 2018
On the Power of Over-parametrization in Neural Networks with Quadratic
  Activation
On the Power of Over-parametrization in Neural Networks with Quadratic Activation
S. Du
Jason D. Lee
27
267
0
03 Mar 2018
Essentially No Barriers in Neural Network Energy Landscape
Essentially No Barriers in Neural Network Energy Landscape
Felix Dräxler
K. Veschgini
M. Salmhofer
Fred Hamprecht
MoMe
22
424
0
02 Mar 2018
Var-CNN: A Data-Efficient Website Fingerprinting Attack Based on Deep
  Learning
Var-CNN: A Data-Efficient Website Fingerprinting Attack Based on Deep Learning
Sanjit Bhat
David Lu
Albert Kwon
S. Devadas
AAML
21
190
0
28 Feb 2018
Learning Representations for Neural Network-Based Classification Using
  the Information Bottleneck Principle
Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle
Rana Ali Amjad
Bernhard C. Geiger
35
196
0
27 Feb 2018
Shampoo: Preconditioned Stochastic Tensor Optimization
Shampoo: Preconditioned Stochastic Tensor Optimization
Vineet Gupta
Tomer Koren
Y. Singer
ODL
17
201
0
26 Feb 2018
Scalable Private Learning with PATE
Scalable Private Learning with PATE
Nicolas Papernot
Shuang Song
Ilya Mironov
A. Raghunathan
Kunal Talwar
Ulfar Erlingsson
41
607
0
24 Feb 2018
A Walk with SGD
A Walk with SGD
Chen Xing
Devansh Arpit
Christos Tsirigotis
Yoshua Bengio
27
118
0
24 Feb 2018
Deep learning algorithm for data-driven simulation of noisy dynamical
  system
Deep learning algorithm for data-driven simulation of noisy dynamical system
K. Yeo
Igor Melnyk
AI4TS
29
93
0
22 Feb 2018
Characterizing Implicit Bias in Terms of Optimization Geometry
Characterizing Implicit Bias in Terms of Optimization Geometry
Suriya Gunasekar
Jason D. Lee
Daniel Soudry
Nathan Srebro
AI4CE
46
399
0
22 Feb 2018
The Secret Sharer: Evaluating and Testing Unintended Memorization in
  Neural Networks
The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks
Nicholas Carlini
Chang-rui Liu
Ulfar Erlingsson
Jernej Kos
D. Song
89
1,117
0
22 Feb 2018
L2-Nonexpansive Neural Networks
L2-Nonexpansive Neural Networks
Haifeng Qian
M. Wegman
25
74
0
22 Feb 2018
The Description Length of Deep Learning Models
The Description Length of Deep Learning Models
Léonard Blier
Yann Ollivier
32
97
0
20 Feb 2018
Do deep nets really need weight decay and dropout?
Do deep nets really need weight decay and dropout?
Alex Hernández-García
Peter König
22
27
0
20 Feb 2018
The Role of Information Complexity and Randomization in Representation
  Learning
The Role of Information Complexity and Randomization in Representation Learning
Matías Vera
Pablo Piantanida
L. Rey Vega
45
14
0
14 Feb 2018
Stronger generalization bounds for deep nets via a compression approach
Stronger generalization bounds for deep nets via a compression approach
Sanjeev Arora
Rong Ge
Behnam Neyshabur
Yi Zhang
MLT
AI4CE
41
631
0
14 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
41
389
0
13 Feb 2018
Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks
  by Backdooring
Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring
Yossi Adi
Carsten Baum
Moustapha Cissé
Benny Pinkas
Joseph Keshet
18
669
0
13 Feb 2018
Deep Neural Networks Learn Non-Smooth Functions Effectively
Deep Neural Networks Learn Non-Smooth Functions Effectively
Masaaki Imaizumi
Kenji Fukumizu
26
123
0
13 Feb 2018
Towards Understanding the Generalization Bias of Two Layer Convolutional
  Linear Classifiers with Gradient Descent
Towards Understanding the Generalization Bias of Two Layer Convolutional Linear Classifiers with Gradient Descent
Yifan Wu
Barnabás Póczós
Aarti Singh
MLT
30
8
0
13 Feb 2018
Learning Compact Neural Networks with Regularization
Learning Compact Neural Networks with Regularization
Samet Oymak
MLT
41
39
0
05 Feb 2018
Semi-Supervised Convolutional Neural Networks for Human Activity
  Recognition
Semi-Supervised Convolutional Neural Networks for Human Activity Recognition
Mingzhi Zeng
Tong Yu
Tianlin Li
Le T. Nguyen
Ole J. Mengshoel
Ian Lane
SSL
HAI
11
62
0
22 Jan 2018
Faster gaze prediction with dense networks and Fisher pruning
Faster gaze prediction with dense networks and Fisher pruning
Lucas Theis
I. Korshunova
Alykhan Tejani
Ferenc Huszár
34
204
0
17 Jan 2018
Fix your classifier: the marginal value of training the last weight
  layer
Fix your classifier: the marginal value of training the last weight layer
Elad Hoffer
Itay Hubara
Daniel Soudry
35
101
0
14 Jan 2018
Approximation beats concentration? An approximation view on inference
  with smooth radial kernels
Approximation beats concentration? An approximation view on inference with smooth radial kernels
M. Belkin
39
69
0
10 Jan 2018
Boundary Optimizing Network (BON)
Boundary Optimizing Network (BON)
Marco Singh
A. Pai
30
0
0
08 Jan 2018
Theory of Deep Learning IIb: Optimization Properties of SGD
Theory of Deep Learning IIb: Optimization Properties of SGD
Chiyuan Zhang
Q. Liao
Alexander Rakhlin
Brando Miranda
Noah Golowich
T. Poggio
ODL
28
71
0
07 Jan 2018
Visualizing the Loss Landscape of Neural Nets
Visualizing the Loss Landscape of Neural Nets
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
111
1,850
0
28 Dec 2017
Entropy-SGD optimizes the prior of a PAC-Bayes bound: Generalization
  properties of Entropy-SGD and data-dependent priors
Entropy-SGD optimizes the prior of a PAC-Bayes bound: Generalization properties of Entropy-SGD and data-dependent priors
Gintare Karolina Dziugaite
Daniel M. Roy
MLT
30
144
0
26 Dec 2017
Spurious Local Minima are Common in Two-Layer ReLU Neural Networks
Spurious Local Minima are Common in Two-Layer ReLU Neural Networks
Itay Safran
Ohad Shamir
40
262
0
24 Dec 2017
Improving Generalization Performance by Switching from Adam to SGD
Improving Generalization Performance by Switching from Adam to SGD
N. Keskar
R. Socher
ODL
41
521
0
20 Dec 2017
Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
Xinyun Chen
Chang-rui Liu
Bo-wen Li
Kimberly Lu
D. Song
AAML
SILM
44
1,808
0
15 Dec 2017
A trans-disciplinary review of deep learning research for water
  resources scientists
A trans-disciplinary review of deep learning research for water resources scientists
Chaopeng Shen
AI4CE
33
682
0
06 Dec 2017
Deep Learning Scaling is Predictable, Empirically
Deep Learning Scaling is Predictable, Empirically
Joel Hestness
Sharan Narang
Newsha Ardalani
G. Diamos
Heewoo Jun
Hassan Kianinejad
Md. Mostofa Ali Patwary
Yang Yang
Yanqi Zhou
63
716
0
01 Dec 2017
Providing theoretical learning guarantees to Deep Learning Networks
Providing theoretical learning guarantees to Deep Learning Networks
R. Mello
M. D. Ferreira
M. Ponti
28
6
0
28 Nov 2017
Invariance of Weight Distributions in Rectified MLPs
Invariance of Weight Distributions in Rectified MLPs
Russell Tsuchida
Farbod Roosta-Khorasani
M. Gallagher
MLT
32
35
0
24 Nov 2017
Sparse-Input Neural Networks for High-dimensional Nonparametric
  Regression and Classification
Sparse-Input Neural Networks for High-dimensional Nonparametric Regression and Classification
Jean Feng
N. Simon
24
99
0
21 Nov 2017
Performance Modeling and Evaluation of Distributed Deep Learning
  Frameworks on GPUs
Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs
S. Shi
Qiang-qiang Wang
Xiaowen Chu
37
110
0
16 Nov 2017
Interpreting Deep Visual Representations via Network Dissection
Interpreting Deep Visual Representations via Network Dissection
Bolei Zhou
David Bau
A. Oliva
Antonio Torralba
FAtt
MILM
29
323
0
15 Nov 2017
Three Factors Influencing Minima in SGD
Three Factors Influencing Minima in SGD
Stanislaw Jastrzebski
Zachary Kenton
Devansh Arpit
Nicolas Ballas
Asja Fischer
Yoshua Bengio
Amos Storkey
42
457
0
13 Nov 2017
Previous
123...19202122
Next