ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.05468
  4. Cited By
Generalization in Deep Learning

Generalization in Deep Learning

16 October 2017
Kenji Kawaguchi
L. Kaelbling
Yoshua Bengio
    ODL
ArXivPDFHTML

Papers citing "Generalization in Deep Learning"

38 / 38 papers shown
Title
DGSAM: Domain Generalization via Individual Sharpness-Aware Minimization
DGSAM: Domain Generalization via Individual Sharpness-Aware Minimization
Youngjun Song
Youngsik Hwang
Jonghun Lee
Heechang Lee
Dong-Young Lim
AAML
79
0
0
30 Mar 2025
Information-Theoretic Generalization Bounds for Deep Neural Networks
Information-Theoretic Generalization Bounds for Deep Neural Networks
Haiyun He
Christina Lee Yu
75
5
0
04 Apr 2024
Investigating Generalization Behaviours of Generative Flow Networks
Investigating Generalization Behaviours of Generative Flow Networks
Lazar Atanackovic
Emmanuel Bengio
AI4CE
51
3
0
07 Feb 2024
A trans-disciplinary review of deep learning research for water
  resources scientists
A trans-disciplinary review of deep learning research for water resources scientists
Chaopeng Shen
AI4CE
129
687
0
06 Dec 2017
Global optimality conditions for deep neural networks
Global optimality conditions for deep neural networks
Chulhee Yun
S. Sra
Ali Jadbabaie
134
118
0
08 Jul 2017
Towards Understanding Generalization of Deep Learning: Perspective of
  Loss Landscapes
Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes
Lei Wu
Zhanxing Zhu
E. Weinan
ODL
50
220
0
30 Jun 2017
Exploring Generalization in Deep Learning
Exploring Generalization in Deep Learning
Behnam Neyshabur
Srinadh Bhojanapalli
David A. McAllester
Nathan Srebro
FAtt
132
1,245
0
27 Jun 2017
A Closer Look at Memorization in Deep Networks
A Closer Look at Memorization in Deep Networks
Devansh Arpit
Stanislaw Jastrzebski
Nicolas Ballas
David M. Krueger
Emmanuel Bengio
...
Tegan Maharaj
Asja Fischer
Aaron Courville
Yoshua Bengio
Simon Lacoste-Julien
TDI
95
1,801
0
16 Jun 2017
Train longer, generalize better: closing the generalization gap in large
  batch training of neural networks
Train longer, generalize better: closing the generalization gap in large batch training of neural networks
Elad Hoffer
Itay Hubara
Daniel Soudry
ODL
142
799
0
24 May 2017
Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural
  Networks with Many More Parameters than Training Data
Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data
Gintare Karolina Dziugaite
Daniel M. Roy
77
808
0
31 Mar 2017
Sharp Minima Can Generalize For Deep Nets
Sharp Minima Can Generalize For Deep Nets
Laurent Dinh
Razvan Pascanu
Samy Bengio
Yoshua Bengio
ODL
98
766
0
15 Mar 2017
Data-Dependent Stability of Stochastic Gradient Descent
Data-Dependent Stability of Stochastic Gradient Descent
Ilja Kuzborskij
Christoph H. Lampert
MLT
91
165
0
05 Mar 2017
Deep Semi-Random Features for Nonlinear Function Approximation
Deep Semi-Random Features for Nonlinear Function Approximation
Kenji Kawaguchi
Bo Xie
Vikas Verma
Le Song
108
15
0
28 Feb 2017
Fast Rates for Empirical Risk Minimization of Strict Saddle Problems
Fast Rates for Empirical Risk Minimization of Strict Saddle Problems
Alon Gonen
Shai Shalev-Shwartz
60
30
0
16 Jan 2017
Aggregated Residual Transformations for Deep Neural Networks
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
428
10,281
0
16 Nov 2016
Understanding deep learning requires rethinking generalization
Understanding deep learning requires rethinking generalization
Chiyuan Zhang
Samy Bengio
Moritz Hardt
Benjamin Recht
Oriol Vinyals
HAI
264
4,612
0
10 Nov 2016
Why and When Can Deep -- but Not Shallow -- Networks Avoid the Curse of
  Dimensionality: a Review
Why and When Can Deep -- but Not Shallow -- Networks Avoid the Curse of Dimensionality: a Review
T. Poggio
H. Mhaskar
Lorenzo Rosasco
Brando Miranda
Q. Liao
66
575
0
02 Nov 2016
Generalization Error of Invariant Classifiers
Generalization Error of Invariant Classifiers
Jure Sokolić
Raja Giryes
Guillermo Sapiro
M. Rodrigues
35
78
0
14 Oct 2016
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp
  Minima
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
355
2,922
0
15 Sep 2016
Robust Large Margin Deep Neural Networks
Robust Large Margin Deep Neural Networks
Jure Sokolić
Raja Giryes
Guillermo Sapiro
M. Rodrigues
57
309
0
26 May 2016
Unsupervised Learning for Physical Interaction through Video Prediction
Unsupervised Learning for Physical Interaction through Video Prediction
Chelsea Finn
Ian Goodfellow
Sergey Levine
51
1,042
0
23 May 2016
Deep Learning without Poor Local Minima
Deep Learning without Poor Local Minima
Kenji Kawaguchi
ODL
149
922
0
23 May 2016
Bounded Optimal Exploration in MDP
Bounded Optimal Exploration in MDP
Kenji Kawaguchi
33
15
0
05 Apr 2016
Bayesian Optimization with Exponential Convergence
Bayesian Optimization with Exponential Convergence
Kenji Kawaguchi
L. Kaelbling
Tomás Lozano-Pérez
49
106
0
05 Apr 2016
Identity Mappings in Deep Residual Networks
Identity Mappings in Deep Residual Networks
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
281
10,149
0
16 Mar 2016
Benefits of depth in neural networks
Benefits of depth in neural networks
Matus Telgarsky
290
605
0
14 Feb 2016
On the Generalization Error Bounds of Neural Networks under
  Diversity-Inducing Mutual Angular Regularization
On the Generalization Error Bounds of Neural Networks under Diversity-Inducing Mutual Angular Regularization
P. Xie
Yuntian Deng
Eric Xing
83
28
0
23 Nov 2015
Train faster, generalize better: Stability of stochastic gradient
  descent
Train faster, generalize better: Stability of stochastic gradient descent
Moritz Hardt
Benjamin Recht
Y. Singer
91
1,234
0
03 Sep 2015
Path-SGD: Path-Normalized Optimization in Deep Neural Networks
Path-SGD: Path-Normalized Optimization in Deep Neural Networks
Behnam Neyshabur
Ruslan Salakhutdinov
Nathan Srebro
ODL
52
305
0
08 Jun 2015
APAC: Augmented PAttern Classification with Neural Networks
APAC: Augmented PAttern Classification with Neural Networks
Ikuro Sato
Hiroki Nishimura
Kensuke Yokoi
36
137
0
13 May 2015
Norm-Based Capacity Control in Neural Networks
Norm-Based Capacity Control in Neural Networks
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
220
583
0
27 Feb 2015
An Introduction to Matrix Concentration Inequalities
An Introduction to Matrix Concentration Inequalities
J. Tropp
70
1,139
0
07 Jan 2015
Fractional Max-Pooling
Fractional Max-Pooling
Benjamin Graham
TPM
65
517
0
18 Dec 2014
The Loss Surfaces of Multilayer Networks
The Loss Surfaces of Multilayer Networks
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
225
1,191
0
30 Nov 2014
On the Computational Efficiency of Training Neural Networks
On the Computational Efficiency of Training Neural Networks
Roi Livni
Shai Shalev-Shwartz
Ohad Shamir
68
479
0
05 Oct 2014
On the Number of Linear Regions of Deep Neural Networks
On the Number of Linear Regions of Deep Neural Networks
Guido Montúfar
Razvan Pascanu
Kyunghyun Cho
Yoshua Bengio
72
1,249
0
08 Feb 2014
On the number of response regions of deep feed forward networks with
  piece-wise linear activations
On the number of response regions of deep feed forward networks with piece-wise linear activations
Razvan Pascanu
Guido Montúfar
Yoshua Bengio
FAtt
94
257
0
20 Dec 2013
Robustness and Generalization
Robustness and Generalization
Huan Xu
Shie Mannor
OOD
158
459
0
13 May 2010
1