1709.01953
Cited By
Implicit Regularization in Deep Learning
6 September 2017
Behnam Neyshabur
Papers citing
"Implicit Regularization in Deep Learning"
50 / 108 papers shown
Title
Stabilizing Policy Gradient Methods via Reward Profiling
Shihab Ahmed
El Houcine Bergou
A. Dutta
Yue Wang
164
0
0
20 Nov 2025
Deep Learning Inductive Biases for fMRI Time Series Classification during Resting-state and Movie-watching
Behdad Khodabandehloo
Reza Rajimehr
62
0
0
21 Sep 2025
Reason to Rote: Rethinking Memorization in Reasoning
Yupei Du
Philipp Mondorf
Silvia Casola
Yuekun Yao
Robert Litschko
Barbara Plank
148
0
0
07 Jul 2025
Variational Adaptive Noise and Dropout towards Stable Recurrent Neural Networks
Taisuke Kobayashi
Shingo Murata
145
0
0
02 Jun 2025
Identifying Key Challenges of Hardness-Based Resampling
Pawel Pukowski
Venet Osmani
237
0
0
09 Apr 2025
High-entropy Advantage in Neural Networks' Generalizability
Entao Yang
Wei Wei
Yue Shang
Ge Zhang
AI4CE
321
2
0
17 Mar 2025
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
Nir Ailon
Akhiad Bercovich
Yahel Uffenheimer
Omri Weinstein
377
3
0
15 Mar 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture
Yikun Hou
Suvrit Sra
A. Yurtsever
311
0
0
27 Jan 2025
ExpTest: Automating Learning Rate Searching and Tuning with Insights from Linearized Neural Networks
Zan Chaudhry
Naoko Mizuno
263
0
0
25 Nov 2024
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
International Conference on Learning Representations (ICLR), 2024
Thomas Robert
M. Safaryan
Ionut-Vlad Modoranu
Dan Alistarh
ODL
366
19
0
21 Oct 2024
A Theoretical Survey on Foundation Models
Shi Fu
Yuzhu Chen
Yingjie Wang
Dacheng Tao
247
1
0
15 Oct 2024
Low-Dimension-to-High-Dimension Generalization And Its Implications for Length Generalization
Yang Chen
Long Yang
Yitao Liang
Zhouchen Lin
298
2
0
11 Oct 2024
Input Space Mode Connectivity in Deep Neural Networks
International Conference on Learning Representations (ICLR), 2024
Jakub Vrabel
Ori Shem-Ur
Yaron Oz
David Krueger
302
1
0
09 Sep 2024
DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation
International Conference on Machine Learning (ICML), 2024
Qinshuo Liu
Zixin Wang
Xi-An Li
Xinyao Ji
Lei Zhang
Lin Liu
Zhonghua Liu
260
0
0
04 Aug 2024
A Margin-based Multiclass Generalization Bound via Geometric Complexity
Michael Munn
Benoit Dherin
Javier Gonzalvo
UQCV
206
2
0
28 May 2024
The Impact of Geometric Complexity on Neural Collapse in Transfer Learning
Michael Munn
Benoit Dherin
Javier Gonzalvo
AAML
239
5
0
24 May 2024
Improving Generalization of Deep Neural Networks by Optimum Shifting
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yuyan Zhou
Ye Li
Lei Feng
Sheng-Jun Huang
OOD
ODL
134
0
0
23 May 2024
A General Theory for Compositional Generalization
Jingwen Fu
Zhizheng Zhang
Yan Lu
Nanning Zheng
AI4CE
CoGe
190
2
0
20 May 2024
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Engineering applications of artificial intelligence (EAAI), 2024
Guoping Xu
Xiaxia Wang
Xinglong Wu
Xuesong Leng
Yongchao Xu
3DPC
195
32
0
02 May 2024
A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks
IEEE International Joint Conference on Neural Network (IJCNN), 2024
Neel Mishra
Bamdev Mishra
Pratik Jawanpuria
Pawan Kumar
GAN
177
1
0
10 Apr 2024
No Free Prune: Information-Theoretic Barriers to Pruning at Initialization
Tanishq Kumar
Kevin Luo
Mark Sellke
212
8
0
02 Feb 2024
A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models
Annual Review of Statistics and Its Application (ARSIA), 2024
Namjoon Suh
Guang Cheng
MedIm
281
17
0
14 Jan 2024
Interpretability Illusions in the Generalization of Simplified Models
Dan Friedman
Andrew Kyle Lampinen
Lucas Dixon
Danqi Chen
Asma Ghandeharioun
289
19
0
06 Dec 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Sungbin Shin
Dongyeop Lee
Maksym Andriushchenko
Namhoon Lee
AAML
724
2
0
29 Nov 2023
A PAC-Bayesian Perspective on the Interpolating Information Criterion
Liam Hodgkinson
Christopher van der Heide
Roberto Salomone
Fred Roosta
Michael W. Mahoney
251
2
0
13 Nov 2023
Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics
Soo Min Kwon
Zekai Zhang
Dogyoon Song
Laura Balzano
Qing Qu
245
4
0
08 Nov 2023
PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning
Mingjia Shi
Yuhao Zhou
Xiaojiang Peng
Huaizheng Zhang
Shudong Huang
Qing Ye
Jiancheng Lv
228
15
0
13 Oct 2023
A path-norm toolkit for modern networks: consequences, promises and challenges
International Conference on Learning Representations (ICLR), 2023
Antoine Gonon
Nicolas Brisebarre
E. Riccietti
Rémi Gribonval
408
10
0
02 Oct 2023
Asynchronous Graph Generator
Signal Processing (Signal Process.), 2023
Christopher P. Ley
Felipe Tobar
AI4TS
327
0
0
29 Sep 2023
Unveiling Invariances via Neural Network Pruning
Derek Xu
Luke Huan
Wei Wang
192
0
0
15 Sep 2023
The Interpolating Information Criterion for Overparameterized Models
Liam Hodgkinson
Christopher van der Heide
Roberto Salomone
Fred Roosta
Michael W. Mahoney
170
10
0
15 Jul 2023
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows
Neural Information Processing Systems (NeurIPS), 2023
Sibylle Marcotte
Rémi Gribonval
Gabriel Peyré
301
27
0
30 Jun 2023
Catching Image Retrieval Generalization
Maksim Zhdanov
I. Karpukhin
VLM
156
0
0
23 Jun 2023
Understanding and Mitigating Extrapolation Failures in Physics-Informed Neural Networks
Lukas Fesser
Luca D'Amico-Wong
Richard Qiu
256
7
0
15 Jun 2023
The Law of Parsimony in Gradient Descent for Learning Deep Linear Networks
Can Yaras
Peng Wang
Wei Hu
Zhihui Zhu
Laura Balzano
Qing Qu
257
19
0
01 Jun 2023
(Almost) Provable Error Bounds Under Distribution Shift via Disagreement Discrepancy
Neural Information Processing Systems (NeurIPS), 2023
Elan Rosenfeld
Saurabh Garg
UQCV
150
12
0
01 Jun 2023
When Does Optimizing a Proper Loss Yield Calibration?
Neural Information Processing Systems (NeurIPS), 2023
Jarosław Błasiok
Parikshit Gopalan
Lunjia Hu
Preetum Nakkiran
209
37
0
30 May 2023
Consistent Optimal Transport with Empirical Conditional Measures
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Piyushi Manupriya
Rachit Keerti Das
Sayantan Biswas
S. Jagarlapudi
OT
412
6
0
25 May 2023
Exploring the Complexity of Deep Neural Networks through Functional Equivalence
International Conference on Machine Learning (ICML), 2023
Guohao Shen
302
6
0
19 May 2023
Adaptive Consensus Optimization Method for GANs
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Sachin Kumar Danisetty
Santhosh Reddy Mylaram
Pawan Kumar
ODL
130
3
0
20 Apr 2023
Saddle-to-Saddle Dynamics in Diagonal Linear Networks
Neural Information Processing Systems (NeurIPS), 2023
Scott Pesme
Nicolas Flammarion
376
45
0
02 Apr 2023
Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descent
International Conference on Learning Representations (ICLR), 2023
Avrajit Ghosh
He Lyu
Xitong Zhang
Rongrong Wang
186
27
0
02 Feb 2023
Why Deep Learning Generalizes
Benjamin L. Badger
TDI
AI4CE
127
4
0
17 Nov 2022
C-Mixup: Improving Generalization in Regression
Neural Information Processing Systems (NeurIPS), 2022
Huaxiu Yao
Yiping Wang
Linjun Zhang
James Zou
Chelsea Finn
UQCV
OOD
186
80
0
11 Oct 2022
DeepMed: Semiparametric Causal Mediation Analysis with Debiased Deep Learning
Neural Information Processing Systems (NeurIPS), 2022
Siqi Xu
Lin Liu
Zhong Liu
CML
MedIm
179
12
0
10 Oct 2022
Learning Temporal Resolution in Spectrogram for Audio Classification
AAAI Conference on Artificial Intelligence (AAAI), 2022
Haohe Liu
Xubo Liu
Qiuqiang Kong
Wenwu Wang
Mark D. Plumbley
239
13
0
04 Oct 2022
The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels
Daniel Shwartz
Uri Stern
D. Weinshall
NoLa
199
2
0
02 Oct 2022
Why neural networks find simple solutions: the many regularizers of geometric complexity
Neural Information Processing Systems (NeurIPS), 2022
Benoit Dherin
Michael Munn
M. Rosca
David Barrett
280
42
0
27 Sep 2022
Robust Constrained Reinforcement Learning
Yue Wang
Fei Miao
Shaofeng Zou
152
20
0
14 Sep 2022
The BUTTER Zone: An Empirical Study of Training Dynamics in Fully Connected Neural Networks
Charles Edison Tripp
J. Perr-Sauer
L. Hayne
M. Lunacek
Jamil Gafur
AI4CE
256
1
0
25 Jul 2022