Papers citing 'Implicit Regularization in Deep Learning'

Title
Stabilizing Policy Gradient Methods via Reward Profiling Shihab Ahmed El Houcine Bergou A. Dutta Yue Wang 164 0 0 20 Nov 2025
Deep Learning Inductive Biases for fMRI Time Series Classification during Resting-state and Movie-watching Behdad Khodabandehloo Reza Rajimehr 62 0 0 21 Sep 2025
Reason to Rote: Rethinking Memorization in Reasoning Yupei Du Philipp Mondorf Silvia Casola Yuekun Yao Robert Litschko Barbara Plank 148 0 0 07 Jul 2025
Variational Adaptive Noise and Dropout towards Stable Recurrent Neural Networks Taisuke Kobayashi Shingo Murata 145 0 0 02 Jun 2025
Identifying Key Challenges of Hardness-Based Resampling Pawel Pukowski Venet Osmani 237 0 0 09 Apr 2025
High-entropy Advantage in Neural Networks' Generalizability Entao Yang Wei Wei Yue Shang Ge Zhang AI4CE 321 2 0 17 Mar 2025
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs Nir Ailon Akhiad Bercovich Yahel Uffenheimer Omri Weinstein 377 3 0 15 Mar 2025
Implicit Bias in Matrix Factorization and its Explicit Realization in a New Architecture Yikun Hou Suvrit Sra A. Yurtsever 311 0 0 27 Jan 2025
ExpTest: Automating Learning Rate Searching and Tuning with Insights from Linearized Neural Networks Zan Chaudhry Naoko Mizuno 263 0 0 25 Nov 2024
LDAdam: Adaptive Optimization from Low-Dimensional Gradient StatisticsInternational Conference on Learning Representations (ICLR), 2024 Thomas Robert M. Safaryan Ionut-Vlad Modoranu Dan Alistarh ODL 366 19 0 21 Oct 2024
A Theoretical Survey on Foundation Models Shi Fu Yuzhu Chen Yingjie Wang Dacheng Tao 247 1 0 15 Oct 2024
Low-Dimension-to-High-Dimension Generalization And Its Implications for Length Generalization Yang Chen Long Yang Yitao Liang Zhouchen Lin 298 2 0 11 Oct 2024
Input Space Mode Connectivity in Deep Neural NetworksInternational Conference on Learning Representations (ICLR), 2024 Jakub Vrabel Ori Shem-Ur Yaron Oz David Krueger 302 1 0 09 Sep 2024
DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric EstimationInternational Conference on Machine Learning (ICML), 2024 Qinshuo Liu Zixin Wang Xi-An Li Xinyao Ji Lei Zhang Lin Liu Zhonghua Liu 260 0 0 04 Aug 2024
A Margin-based Multiclass Generalization Bound via Geometric Complexity Michael Munn Benoit Dherin Javier Gonzalvo UQCV 206 2 0 28 May 2024
The Impact of Geometric Complexity on Neural Collapse in Transfer Learning Michael Munn Benoit Dherin Javier Gonzalvo AAML 239 5 0 24 May 2024
Improving Generalization of Deep Neural Networks by Optimum ShiftingAAAI Conference on Artificial Intelligence (AAAI), 2024 Yuyan Zhou Ye Li Lei Feng Sheng-Jun Huang OOD ODL 134 0 0 23 May 2024
A General Theory for Compositional Generalization Jingwen Fu Zhizheng Zhang Yan Lu Nanning Zheng AI4CE CoGe 190 2 0 20 May 2024
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A SurveyEngineering applications of artificial intelligence (EAAI), 2024 Guoping Xu Xiaxia Wang Xinglong Wu Xuesong Leng Yongchao Xu 3DPC 195 32 0 02 May 2024
A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial NetworksIEEE International Joint Conference on Neural Network (IJCNN), 2024 Neel Mishra Bamdev Mishra Pratik Jawanpuria Pawan Kumar GAN 177 1 0 10 Apr 2024
No Free Prune: Information-Theoretic Barriers to Pruning at Initialization Tanishq Kumar Kevin Luo Mark Sellke 212 8 0 02 Feb 2024
A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative ModelsAnnual Review of Statistics and Its Application (ARSIA), 2024 Namjoon Suh Guang Cheng MedIm 281 17 0 14 Jan 2024
Interpretability Illusions in the Generalization of Simplified Models Dan Friedman Andrew Kyle Lampinen Lucas Dixon Danqi Chen Asma Ghandeharioun 289 19 0 06 Dec 2023
Critical Influence of Overparameterization on Sharpness-aware MinimizationConference on Uncertainty in Artificial Intelligence (UAI), 2023 Sungbin Shin Dongyeop Lee Maksym Andriushchenko Namhoon Lee AAML 724 2 0 29 Nov 2023
A PAC-Bayesian Perspective on the Interpolating Information Criterion Liam Hodgkinson Christopher van der Heide Roberto Salomone Fred Roosta Michael W. Mahoney 251 2 0 13 Nov 2023
Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics Soo Min Kwon Zekai Zhang Dogyoon Song Laura Balzano Qing Qu 245 4 0 08 Nov 2023
PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning Mingjia Shi Yuhao Zhou Xiaojiang Peng Huaizheng Zhang Shudong Huang Qing Ye Jiangcheng Lv 228 15 0 13 Oct 2023
A path-norm toolkit for modern networks: consequences, promises and challengesInternational Conference on Learning Representations (ICLR), 2023 Antoine Gonon Nicolas Brisebarre E. Riccietti Rémi Gribonval 408 10 0 02 Oct 2023
Asynchronous Graph GeneratorSignal Processing (Signal Process.), 2023 Christopher P. Ley Felipe Tobar AI4TS 327 0 0 29 Sep 2023
Unveiling Invariances via Neural Network Pruning Derek Xu Luke Huan Wei Wang 192 0 0 15 Sep 2023
The Interpolating Information Criterion for Overparameterized Models Liam Hodgkinson Christopher van der Heide Roberto Salomone Fred Roosta Michael W. Mahoney 170 10 0 15 Jul 2023
Abide by the Law and Follow the Flow: Conservation Laws for Gradient FlowsNeural Information Processing Systems (NeurIPS), 2023 Sibylle Marcotte Rémi Gribonval Gabriel Peyré 301 27 0 30 Jun 2023
Catching Image Retrieval Generalization Maksim Zhdanov I. Karpukhin VLM 156 0 0 23 Jun 2023
Understanding and Mitigating Extrapolation Failures in Physics-Informed Neural Networks Lukas Fesser Luca DÁmico-Wong Richard Qiu 256 7 0 15 Jun 2023
The Law of Parsimony in Gradient Descent for Learning Deep Linear Networks Can Yaras Peng Wang Wei Hu Zhihui Zhu Laura Balzano Qing Qu 257 19 0 01 Jun 2023
(Almost) Provable Error Bounds Under Distribution Shift via Disagreement DiscrepancyNeural Information Processing Systems (NeurIPS), 2023 Elan Rosenfeld Saurabh Garg UQCV 150 12 0 01 Jun 2023
When Does Optimizing a Proper Loss Yield Calibration?Neural Information Processing Systems (NeurIPS), 2023 Jarosław Błasiok Parikshit Gopalan Lunjia Hu Preetum Nakkiran 209 37 0 30 May 2023
Consistent Optimal Transport with Empirical Conditional MeasuresInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Piyushi Manupriya Rachit Keerti Das Sayantan Biswas S. Jagarlapudi OT 412 6 0 25 May 2023
Exploring the Complexity of Deep Neural Networks through Functional EquivalenceInternational Conference on Machine Learning (ICML), 2023 Guohao Shen 302 6 0 19 May 2023
Adaptive Consensus Optimization Method for GANsIEEE International Joint Conference on Neural Network (IJCNN), 2023 Sachin Kumar Danisetty Santhosh Reddy Mylaram Pawan Kumar ODL 130 3 0 20 Apr 2023
Saddle-to-Saddle Dynamics in Diagonal Linear NetworksNeural Information Processing Systems (NeurIPS), 2023 Scott Pesme Nicolas Flammarion 376 45 0 02 Apr 2023
Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descentInternational Conference on Learning Representations (ICLR), 2023 Avrajit Ghosh He Lyu Xitong Zhang Rongrong Wang 186 27 0 02 Feb 2023
Why Deep Learning Generalizes Benjamin L. Badger TDI AI4CE 127 4 0 17 Nov 2022
C-Mixup: Improving Generalization in RegressionNeural Information Processing Systems (NeurIPS), 2022 Huaxiu Yao Yiping Wang Linjun Zhang James Zou Chelsea Finn UQCV OOD 186 80 0 11 Oct 2022
DeepMed: Semiparametric Causal Mediation Analysis with Debiased Deep LearningNeural Information Processing Systems (NeurIPS), 2022 Siqi Xu Lin Liu Zhong Liu CML MedIm 179 12 0 10 Oct 2022
Learning Temporal Resolution in Spectrogram for Audio ClassificationAAAI Conference on Artificial Intelligence (AAAI), 2022 Haohe Liu Xubo Liu Qiuqiang Kong Wenwu Wang Mark D. Plumbley 239 13 0 04 Oct 2022
The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels Daniel Shwartz Uri Stern D. Weinshall NoLa 199 2 0 02 Oct 2022
Why neural networks find simple solutions: the many regularizers of geometric complexityNeural Information Processing Systems (NeurIPS), 2022 Benoit Dherin Michael Munn M. Rosca David Barrett 280 42 0 27 Sep 2022
Robust Constrained Reinforcement Learning Yue Wang Fei Miao Shaofeng Zou 152 20 0 14 Sep 2022
The BUTTER Zone: An Empirical Study of Training Dynamics in Fully Connected Neural Networks Charles Edison Tripp J. Perr-Sauer L. Hayne M. Lunacek Jamil Gafur AI4CE 256 1 0 25 Jul 2022