v1v2v3v4 (latest)

Neural Tangent Kernel: Convergence and Generalization in Neural Networks

20 June 2018

Papers citing "Neural Tangent Kernel: Convergence and Generalization in Neural Networks"

50 / 2,409 papers shown

Title
Random at First, Fast at Last: NTK-Guided Fourier Pre-Processing for Tabular DL Renat Sergazinov Jing Wu Shao-An Yin AAML LMTD 198 0 0 03 Jun 2025
Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration Youngmin Oh J. Park Taejin Paik Jaemin Park 191 0 0 02 Jun 2025
DRAUN: An Algorithm-Agnostic Data Reconstruction Attack on Federated Unlearning Systems Hithem Lamri Manaar Alam Haiyan Jiang Michail Maniatakos MU 147 0 0 02 Jun 2025
Model Reprogramming Demystified: A Neural Tangent Kernel Perspective Ming-Yu Chung Jiashuo Fan Hancheng Ye Qinsi Wang Wei-Chen Shen Chia-Mu Yu Pin-Yu Chen Sy-Yen Kuo 163 1 0 31 May 2025
Spectral Insights into Data-Oblivious Critical Layers in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 Xuyuan Liu Lei Hsiung Yaoqing Yang Yujun Yan AAML 231 1 0 31 May 2025
Taming Hyperparameter Sensitivity in Data Attribution: Practical Selection Without Costly Retraining Weiyi Wang Junwei Deng Yuzheng Hu Shiyuan Zhang Xirui Jiang Runting Zhang Han Zhao Jiaqi W. Ma TDI 218 1 0 30 May 2025
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking Yuatyong Chaichana Thanapat Trachu Peerat Limkonchotiwat Konpat Preechakul Tirasan Khandhawit Ekapol Chuangsuwanich MoMe 529 0 0 29 May 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs Liangkai Hang Junjie Yao Zhiwei Bai Tianyi Chen Yang Chen ... Feiyu Xiong Y. Zhang Weinan E Hongkang Yang Zhi-hai Xu LRM 193 2 0 29 May 2025
Characterising the Inductive Biases of Neural Networks on Boolean Data Chris Mingard Lukas Seier Niclas Goring Andrei-Vlad Badelita Charles London Ard A. Louis AI4CE 232 1 0 29 May 2025
Benignity of loss landscape with weight decay requires both large overparametrization and initialization Etienne Boursier Matthew Bowditch Matthias Englert R. Lazic 148 0 0 28 May 2025
Learning Curves of Stochastic Gradient Descent in Kernel Regression Haihan Zhang Weicheng Lin Yuanshi Liu Cong Fang 161 0 0 28 May 2025
Assessing Quantum Advantage for Gaussian Process Regression Dominic Lowe M.S. Kim Roberto Bondesan 170 2 0 28 May 2025
The informativeness of the gradient revisitedNeural Networks (NN), 2025 Rustem Takhanov AAML 125 0 0 28 May 2025
Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape Ioannis Bantzis James B. Simon Arthur Jacot ODL 300 2 0 27 May 2025
Leaner Transformers: More Heads, Less Depth Hemanth Saratchandran Damien Teney Simon Lucey 147 3 0 27 May 2025
Universal Value-Function Uncertainties Moritz A. Zanger Max Weltevrede Yaniv Oren Pascal R. van der Vaart Caroline Horsch Wendelin Bohmer M. Spaan OffRL 270 0 0 27 May 2025
A ZeNN architecture to avoid the Gaussian trap Luís Carvalho Joao L. Costa José Mourao Gonçalo Oliveira 222 0 0 26 May 2025
A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation Etienne Boursier Scott Pesme Radu-Alexandru Dragomir 249 1 0 26 May 2025
Variational Deep Learning via Implicit Regularization Jonathan Wenger Beau Coker Juraj Marusic John P. Cunningham OOD UQCV BDL 276 1 0 26 May 2025
ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior Florian Eichin Yupei Du Philipp Mondorf Maria Matveev Barbara Plank Michael A. Hedderich FAtt 406 0 0 26 May 2025
On the Role of Label Noise in the Feature Learning Process Andi Han Wei Huang Zhanpeng Zhou Gang Niu Wuyang Chen Junchi Yan Akiko Takeda Taiji Suzuki NoLa MLT 383 2 0 25 May 2025
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes I. Harel Yonathan Wolanowsky Gal Vardi Nathan Srebro Daniel Soudry AI4CE 305 1 0 25 May 2025
Querying Kernel Methods Suffices for Reconstructing their Training Data Daniel Barzilai Yuval Margalit Eitan Gronich Gilad Yehudai Meirav Galun Ronen Basri 189 0 0 25 May 2025
Function Forms of Simple ReLU Networks with Random Hidden Weights Ka Long Keith Ho Yoshinari Takeishi Junichi Takeuchi 98 1 0 23 May 2025
Joker: Joint Optimization Framework for Lightweight Kernel Machines Junhong Zhang Zhihui Lai 167 0 0 23 May 2025
Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning Yutong Chen Jiandong Gao Ji Wu ALM 412 1 0 23 May 2025
Training-Free Reasoning and Reflection in MLLMs Hongchen Wei Zhenzhong Chen OffRL VLM LRM 218 1 0 22 May 2025
How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning Max Weltevrede Moritz A. Zanger M. Spaan Wendelin Bohmer OffRL FedML 323 0 0 22 May 2025
Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models Alessandro Favero Antonio Sclocchi Matthieu Wyart DiffM 281 9 0 22 May 2025
Small-to-Large Generalization: Data Influences Models Consistently Across Scale Alaa Khaddaj Logan Engstrom Aleksander Madry TDI AI4CE 253 0 0 22 May 2025
Risk-Averse Reinforcement Learning with Itakura-Saito Loss Igor Udovichenko Olivier Croissant Anita Toleutaeva Evgeny Burnaev Alexander Korotin 164 0 0 22 May 2025
Directional Convergence, Benign Overfitting of Gradient Descent in leaky ReLU two-layer Neural Networks Ichiro Hashimoto MLT 264 0 0 22 May 2025
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation Yuhui Zhang Dongshen Wu Yuichiro Wada Takafumi Kanamori OODD 485 1 0 22 May 2025
Convergence of Adam in Deep ReLU Networks via Directional Complexity and Kakeya Bounds Anupama Sridhar Alexander Johansen 242 0 0 21 May 2025
Certified Neural Approximations of Nonlinear Dynamics Frederik Baymler Mathiesen Nikolaus Vertovec Francesco Fabiano Luca Laurenti Alessandro Abate 225 1 0 21 May 2025
Hybrid Adaptive Modeling in Process Monitoring: Leveraging Sequence Encoders and Physics-Informed Neural Networks Mouad Elaarabi Domenico Borzacchiello Philippe Le Bot Nathan Lauzeral Sebastien Comas-Cardona PINN AI4CE 243 0 0 20 May 2025
Rethink the Role of Deep Learning towards Large-scale Quantum Systems Yusheng Zhao Chi Zhang Yuxuan Du AI4CE 139 3 0 20 May 2025
New Evidence of the Two-Phase Learning Dynamics of Neural Networks Zhanpeng Zhou Yongyi Yang Mahito Sugiyama Junchi Yan 183 2 0 20 May 2025
Just One Layer Norm Guarantees Stable Extrapolation Juliusz Ziomek George Whittle Michael A. Osborne 294 3 0 20 May 2025
Nonparametric Teaching for Graph Property Learners Chen Zhang Weixin Bu Zhaochun Ren Ziyue Liu Yik-Chung Wu Ngai Wong 327 2 0 20 May 2025
A Physics-Inspired Optimizer: Velocity Regularized Adam Pranav Vaidhyanathan Lucas Schorling Natalia Ares Michael A. Osborne ODL 398 0 0 19 May 2025
Enhancing Transformers Through Conditioned Embedded Tokens Hemanth Saratchandran Simon Lucey 288 2 0 19 May 2025
A Local Polyak-Lojasiewicz and Descent Lemma of Gradient Descent For Overparametrized Linear Models Ziqing Xu Hancheng Min Salma Tarmoun Enrique Mallada Rene Vidal 233 2 0 16 May 2025
Is Supervised Learning Really That Different from Unsupervised? Oskar Allerbo Thomas B. Schön OOD SSL 470 0 0 16 May 2025
The Power of Random Features and the Limits of Distribution-Free Gradient Descent Ari Karchmer Eran Malach 235 0 0 15 May 2025
Towards scalable surrogate models based on Neural Fields for large scale aerodynamic simulations Giovanni Catalani Jean Fesquet Xavier Bertrand Frédéric Tost Michaël Bauerheim Joseph Morlier AI4CE 282 1 0 14 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model George Andriopoulos Soyuj Jung Basnet Juan Guevara Li Guo George Andriopoulos 240 0 0 14 May 2025
SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures Julian Kranz Davide Gallon Steffen Dereich Arnulf Jentzen 184 3 0 14 May 2025
Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural NetworksbioRxiv (bioRxiv), 2025 Yi Xie Stefan Mihalas Łukasz Kuśmierz 209 0 0 14 May 2025
Block-Biased Mamba for Long-Range Sequence Processing Annan Yu N. Benjamin Erichson Mamba 291 2 0 13 May 2025