arXiv:1806.07572
Neural Tangent Kernel: Convergence and Generalization in Neural Networks

20 June 2018
Arthur Jacot
Franck Gabriel
Clément Hongler

Papers citing "Neural Tangent Kernel: Convergence and Generalization in Neural Networks"

50 / 2,409 papers shown
Title
Random at First, Fast at Last: NTK-Guided Fourier Pre-Processing for Tabular DL
Renat Sergazinov
Jing Wu
Shao-An Yin
AAML, LMTD
198
0
0
03 Jun 2025
Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration
Youngmin Oh
J. Park
Taejin Paik
Jaemin Park
191
0
0
02 Jun 2025
DRAUN: An Algorithm-Agnostic Data Reconstruction Attack on Federated Unlearning Systems
Hithem Lamri
Manaar Alam
Haiyan Jiang
Michail Maniatakos
MU
147
0
0
02 Jun 2025
Model Reprogramming Demystified: A Neural Tangent Kernel Perspective
Ming-Yu Chung
Jiashuo Fan
Hancheng Ye
Qinsi Wang
Wei-Chen Shen
Chia-Mu Yu
Pin-Yu Chen
Sy-Yen Kuo
163
1
0
31 May 2025
Spectral Insights into Data-Oblivious Critical Layers in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Xuyuan Liu
Lei Hsiung
Yaoqing Yang
Yujun Yan
AAML
231
1
0
31 May 2025
Taming Hyperparameter Sensitivity in Data Attribution: Practical Selection Without Costly Retraining
Weiyi Wang
Junwei Deng
Yuzheng Hu
Shiyuan Zhang
Xirui Jiang
Runting Zhang
Han Zhao
Jiaqi W. Ma
TDI
218
1
0
30 May 2025
Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking
Yuatyong Chaichana
Thanapat Trachu
Peerat Limkonchotiwat
Konpat Preechakul
Tirasan Khandhawit
Ekapol Chuangsuwanich
MoMe
529
0
0
29 May 2025
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
Liangkai Hang
Junjie Yao
Zhiwei Bai
Tianyi Chen
Yang Chen
...
Feiyu Xiong
Y. Zhang
Weinan E
Hongkang Yang
Zhi-hai Xu
LRM
193
2
0
29 May 2025
Characterising the Inductive Biases of Neural Networks on Boolean Data
Chris Mingard
Lukas Seier
Niclas Goring
Andrei-Vlad Badelita
Charles London
Ard A. Louis
AI4CE
232
1
0
29 May 2025
Benignity of loss landscape with weight decay requires both large overparametrization and initialization
Etienne Boursier
Matthew Bowditch
Matthias Englert
R. Lazic
148
0
0
28 May 2025
Learning Curves of Stochastic Gradient Descent in Kernel Regression
Haihan Zhang
Weicheng Lin
Yuanshi Liu
Cong Fang
161
0
0
28 May 2025
Assessing Quantum Advantage for Gaussian Process Regression
Dominic Lowe
M.S. Kim
Roberto Bondesan
170
2
0
28 May 2025
The informativeness of the gradient revisited
Neural Networks (NN), 2025
Rustem Takhanov
AAML
125
0
0
28 May 2025
Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape
Ioannis Bantzis
James B. Simon
Arthur Jacot
ODL
300
2
0
27 May 2025
Leaner Transformers: More Heads, Less Depth
Hemanth Saratchandran
Damien Teney
Simon Lucey
147
3
0
27 May 2025
Universal Value-Function Uncertainties
Moritz A. Zanger
Max Weltevrede
Yaniv Oren
Pascal R. van der Vaart
Caroline Horsch
Wendelin Bohmer
M. Spaan
OffRL
270
0
0
27 May 2025
A ZeNN architecture to avoid the Gaussian trap
Luís Carvalho
Joao L. Costa
José Mourao
Gonçalo Oliveira
222
0
0
26 May 2025
A Theoretical Framework for Grokking: Interpolation followed by Riemannian Norm Minimisation
Etienne Boursier
Scott Pesme
Radu-Alexandru Dragomir
249
1
0
26 May 2025
Variational Deep Learning via Implicit Regularization
Jonathan Wenger
Beau Coker
Juraj Marusic
John P. Cunningham
OOD, UQCV, BDL
276
1
0
26 May 2025
ExPLAIND: Unifying Model, Data, and Training Attribution to Study Model Behavior
Florian Eichin
Yupei Du
Philipp Mondorf
Maria Matveev
Barbara Plank
Michael A. Hedderich
FAtt
406
0
0
26 May 2025
On the Role of Label Noise in the Feature Learning Process
Andi Han
Wei Huang
Zhanpeng Zhou
Gang Niu
Wuyang Chen
Junchi Yan
Akiko Takeda
Taiji Suzuki
NoLa, MLT
383
2
0
25 May 2025
Temperature is All You Need for Generalization in Langevin Dynamics and other Markov Processes
I. Harel
Yonathan Wolanowsky
Gal Vardi
Nathan Srebro
Daniel Soudry
AI4CE
305
1
0
25 May 2025
Querying Kernel Methods Suffices for Reconstructing their Training Data
Daniel Barzilai
Yuval Margalit
Eitan Gronich
Gilad Yehudai
Meirav Galun
Ronen Basri
189
0
0
25 May 2025
Function Forms of Simple ReLU Networks with Random Hidden Weights
Ka Long Keith Ho
Yoshinari Takeishi
Junichi Takeuchi
98
1
0
23 May 2025
Joker: Joint Optimization Framework for Lightweight Kernel Machines
Junhong Zhang
Zhihui Lai
167
0
0
23 May 2025
Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning
Yutong Chen
Jiandong Gao
Ji Wu
ALM
412
1
0
23 May 2025
Training-Free Reasoning and Reflection in MLLMs
Hongchen Wei
Zhenzhong Chen
OffRL, VLM, LRM
218
1
0
22 May 2025
How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Max Weltevrede
Moritz A. Zanger
M. Spaan
Wendelin Bohmer
OffRL, FedML
323
0
0
22 May 2025
Bigger Isn't Always Memorizing: Early Stopping Overparameterized Diffusion Models
Alessandro Favero
Antonio Sclocchi
Matthieu Wyart
DiffM
281
9
0
22 May 2025
Small-to-Large Generalization: Data Influences Models Consistently Across Scale
Alaa Khaddaj
Logan Engstrom
Aleksander Madry
TDI, AI4CE
253
0
0
22 May 2025
Risk-Averse Reinforcement Learning with Itakura-Saito Loss
Igor Udovichenko
Olivier Croissant
Anita Toleutaeva
Evgeny Burnaev
Alexander Korotin
164
0
0
22 May 2025
Directional Convergence, Benign Overfitting of Gradient Descent in leaky ReLU two-layer Neural Networks
Ichiro Hashimoto
MLT
264
0
0
22 May 2025
TULiP: Test-time Uncertainty Estimation via Linearization and Weight Perturbation
Yuhui Zhang
Dongshen Wu
Yuichiro Wada
Takafumi Kanamori
OODD
485
1
0
22 May 2025
Convergence of Adam in Deep ReLU Networks via Directional Complexity and Kakeya Bounds
Anupama Sridhar
Alexander Johansen
242
0
0
21 May 2025
Certified Neural Approximations of Nonlinear Dynamics
Frederik Baymler Mathiesen
Nikolaus Vertovec
Francesco Fabiano
Luca Laurenti
Alessandro Abate
225
1
0
21 May 2025
Hybrid Adaptive Modeling in Process Monitoring: Leveraging Sequence Encoders and Physics-Informed Neural Networks
Mouad Elaarabi
Domenico Borzacchiello
Philippe Le Bot
Nathan Lauzeral
Sebastien Comas-Cardona
PINN, AI4CE
243
0
0
20 May 2025
Rethink the Role of Deep Learning towards Large-scale Quantum Systems
Yusheng Zhao
Chi Zhang
Yuxuan Du
AI4CE
139
3
0
20 May 2025
New Evidence of the Two-Phase Learning Dynamics of Neural Networks
Zhanpeng Zhou
Yongyi Yang
Mahito Sugiyama
Junchi Yan
183
2
0
20 May 2025
Just One Layer Norm Guarantees Stable Extrapolation
Juliusz Ziomek
George Whittle
Michael A. Osborne
294
3
0
20 May 2025
Nonparametric Teaching for Graph Property Learners
Chen Zhang
Weixin Bu
Zhaochun Ren
Ziyue Liu
Yik-Chung Wu
Ngai Wong
327
2
0
20 May 2025
A Physics-Inspired Optimizer: Velocity Regularized Adam
Pranav Vaidhyanathan
Lucas Schorling
Natalia Ares
Michael A. Osborne
ODL
398
0
0
19 May 2025
Enhancing Transformers Through Conditioned Embedded Tokens
Hemanth Saratchandran
Simon Lucey
288
2
0
19 May 2025
A Local Polyak-Lojasiewicz and Descent Lemma of Gradient Descent For Overparametrized Linear Models
Ziqing Xu
Hancheng Min
Salma Tarmoun
Enrique Mallada
Rene Vidal
233
2
0
16 May 2025
Is Supervised Learning Really That Different from Unsupervised?
Oskar Allerbo
Thomas B. Schön
OOD, SSL
470
0
0
16 May 2025
The Power of Random Features and the Limits of Distribution-Free Gradient Descent
Ari Karchmer
Eran Malach
235
0
0
15 May 2025
Towards scalable surrogate models based on Neural Fields for large scale aerodynamic simulations
Giovanni Catalani
Jean Fesquet
Xavier Bertrand
Frédéric Tost
Michaël Bauerheim
Joseph Morlier
AI4CE
282
1
0
14 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
George Andriopoulos
Soyuj Jung Basnet
Juan Guevara
Li Guo
240
0
0
14 May 2025
SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures
Julian Kranz
Davide Gallon
Steffen Dereich
Arnulf Jentzen
184
3
0
14 May 2025
Slow Transition to Low-Dimensional Chaos in Heavy-Tailed Recurrent Neural Networks
bioRxiv, 2025
Yi Xie
Stefan Mihalas
Łukasz Kuśmierz
209
0
0
14 May 2025
Block-Biased Mamba for Long-Range Sequence Processing
Annan Yu
N. Benjamin Erichson
Mamba
291
2
0
13 May 2025