Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2009.10713
Cited By
v1
v2
v3 (latest)
Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't
CSIAM Transactions on Applied Mathematics (CSIAM Trans. Appl. Math.), 2020
22 September 2020
E. Weinan
Chao Ma
Stephan Wojtowytsch
Lei Wu
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't"
50 / 90 papers shown
Allocation of Parameters in Transformers
Ruoxi Yu
Haotian Jiang
Jingpu Cheng
Penghao Yu
Qianxiao Li
Zhong Li
MoE
197
0
0
04 Oct 2025
Vector-Valued Reproducing Kernel Banach Spaces for Neural Networks and Operators
Sven Dummer
Tjeerd Jan Heeringa
José A. Iglesias
222
3
0
30 Sep 2025
Convergence for adaptive resampling of random Fourier features
Xin Huang
Aku Kammonen
Anamika Pandey
Mattias Sandberg
E. von Schwerin
Anders Szepessy
Raúl Tempone
139
1
0
03 Sep 2025
A Spin Glass Characterization of Neural Networks
Jun Li
171
1
0
10 Aug 2025
Sharp higher order convergence rates for the Adam optimizer
Steffen Dereich
Arnulf Jentzen
Adrian Riekert
ODL
349
3
0
28 Apr 2025
Non-convergence to the optimal risk for Adam and stochastic gradient descent optimization in the training of deep neural networks
Thang Do
Arnulf Jentzen
Adrian Riekert
322
4
0
03 Mar 2025
Robust Concept Erasure Using Task Vectors
Minh Pham
Kelly O. Marshall
Chinmay Hegde
Niv Cohen
505
28
0
21 Feb 2025
A note on the physical interpretation of neural PDE's
Sauro Succi
AI4CE
216
6
0
10 Feb 2025
High-dimensional classification problems with Barron regular boundaries under margin conditions
Neural Networks (NN), 2024
Jonathan García
Philipp Petersen
365
1
0
10 Dec 2024
Nonuniform random feature models using derivative information
Konstantin Pieper
Zezhong Zhang
Guannan Zhang
254
3
0
03 Oct 2024
Dimension-independent learning rates for high-dimensional classification problems
Andrés Felipe Lerma Pineda
P. Petersen
Simon Frieder
Thomas Lukasiewicz
213
1
0
26 Sep 2024
Fast training of accurate physics-informed neural networks without gradient descent
Chinmay Datar
Taniya Kapoor
Abhishek Chandra
Q. Sun
Erik Lien Bolager
Iryna Burak
Anna Veselovska
Massimo Fornasier
Felix Dietrich
PINN
AI4CE
410
5
0
31 May 2024
Approximation and Gradient Descent Training with Neural Networks
G. Welper
295
2
0
19 May 2024
Non-Convex Robust Hypothesis Testing using Sinkhorn Uncertainty Sets
Jie Wang
Rui Gao
Yao Xie
268
5
0
21 Mar 2024
Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions
Frank Cole
Yuxuan Zhao
DiffM
448
8
0
12 Feb 2024
Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention Landscape
Juno Kim
Taiji Suzuki
441
42
0
02 Feb 2024
On Excess Risk Convergence Rates of Neural Network Classifiers
Hyunouk Ko
Namjoon Suh
X. Huo
215
3
0
26 Sep 2023
Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss
T. Getu
Georges Kaddoum
M. Bennis
425
1
0
13 Sep 2023
Approximation Results for Gradient Descent trained Neural Networks
G. Welper
206
1
0
09 Sep 2023
Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization
Leyang Zhang
Yaoyu Zhang
Yaoyu Zhang
506
5
0
01 Sep 2023
On the accuracy of interpolation based on single-layer artificial neural networks with a focus on defeating the Runge phenomenon
Soft Computing - A Fusion of Foundations, Methodologies and Applications (Soft Comput.), 2023
F. Auricchio
Maria Roberta Belardo
Gianluca Fabiani
Francesco Calabrò
A. Pascaner
408
6
0
21 Aug 2023
Sampling weights of deep neural networks
Neural Information Processing Systems (NeurIPS), 2023
Iryna Burak
Erik Lien Bolager
Chinmay Datar
Q. Sun
Felix Dietrich
BDL
UQCV
299
32
0
29 Jun 2023
Exploiting Noise as a Resource for Computation and Learning in Spiking Neural Networks
Gehua (Marcus) Ma
Rui Yan
Huajin Tang
558
37
0
25 May 2023
Embeddings between Barron spaces with higher order activation functions
Applied and Computational Harmonic Analysis (ACHA), 2023
T. J. Heeringa
L. Spek
Felix L. Schwenninger
C. Brune
255
5
0
25 May 2023
Human Semantic Segmentation using Millimeter-Wave Radar Sparse Point Clouds
International Conference on Computer Supported Cooperative Work in Design (CSCWD), 2023
Pengfei Song
Luoyu Mei
H. Cheng
182
3
0
27 Apr 2023
Understanding Overfitting in Adversarial Training via Kernel Regression
Teng Zhang
Kang Li
249
2
0
13 Apr 2023
Infinite-dimensional reservoir computing
Neural Networks (Neural Netw.), 2023
Lukas Gonon
Lyudmila Grigoryeva
Juan-Pablo Ortega
320
12
0
02 Apr 2023
On the existence of optimal shallow feedforward networks with ReLU activation
Steffen Dereich
Sebastian Kassing
266
5
0
06 Mar 2023
On the existence of minimizers in shallow residual ReLU neural network optimization landscapes
SIAM Journal on Numerical Analysis (SINUM), 2023
Steffen Dereich
Arnulf Jentzen
Sebastian Kassing
375
9
0
28 Feb 2023
A Brief Survey on the Approximation Theory for Sequence Modelling
Journal of Machine Learning (JML), 2023
Hao Jiang
Qianxiao Li
Zhong Li
Shida Wang
AI4TS
326
15
0
27 Feb 2023
Reinforcement Learning with Function Approximation: From Linear to Nonlinear
Journal of Machine Learning (JML), 2023
Jihao Long
Jiequn Han
376
8
0
20 Feb 2023
Selected aspects of complex, hypercomplex and fuzzy neural networks
A. Niemczynowicz
Radosław Antoni Kycia
Maciej Jaworski
A. Siemaszko
J. Calabuig
...
Baruch Schneider
Diana Berseghyan
Irina Perfiljeva
V. Novák
Piotr Artiemjew
412
2
0
29 Dec 2022
A Mathematical Framework for Learning Probability Distributions
Journal of Machine Learning (JML), 2022
Hongkang Yang
373
9
0
22 Dec 2022
Infinite-width limit of deep linear neural networks
Communications on Pure and Applied Mathematics (CPAM), 2022
Lénaïc Chizat
Maria Colombo
Xavier Fernández-Real
Alessio Figalli
221
26
0
29 Nov 2022
To be or not to be stable, that is the question: understanding neural networks for inverse problems
SIAM Journal on Scientific Computing (SISC), 2022
David Evangelista
J. Nagy
E. Morotti
E. L. Piccolomini
285
10
0
24 Nov 2022
Duality for Neural Networks through Reproducing Kernel Banach Spaces
Social Science Research Network (SSRN), 2022
L. Spek
T. J. Heeringa
Felix L. Schwenninger
C. Brune
534
18
0
09 Nov 2022
Asymptotic-Preserving Neural Networks for hyperbolic systems with diffusive scaling
Giulia Bertaglia
AI4CE
256
8
0
17 Oct 2022
Approximation results for Gradient Descent trained Shallow Neural Networks in
1
d
1d
1
d
R. Gentile
G. Welper
ODL
362
9
0
17 Sep 2022
Optimal bump functions for shallow ReLU networks: Weight decay, depth separation and the curse of dimensionality
Stephan Wojtowytsch
254
1
0
02 Sep 2022
Super-model ecosystem: A domain-adaptation perspective
Fengxiang He
Dacheng Tao
DiffM
208
1
0
30 Aug 2022
Approximation Power of Deep Neural Networks: an explanatory mathematical survey
Mohammad Motamed
229
4
0
19 Jul 2022
On bounds for norms of reparameterized ReLU artificial neural network parameters: sums of fractional powers of the Lipschitz norm control the network parameter vector
Arnulf Jentzen
T. Kröger
286
0
0
27 Jun 2022
Asymptotic-Preserving Neural Networks for multiscale hyperbolic models of epidemic spread
Mathematical Models and Methods in Applied Sciences (M3AS), 2022
Giulia Bertaglia
Chuan Lu
L. Pareschi
Xueyu Zhu
AI4CE
158
30
0
25 Jun 2022
A PDE-based Explanation of Extreme Numerical Sensitivities and Edge of Stability in Training Neural Networks
Journal of machine learning research (JMLR), 2022
Yuxin Sun
Dong Lao
G. Sundaramoorthi
A. Yezzi
510
3
0
04 Jun 2022
Understanding Deep Learning via Decision Boundary
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Shiye Lei
Fengxiang He
Yancheng Yuan
Dacheng Tao
242
25
0
03 Jun 2022
SRMD: Sparse Random Mode Decomposition
Communication on Applied Mathematics and Computation (CAMC), 2022
Nicholas Richardson
Hayden Schaeffer
Giang Tran
256
18
0
12 Apr 2022
HARFE: Hard-Ridge Random Feature Expansion
Sampling Theory, Signal Processing, and Data Analysis (TSPDA), 2022
Esha Saha
Hayden Schaeffer
Giang Tran
342
22
0
06 Feb 2022
Machine Learning in Heterogeneous Porous Materials
Martha DÉli
H. Deng
Cedric G. Fraces
K. Garikipati
L. Graham‐Brady
...
H. Tchelepi
B. Važić
Hari S. Viswanathan
H. Yoon
P. Zarzycki
AI4CE
247
12
0
04 Feb 2022
Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
AI4CE
257
1
0
03 Jan 2022
Deep neural networks for solving forward and inverse problems of (2+1)-dimensional nonlinear wave equations with rational solitons
Zijian Zhou
Li Wang
Zhenya Yan
323
2
0
28 Dec 2021
1
2
Next
Page 1 of 2