v1v2v3 (latest)

Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't

CSIAM Transactions on Applied Mathematics (CSIAM Trans. Appl. Math.), 2020

22 September 2020

Chao Ma

Papers citing "Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't"

50 / 90 papers shown

Allocation of Parameters in Transformers

197

04 Oct 2025

Vector-Valued Reproducing Kernel Banach Spaces for Neural Networks and Operators

Sven Dummer

Tjeerd Jan Heeringa

José A. Iglesias

222

30 Sep 2025

Convergence for adaptive resampling of random Fourier features

139

03 Sep 2025

A Spin Glass Characterization of Neural Networks

Jun Li

171

10 Aug 2025

Sharp higher order convergence rates for the Adam optimizer

349

28 Apr 2025

Non-convergence to the optimal risk for Adam and stochastic gradient descent optimization in the training of deep neural networks

Thang Do

Arnulf Jentzen

Adrian Riekert

322

03 Mar 2025

Robust Concept Erasure Using Task Vectors

505

21 Feb 2025

A note on the physical interpretation of neural PDE's

Sauro Succi

AI4CE

216

10 Feb 2025

High-dimensional classification problems with Barron regular boundaries under margin conditionsNeural Networks (NN), 2024

Jonathan García

Philipp Petersen

365

10 Dec 2024

Nonuniform random feature models using derivative information

Konstantin Pieper

Zezhong Zhang

Guannan Zhang

254

03 Oct 2024

Dimension-independent learning rates for high-dimensional classification problems

Andrés Felipe Lerma Pineda

P. Petersen

Simon Frieder

Thomas Lukasiewicz

213

26 Sep 2024

Fast training of accurate physics-informed neural networks without gradient descent

410

31 May 2024

Approximation and Gradient Descent Training with Neural Networks

G. Welper

295

19 May 2024

Non-Convex Robust Hypothesis Testing using Sinkhorn Uncertainty Sets

Jie Wang

Rui Gao

Yao Xie

268

21 Mar 2024

Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions

Frank Cole

Yuxuan Zhao

DiffM

448

12 Feb 2024

Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention Landscape

Juno Kim

Taiji Suzuki

441

02 Feb 2024

On Excess Risk Convergence Rates of Neural Network Classifiers

Hyunouk Ko

Namjoon Suh

X. Huo

215

26 Sep 2023

Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss

T. Getu

Georges Kaddoum

M. Bennis

425

13 Sep 2023

Approximation Results for Gradient Descent trained Neural Networks

G. Welper

206

09 Sep 2023

Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization

Leyang Zhang

Yaoyu Zhang

506

01 Sep 2023

On the accuracy of interpolation based on single-layer artificial neural networks with a focus on defeating the Runge phenomenonSoft Computing - A Fusion of Foundations, Methodologies and Applications (Soft Comput.), 2023

F. Auricchio

Maria Roberta Belardo

Gianluca Fabiani

Francesco Calabrò

A. Pascaner

408

21 Aug 2023

Sampling weights of deep neural networksNeural Information Processing Systems (NeurIPS), 2023

299

29 Jun 2023

Exploiting Noise as a Resource for Computation and Learning in Spiking Neural Networks

Gehua (Marcus) Ma

Rui Yan

Huajin Tang

558

25 May 2023

Embeddings between Barron spaces with higher order activation functionsApplied and Computational Harmonic Analysis (ACHA), 2023

T. J. Heeringa

L. Spek

Felix L. Schwenninger

C. Brune

255

25 May 2023

Human Semantic Segmentation using Millimeter-Wave Radar Sparse Point CloudsInternational Conference on Computer Supported Cooperative Work in Design (CSCWD), 2023

Pengfei Song

Luoyu Mei

H. Cheng

182

27 Apr 2023

Understanding Overfitting in Adversarial Training via Kernel Regression

Teng Zhang

Kang Li

249

13 Apr 2023

Infinite-dimensional reservoir computingNeural Networks (Neural Netw.), 2023

Lukas Gonon

Lyudmila Grigoryeva

Juan-Pablo Ortega

320

02 Apr 2023

On the existence of optimal shallow feedforward networks with ReLU activation

Steffen Dereich

Sebastian Kassing

266

06 Mar 2023

On the existence of minimizers in shallow residual ReLU neural network optimization landscapesSIAM Journal on Numerical Analysis (SINUM), 2023

Steffen Dereich

Arnulf Jentzen

Sebastian Kassing

375

28 Feb 2023

A Brief Survey on the Approximation Theory for Sequence ModellingJournal of Machine Learning (JML), 2023

326

27 Feb 2023

Reinforcement Learning with Function Approximation: From Linear to NonlinearJournal of Machine Learning (JML), 2023

Jihao Long

Jiequn Han

376

20 Feb 2023

Selected aspects of complex, hypercomplex and fuzzy neural networks

A. Niemczynowicz

Radosław Antoni Kycia

...

412

29 Dec 2022

A Mathematical Framework for Learning Probability DistributionsJournal of Machine Learning (JML), 2022

Hongkang Yang

373

22 Dec 2022

Infinite-width limit of deep linear neural networksCommunications on Pure and Applied Mathematics (CPAM), 2022

Lénaïc Chizat

Maria Colombo

Xavier Fernández-Real

Alessio Figalli

221

29 Nov 2022

To be or not to be stable, that is the question: understanding neural networks for inverse problemsSIAM Journal on Scientific Computing (SISC), 2022

285

24 Nov 2022

Duality for Neural Networks through Reproducing Kernel Banach SpacesSocial Science Research Network (SSRN), 2022

L. Spek

T. J. Heeringa

Felix L. Schwenninger

C. Brune

534

09 Nov 2022

Asymptotic-Preserving Neural Networks for hyperbolic systems with diffusive scaling

Giulia Bertaglia

AI4CE

256

17 Oct 2022

Approximation results for Gradient Descent trained Shallow Neural Networks in

1d

R. Gentile

G. Welper

ODL

362

17 Sep 2022

Optimal bump functions for shallow ReLU networks: Weight decay, depth separation and the curse of dimensionality

Stephan Wojtowytsch

254

02 Sep 2022

Super-model ecosystem: A domain-adaptation perspective

Fengxiang He

Dacheng Tao

DiffM

208

30 Aug 2022

Approximation Power of Deep Neural Networks: an explanatory mathematical survey

Mohammad Motamed

229

19 Jul 2022

$On bounds for norms of reparameterized ReLU artificial neural network parameters: sums of fractional powers of the Lipschitz norm control the network parameter vector$

On bounds for norms of reparameterized ReLU artificial neural network parameters: sums of fractional powers of the Lipschitz norm control the network parameter vector

Arnulf Jentzen

T. Kröger

286

27 Jun 2022

Asymptotic-Preserving Neural Networks for multiscale hyperbolic models of epidemic spreadMathematical Models and Methods in Applied Sciences (M3AS), 2022

158

25 Jun 2022

A PDE-based Explanation of Extreme Numerical Sensitivities and Edge of Stability in Training Neural NetworksJournal of machine learning research (JMLR), 2022

510

04 Jun 2022

Understanding Deep Learning via Decision BoundaryIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

242

03 Jun 2022

SRMD: Sparse Random Mode DecompositionCommunication on Applied Mathematics and Computation (CAMC), 2022

Nicholas Richardson

Hayden Schaeffer

Giang Tran

256

12 Apr 2022

HARFE: Hard-Ridge Random Feature ExpansionSampling Theory, Signal Processing, and Data Analysis (TSPDA), 2022

Esha Saha

Hayden Schaeffer

Giang Tran

342

06 Feb 2022

Machine Learning in Heterogeneous Porous Materials

...

247

04 Feb 2022

Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks

Shaun Li

AI4CE

257

03 Jan 2022

Deep neural networks for solving forward and inverse problems of (2+1)-dimensional nonlinear wave equations with rational solitons

Zijian Zhou

Li Wang

Zhenya Yan

323

28 Dec 2021