A Rigorous Framework for the Mean Field Limit of Multilayer Neural Networks
Mathematical Statistics and Learning (MSL), 2020
30 January 2020
Phan-Minh Nguyen, H. Pham
AI4CE
arXiv: 2001.11443 (abs · PDF · HTML)
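
For context on the subject of this paper and the list below: in the mean-field description of neural networks, a finite-width layer is replaced by a probability measure over its parameters. A minimal two-layer sketch (standard background in this literature; the paper's contribution is a rigorous multilayer version of it):

$$f_N(x) = \frac{1}{N} \sum_{i=1}^{N} a_i \, \sigma(w_i^\top x) \;\longrightarrow\; f_\rho(x) = \int a \, \sigma(w^\top x) \, \mathrm{d}\rho(a, w) \quad \text{as } N \to \infty,$$

where $\rho$ is the limiting distribution of the neuron parameters $(a_i, w_i)$. Under gradient-based training, $\rho$ evolves according to a continuity (Wasserstein gradient-flow) equation; the framework of Nguyen and Pham extends this description to networks with more than two layers.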

Papers citing "A Rigorous Framework for the Mean Field Limit of Multilayer Neural Networks"

50 / 54 papers shown
Mean-Field Limits for Two-Layer Neural Networks Trained with Consensus-Based Optimization
William De Deyn, Michael Herty, Giovanni Samaey
212 · 0 · 0 · 26 Nov 2025

Block Coordinate Descent for Neural Networks Provably Finds Global Minima
Shunta Akiyama
177 · 2 · 0 · 26 Oct 2025

Global Convergence and Rich Feature Learning in $L$-Layer Infinite-Width Neural Networks under $μ$P Parametrization
Zixiang Chen, Greg Yang, Qingyue Zhao, Q. Gu
MLT
307 · 3 · 0 · 12 Mar 2025

Understanding the training of infinitely deep and wide ResNets with Conditional Optimal Transport
Communications on Pure and Applied Mathematics (CPAM), 2024
Raphael Barboni, Gabriel Peyré, François-Xavier Vialard
466 · 3 · 0 · 19 Mar 2024

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models
Annual Review of Statistics and Its Application (ARSIA), 2024
Namjoon Suh, Guang Cheng
MedIm
492 · 23 · 0 · 14 Jan 2024

Wide Deep Neural Networks with Gaussian Weights are Very Close to Gaussian Processes
Dario Trevisan
UQCV, BDL
378 · 12 · 0 · 18 Dec 2023

How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization
International Conference on Learning Representations (ICLR), 2023
Nuoya Xiong, Lijun Ding, Simon S. Du
549 · 21 · 0 · 03 Oct 2023

JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention
International Conference on Learning Representations (ICLR), 2023
Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Shaolei Du
461 · 48 · 0 · 01 Oct 2023

Mode Connectivity and Data Heterogeneity of Federated Learning
Tailin Zhou, Jun Zhang, Danny H. K. Tsang
FedML
325 · 5 · 0 · 29 Sep 2023

Fundamental Limits of Deep Learning-Based Binary Classifiers Trained with Hinge Loss
T. Getu, Georges Kaddoum, M. Bennis
431 · 1 · 0 · 13 Sep 2023

Six Lectures on Linearized Neural Networks
Journal of Statistical Mechanics: Theory and Experiment (J. Stat. Mech.), 2023
Theodor Misiakiewicz, Andrea Montanari
398 · 18 · 0 · 25 Aug 2023

Fundamental limits of overparametrized shallow neural networks for supervised learning
Francesco Camilli, D. Tieplova, Jean Barbier
269 · 11 · 0 · 11 Jul 2023

Neural Hilbert Ladders: Multi-Layer Neural Networks in Function Space
Journal of Machine Learning Research (JMLR), 2023
Zhengdao Chen
525 · 4 · 0 · 03 Jul 2023

Feature-Learning Networks Are Consistent Across Widths At Realistic Scales
Neural Information Processing Systems (NeurIPS), 2023
Nikhil Vyas, Alexander B. Atanasov, Blake Bordelon, Depen Morwani, Sabarish Sainathan, Cengiz Pehlevan
506 · 43 · 0 · 28 May 2023

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Neural Information Processing Systems (NeurIPS), 2023
Yuandong Tian, Yiping Wang, Beidi Chen, S. Du
MLT
602 · 112 · 0 · 25 May 2023

Depth Dependence of $μ$P Learning Rates in ReLU MLPs
Samy Jelassi, Boris Hanin, Ziwei Ji, Sashank J. Reddi, Srinadh Bhojanapalli, Surinder Kumar
214 · 9 · 0 · 13 May 2023

Depth Separation with Multilayer Mean-Field Networks
International Conference on Learning Representations (ICLR), 2023
Y. Ren, Mo Zhou, Rong Ge
OOD
327 · 5 · 0 · 03 Apr 2023

Global Optimality of Elman-type RNN in the Mean-Field Regime
International Conference on Machine Learning (ICML), 2023
Andrea Agazzi, Jian-Xiong Lu, Sayan Mukherjee
MLT
183 · 2 · 0 · 12 Mar 2023

PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium
Shihong Ding, Hanze Dong, Cong Fang, Zhouchen Lin, Tong Zhang
265 · 1 · 0 · 02 Mar 2023

Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron
Annual Conference on Computational Learning Theory (COLT), 2023
Weihang Xu, S. Du
436 · 22 · 0 · 20 Feb 2023

M22: A Communication-Efficient Algorithm for Federated Learning Inspired by Rate-Distortion
IEEE Transactions on Communications (IEEE Trans. Commun.), 2023
Yangyi Liu, Stefano Rini, Sadaf Salehkalaibar, Jun Chen
FedML
260 · 4 · 0 · 23 Jan 2023

Uniform-in-time propagation of chaos for mean field Langevin dynamics
Annales de l'Institut Henri Poincaré, Probabilités et Statistiques (Ann. Inst. Henri Poincaré Probab. Stat.), 2022
Fan Chen, Zhenjie Ren, Song-bo Wang
494 · 41 · 0 · 06 Dec 2022

On the symmetries in the dynamics of wide two-layer neural networks
Electronic Research Archive (ERA), 2022
Karl Hajjar, Lénaïc Chizat
324 · 11 · 0 · 16 Nov 2022

A Functional-Space Mean-Field Theory of Partially-Trained Three-Layer Neural Networks
Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna
MLT
418 · 5 · 0 · 28 Oct 2022

Mean-field analysis for heavy ball methods: Dropout-stability, connectivity, and global convergence
Diyuan Wu, Vyacheslav Kungurtsev, Marco Mondelli
240 · 3 · 0 · 13 Oct 2022

Analysis of the rate of convergence of an over-parametrized deep neural network estimate learned by gradient descent
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Michael Kohler, A. Krzyżak
294 · 12 · 0 · 04 Oct 2022

On the universal consistency of an over-parametrized deep neural network estimate learned by gradient descent
Annals of the Institute of Statistical Mathematics (AISM), 2022
Selina Drews, Michael Kohler
256 · 19 · 0 · 30 Aug 2022

Limitations of the NTK for Understanding Generalization in Deep Learning
Nikhil Vyas, Yamini Bansal, Preetum Nakkiran
359 · 39 · 0 · 20 Jun 2022

Mean-Field Analysis of Two-Layer Neural Networks: Global Optimality with Linear Convergence Rates
Jingwei Zhang, Xunpeng Huang, Jincheng Yu
MLT
313 · 1 · 0 · 19 May 2022

Self-Consistent Dynamical Field Theory of Kernel Evolution in Wide Neural Networks
Neural Information Processing Systems (NeurIPS), 2022
Blake Bordelon, Cengiz Pehlevan
MLT
458 · 123 · 0 · 19 May 2022

On Feature Learning in Neural Networks with Global Convergence Guarantees
International Conference on Learning Representations (ICLR), 2022
Zhengdao Chen, Eric Vanden-Eijnden, Joan Bruna
MLT
337 · 15 · 0 · 22 Apr 2022

Quantitative Gaussian Approximation of Randomly Initialized Deep Neural Networks
Andrea Basteri, Dario Trevisan
BDL
310 · 39 · 0 · 14 Mar 2022

Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
AI4CE
261 · 1 · 0 · 03 Jan 2022

Gradient flows on graphons: existence, convergence, continuity equations
Sewoong Oh, Soumik Pal, Raghav Somani, Raghavendra Tripathi
293 · 5 · 0 · 18 Nov 2021

Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks
Journal of Machine Learning Research (JMLR), 2021
Aleksandr Shevchenko, Vyacheslav Kungurtsev, Marco Mondelli
MLT
339 · 16 · 0 · 03 Nov 2021

Limiting fluctuation and trajectorial stability of multilayer neural networks with mean field training
Neural Information Processing Systems (NeurIPS), 2021
H. Pham, Phan-Minh Nguyen
162 · 6 · 0 · 29 Oct 2021

Gradient Descent on Infinitely Wide Neural Networks: Global Convergence and Generalization
Francis R. Bach, Lénaïc Chizat
MLT
183 · 28 · 0 · 15 Oct 2021

On the Global Convergence of Gradient Descent for multi-layer ResNets in the mean-field regime
Zhiyan Ding, Shi Chen, Qin Li, S. Wright
MLT, AI4CE
277 · 10 · 0 · 06 Oct 2021

A theory of representation learning gives a deep generalisation of kernel methods
International Conference on Machine Learning (ICML), 2021
Adam X. Yang, Maxime Robeyns, Edward Milsom, Ben Anson, Nandi Schoots, Laurence Aitchison
BDL
683 · 15 · 0 · 30 Aug 2021

Understanding Deflation Process in Over-parametrized Tensor Decomposition
Neural Information Processing Systems (NeurIPS), 2021
Rong Ge, Y. Ren, Xiang Wang, Mo Zhou
239 · 21 · 0 · 11 Jun 2021

Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks
Neural Information Processing Systems (NeurIPS), 2021
Melih Barsbey, Romain Chor, Murat A. Erdogdu, Gaël Richard, Umut Simsekli
326 · 51 · 0 · 07 Jun 2021

Overparameterization of deep ResNet: zero loss and mean-field analysis
Journal of Machine Learning Research (JMLR), 2021
Zhiyan Ding, Shi Chen, Qin Li, S. Wright
ODL
345 · 28 · 0 · 30 May 2021

Global Convergence of Three-layer Neural Networks in the Mean Field Regime
International Conference on Learning Representations (ICLR), 2021
H. Pham, Phan-Minh Nguyen
MLT, AI4CE
376 · 23 · 0 · 11 May 2021

Deep Nonparametric Regression on Approximate Manifolds: Non-Asymptotic Error Bounds with Polynomial Prefactors
Annals of Statistics (Ann. Stat.), 2021
Yuling Jiao, Guohao Shen, Yuanyuan Lin, Jian Huang
483 · 85 · 0 · 14 Apr 2021

A Local Convergence Theory for Mildly Over-Parameterized Two-Layer Neural Network
Annual Conference on Computational Learning Theory (COLT), 2021
Mo Zhou, Rong Ge, Chi Jin
398 · 53 · 0 · 04 Feb 2021

Particle Dual Averaging: Optimization of Mean Field Neural Networks with Global Convergence Rate Analysis
Neural Information Processing Systems (NeurIPS), 2020
Atsushi Nitanda, Denny Wu, Taiji Suzuki
550 · 31 · 0 · 31 Dec 2020

Mathematical Models of Overparameterized Neural Networks
Proceedings of the IEEE (Proc. IEEE), 2020
Cong Fang, Hanze Dong, Tong Zhang
340 · 26 · 0 · 27 Dec 2020

Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime
Andrea Agazzi, Jianfeng Lu
259 · 19 · 0 · 22 Oct 2020

A Dynamical Central Limit Theorem for Shallow Neural Networks
Zhengdao Chen, Grant M. Rotskoff, Joan Bruna, Eric Vanden-Eijnden
364 · 30 · 0 · 21 Aug 2020

A Note on the Global Convergence of Multilayer Neural Networks in the Mean Field Regime
H. Pham, Phan-Minh Nguyen
MLT, AI4CE
173 · 4 · 0 · 16 Jun 2020