Global optimality conditions for deep neural networks
arXiv:1707.02444, 8 July 2017
Chulhee Yun, S. Sra, Ali Jadbabaie

Papers citing "Global optimality conditions for deep neural networks"

50 of 79 citing papers shown (page 1 of 2)

Distributionally Robust Optimization via Diffusion Ambiguity Modeling
Jiaqi Wen, Jianyi Yang
26 Oct 2025

The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning
Milad Aghajohari, Kamran Chitsaz, Amirhossein Kazemnejad, Sarath Chandar, Alessandro Sordoni, Aaron Courville, Siva Reddy
08 Oct 2025

Backward Oversmoothing: why is it hard to train deep Graph Neural Networks?
Nicolas Keriven
22 May 2025

Exploring Loss Landscapes through the Lens of Spin Glass Theory
Hao Liao, Wei Zhang, Zhanyi Huang, Zexiao Long, Mingyang Zhou, Xiaoqun Wu, Rui Mao, Chi Ho Yeung
30 Jul 2024

Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion. International Conference on Learning Representations (ICLR), 2023
Alexandru Meterez, Amir Joudaki, Francesco Orabona, Alexander Immer, Gunnar Rätsch, Hadi Daneshmand
03 Oct 2023

Transferring Learning Trajectories of Neural Networks. International Conference on Learning Representations (ICLR), 2023
Daiki Chijiwa
23 May 2023

Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss. International Conference on Machine Learning (ICML), 2023
Pierre Bréchet, Katerina Papagiannouli, Jing An, Guido Montúfar
06 Mar 2023

A Dynamics Theory of Implicit Regularization in Deep Low-Rank Matrix Factorization
Jian-Peng Cao, Chao Qian, Yihui Huang, Dicheng Chen, Yuncheng Gao, Jiyang Dong, D. Guo, X. Qu
29 Dec 2022

Piecewise Linear Neural Networks and Deep Learning. Nature Reviews Methods Primers (NRMP), 2022
Qinghua Tao, Li Li, Xiaolin Huang, Xiangming Xi, Shuning Wang, Johan A. K. Suykens
18 Jun 2022

Parameter Convex Neural Networks
Jingcheng Zhou, Wei Wei, Xing Li, Bowen Pang, Zhiming Zheng
11 Jun 2022

Memorization-Dilation: Modeling Neural Collapse Under Label Noise
Duc Anh Nguyen, Ron Levie, Julian Lienen, Gitta Kutyniok, Eyke Hüllermeier
11 Jun 2022

Statistical Guarantees for Approximate Stationary Points of Shallow Neural Networks
Mahsa Taheri, Fang Xie, Johannes Lederer
09 May 2022

Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Devansh Bisla, Jing Wang, A. Choromańska
20 Jan 2022

Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
03 Jan 2022

On the Global Convergence of Gradient Descent for multi-layer ResNets in the mean-field regime
Zhiyan Ding, Shi Chen, Qin Li, S. Wright
06 Oct 2021

Convergence of gradient descent for learning linear neural networks. Advances in Continuous and Discrete Models (ACDM), 2021
Gabin Maxime Nguegnang, Holger Rauhut, Ulrich Terstiege
04 Aug 2021

The loss landscape of deep linear neural networks: a second-order analysis
El Mehdi Achour, François Malgouyres, Sébastien Gerchinovitz
28 Jul 2021

Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation
Martin Gottwald, Sven Gronauer, Hao Shen, Klaus Diepold
16 Jun 2021

Overparameterization of deep ResNet: zero loss and mean-field analysis. Journal of Machine Learning Research (JMLR), 2021
Zhiyan Ding, Shi Chen, Qin Li, S. Wright
30 May 2021

A Geometric Analysis of Neural Collapse with Unconstrained Features. Neural Information Processing Systems (NeurIPS), 2021
Zhihui Zhu, Tianyu Ding, Jinxin Zhou, Xiao Li, Chong You, Jeremias Sulam, Qing Qu
06 May 2021

Noether: The More Things Change, the More Stay the Same
Grzegorz Gluch, R. Urbanke
12 Apr 2021

Training Deep Neural Networks via Branch-and-Bound
Yuanwei Wu, Ziming Zhang, Guanghui Wang
05 Apr 2021

Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Bo Liu
25 Feb 2021

When Are Solutions Connected in Deep Networks? Neural Information Processing Systems (NeurIPS), 2021
Quynh N. Nguyen, Pierre Bréchet, Marco Mondelli
18 Feb 2021

The Landscape of Multi-Layer Linear Neural Network From the Perspective of Algebraic Geometry
Xiuyi Yang
30 Jan 2021

A Convergence Theory Towards Practical Over-parameterized Deep Neural Networks
Asaf Noy, Yi Tian Xu, Y. Aflalo, Lihi Zelnik-Manor, Rong Jin
12 Jan 2021

A Survey on Neural Network Interpretability. IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2020
Yu Zhang, Peter Tiño, A. Leonardis, Shengcai Liu
28 Dec 2020

A Modular Analysis of Provable Acceleration via Polyak's Momentum: Training a Wide ReLU Network and a Deep Linear Network. International Conference on Machine Learning (ICML), 2020
Jun-Kun Wang, Chi-Heng Lin, Jacob D. Abernethy
04 Oct 2020

From Symmetry to Geometry: Tractable Nonconvex Problems
Yuqian Zhang, Qing Qu, John N. Wright
14 Jul 2020

Ridge Regression with Over-Parametrized Two-Layer Networks Converge to Ridgelet Spectrum
Sho Sonoda, Isao Ishikawa, Masahiro Ikeda
07 Jul 2020

The Global Landscape of Neural Networks: An Overview
Tian Ding, Dawei Li, Shiyu Liang, R. Srikant
02 Jul 2020

Piecewise linear activations substantially shape the loss surfaces of neural networks. International Conference on Learning Representations (ICLR), 2020
Fengxiang He, Bohan Wang, Dacheng Tao
27 Mar 2020

Some Geometrical and Topological Properties of DNNs' Decision Boundaries
Bo Liu, Mengya Shen
07 Mar 2020

On the Global Convergence of Training Deep Linear ResNets. International Conference on Learning Representations (ICLR), 2020
Difan Zou, Philip M. Long, Quanquan Gu
02 Mar 2020

Understanding Global Loss Landscape of One-hidden-layer ReLU Networks, Part 1: Theory
Bo Liu
12 Feb 2020

Learning CHARME models with neural networks
José G. Gómez-García, M. Fadili, C. Chesneau
08 Feb 2020

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks. International Conference on Learning Representations (ICLR), 2020
Wei Hu, Lechao Xiao, Jeffrey Pennington
16 Jan 2020

Optimization for deep learning: theory and algorithms
Tian Ding
19 Dec 2019

How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks? International Conference on Learning Representations (ICLR), 2019
Zixiang Chen, Yuan Cao, Difan Zou, Quanquan Gu
27 Nov 2019

Bregman Proximal Framework for Deep Linear Neural Networks
Mahesh Chandra Mukkamala, Felix Westerkamp, Emanuel Laude, Zorah Lähner, Peter Ochs
08 Oct 2019

Pure and Spurious Critical Points: a Geometric Study of Linear Networks. International Conference on Learning Representations (ICLR), 2019
Matthew Trager, Kathlén Kohn, Joan Bruna
03 Oct 2019

Student Specialization in Deep ReLU Networks With Finite Width and Input Dimension
Yuandong Tian
30 Sep 2019

Distance Geometry and Data Science. TOP - An Official Journal of the Spanish Society of Statistics and Operations Research (TOP), 2019
Leo Liberti
18 Sep 2019

Neural Architecture Search by Estimation of Network Structure Distributions
A. Muravev, Jenni Raitoharju, Moncef Gabbouj
19 Aug 2019

Are deep ResNets provably better than linear predictors? Neural Information Processing Systems (NeurIPS), 2019
Chulhee Yun, S. Sra, Ali Jadbabaie
09 Jul 2019

Semi-Implicit Generative Model
Mingzhang Yin, Mingyuan Zhou
29 May 2019

Fine-grained Optimization of Deep Neural Networks. Neural Information Processing Systems (NeurIPS), 2019
Mete Ozay
22 May 2019

Orthogonal Deep Neural Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Kui Jia, Shuai Li, Yuxin Wen, Tongliang Liu, Dacheng Tao
15 May 2019

Generalization Error Bounds of Gradient Descent for Learning Over-parameterized Deep ReLU Networks
Yuan Cao, Quanquan Gu
04 Feb 2019

Depth creates no more spurious local minima
Li Zhang
28 Jan 2019