ResearchTrend.AI

Spurious Valleys in Two-layer Neural Network Optimization Landscapes
arXiv:1802.06384 (v4, latest)
Luca Venturi, Afonso S. Bandeira, Joan Bruna
18 February 2018

Papers citing "Spurious Valleys in Two-layer Neural Network Optimization Landscapes"

50 / 51 papers shown
A topological description of loss surfaces based on Betti Numbers
Neural Networks (NN), 2024
Maria Sofia Bucarelli, Giuseppe Alessio D’Inverno, Monica Bianchini, F. Scarselli, Fabrizio Silvestri
08 Jan 2024

Minimum norm interpolation by perceptra: Explicit regularization and implicit bias
Neural Information Processing Systems (NeurIPS), 2023
Jiyoung Park, Ian Pelakh, Stephan Wojtowytsch
10 Nov 2023

A qualitative difference between gradient flows of convex functions in finite- and infinite-dimensional Hilbert spaces
Jonathan W. Siegel, Stephan Wojtowytsch
26 Oct 2023

NTK-SAP: Improving neural network pruning by aligning training dynamics
International Conference on Learning Representations (ICLR), 2023
Yite Wang, Dawei Li, Tian Ding
06 Apr 2023

When Expressivity Meets Trainability: Fewer than $n$ Neurons Can Work
Neural Information Processing Systems (NeurIPS), 2022
Jiawei Zhang, Yushun Zhang, Mingyi Hong, Tian Ding, Jianfeng Yao
21 Oct 2022

Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks
International Conference on Learning Representations (ICLR), 2022
Xiang Wang, Annie Wang, Mo Zhou, Rong Ge
03 Oct 2022

Deep Architecture Connectivity Matters for Its Convergence: A Fine-Grained Analysis
Neural Information Processing Systems (NeurIPS), 2022
Wuyang Chen, Wei-Ping Huang, Xinyu Gong, Boris Hanin, Zinan Lin
11 May 2022

Deep learning, stochastic gradient descent and diffusion maps
Journal of Computational Mathematics and Data Science (JCMDS), 2022
Carmina Fjellström, Kaj Nyström
04 Apr 2022

Global Convergence Analysis of Deep Linear Networks with A One-neuron Layer
Kun Chen, Dachao Lin, Zhihua Zhang
08 Jan 2022

Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
03 Jan 2022

What Happens after SGD Reaches Zero Loss? -- A Mathematical Framework
Zhiyuan Li, Tianhao Wang, Sanjeev Arora
13 Oct 2021

Exponentially Many Local Minima in Quantum Neural Networks
Xuchen You, Xiaodi Wu
06 Oct 2021

Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
James Lucas, Juhan Bae, Michael Ruogu Zhang, Stanislav Fort, R. Zemel, Roger C. Grosse
22 Apr 2021

Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Bo Liu
25 Feb 2021

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Tianyi Liu, Yan Li, S. Wei, Enlu Zhou, T. Zhao
24 Feb 2021

A Note on Connectivity of Sublevel Sets in Deep Learning
Quynh N. Nguyen
21 Jan 2021

A Convergence Theory Towards Practical Over-parameterized Deep Neural Networks
Asaf Noy, Yi Tian Xu, Y. Aflalo, Lihi Zelnik-Manor, Rong Jin
12 Jan 2021

Towards a Better Global Loss Landscape of GANs
Tian Ding, Tiantian Fang, Alex Schwing
10 Nov 2020

DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths
Yanwei Fu, Chen Liu, Donghao Li, Xinwei Sun, Jinshan Zeng, Xingtai Lv
04 Jul 2020

PDE constraints on smooth hierarchical functions computed by neural networks
Khashayar Filom, Konrad Paul Kording, Roozbeh Farhoodi
18 May 2020

The critical locus of overparameterized neural networks
Y. Cooper
08 May 2020

Compressive sensing with un-trained neural networks: Gradient descent finds the smoothest approximation
Reinhard Heckel, Mahdi Soltanolkotabi
07 May 2020

Some Geometrical and Topological Properties of DNNs' Decision Boundaries
Bo Liu, Mengya Shen
07 Mar 2020

Learning the mapping $\mathbf{x}\mapsto \sum_{i=1}^d x_i^2$: the cost of finding the needle in a haystack
Jiefu Zhang, Leonardo Zepeda-Núnez, Xingtai Lv, Lin Lin
24 Feb 2020

Understanding Global Loss Landscape of One-hidden-layer ReLU Networks, Part 1: Theory
Bo Liu
12 Feb 2020

Landscape Connectivity and Dropout Stability of SGD Solutions for Over-parameterized Neural Networks
International Conference on Machine Learning (ICML), 2019
Aleksandr Shevchenko, Marco Mondelli
20 Dec 2019

Optimization for deep learning: theory and algorithms
Tian Ding
19 Dec 2019

Stationary Points of Shallow Neural Networks with Quadratic Activation Function
D. Gamarnik, Eren C. Kizildag, Ilias Zadik
03 Dec 2019

Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations
Tian Ding, Dawei Li
04 Nov 2019

Denoising and Regularization via Exploiting the Structural Bias of Convolutional Generators
International Conference on Learning Representations (ICLR), 2019
Reinhard Heckel, Mahdi Soltanolkotabi
31 Oct 2019

Nearly Minimal Over-Parametrization of Shallow Neural Networks
Armin Eftekhari, Chaehwan Song, Volkan Cevher
09 Oct 2019

Pure and Spurious Critical Points: a Geometric Study of Linear Networks
International Conference on Learning Representations (ICLR), 2019
Matthew Trager, Kathlén Kohn, Joan Bruna
03 Oct 2019

Generating Accurate Pseudo-labels in Semi-Supervised Learning and Avoiding Overconfident Predictions via Hermite Polynomial Activations
Computer Vision and Pattern Recognition (CVPR), 2019
Vishnu Suresh Lokhande, Songwong Tasneeyapant, Abhay Venkatesh, Sathya Ravi, Vikas Singh
12 Sep 2019

Additive function approximation in the brain
K. Harris
05 Sep 2019

Gradient Dynamics of Shallow Univariate ReLU Networks
Neural Information Processing Systems (NeurIPS), 2019
Francis Williams, Matthew Trager, Claudio Silva, Daniele Panozzo, Denis Zorin, Joan Bruna
18 Jun 2019

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
Neural Information Processing Systems (NeurIPS), 2019
Rohith Kuditipudi, Xiang Wang, Holden Lee, Yi Zhang, Zhiyuan Li, Wei Hu, Sanjeev Arora, Rong Ge
14 Jun 2019

A mean-field limit for certain deep neural networks
Dyego Araújo, R. Oliveira, Daniel Yukimura
01 Jun 2019

On the Expressive Power of Deep Polynomial Neural Networks
Neural Information Processing Systems (NeurIPS), 2019
Joe Kileel, Matthew Trager, Joan Bruna
29 May 2019

Exploring Structural Sparsity of Deep Networks via Inverse Scale Spaces
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Yanwei Fu, Chen Liu, Donghao Li, Zuyuan Zhong, Xinwei Sun, Jinshan Zeng, Xingtai Lv
23 May 2019

The Landscape of the Planted Clique Problem: Dense subgraphs and the Overlap Gap Property
D. Gamarnik, Ilias Zadik
15 Apr 2019

Gradient Descent with Early Stopping is Provably Robust to Label Noise for Overparameterized Neural Networks
Mingchen Li, Mahdi Soltanolkotabi, Samet Oymak
27 Mar 2019

Towards moderate overparameterization: global convergence guarantees for training shallow neural networks
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2019
Samet Oymak, Mahdi Soltanolkotabi
12 Feb 2019

Mean Field Limit of the Learning Dynamics of Multilayer Neural Networks
Phan-Minh Nguyen
07 Feb 2019

Depth creates no more spurious local minima
Li Zhang
28 Jan 2019

On Connected Sublevel Sets in Deep Learning
Quynh N. Nguyen
22 Jan 2019

On the Benefit of Width for Neural Networks: Disappearance of Bad Basins
Dawei Li, Tian Ding
28 Dec 2018

Overparameterized Nonlinear Learning: Gradient Descent Takes the Shortest Path?
Samet Oymak, Mahdi Soltanolkotabi
25 Dec 2018

A jamming transition from under- to over-parametrization affects loss landscape and generalization
S. Spigler, Mario Geiger, Stéphane d'Ascoli, Levent Sagun, Giulio Biroli, Matthieu Wyart
22 Oct 2018

On the loss landscape of a class of deep neural networks with no bad local valleys
International Conference on Learning Representations (ICLR), 2018
Quynh N. Nguyen, Mahesh Chandra Mukkamala, Matthias Hein
27 Sep 2018

On the Global Convergence of Gradient Descent for Over-parameterized Models using Optimal Transport
Lénaïc Chizat, Francis R. Bach
24 May 2018

Page 1 of 2