Local minima in training of neural networks

19 November 2016

G. Swirszcz

Wojciech M. Czarnecki

Razvan Pascanu

ODL

Papers citing "Local minima in training of neural networks"

42 / 42 papers shown

Fine-Tuning Hybrid Physics-Informed Neural Networks for Vehicle Dynamics Model Estimation

IFAC-PapersOnLine, 2024

Shiming Fang

Kaiyan Yu

134

29 Sep 2024

Algebraic Complexity and Neurovariety of Linear Convolutional Networks

Vahid Shahverdi

352

29 Jan 2024

Black holes and the loss landscape in machine learning

Journal of High Energy Physics (JHEP), 2023

P. Kumar

Taniya Mandal

Swapnamay Mondal

221

26 Jun 2023

Probabilistic Solar Proxy Forecasting with Neural Network Ensembles

Joshua D. Daniell

P. Mehta

03 Jun 2023

On the existence of optimal shallow feedforward networks with ReLU activation

Steffen Dereich

Sebastian Kassing

243

06 Mar 2023

On the existence of minimizers in shallow residual ReLU neural network optimization landscapes

SIAM Journal on Numerical Analysis (SINUM), 2023

Steffen Dereich

Arnulf Jentzen

Sebastian Kassing

309

28 Feb 2023

Special Properties of Gradient Descent with Large Learning Rates

International Conference on Machine Learning (ICML), 2022

Amirkeivan Mohtashami

Martin Jaggi

Sebastian U. Stich

MLT

367

30 May 2022

Overparameterization Improves StyleGAN Inversion

Yohan Poirier-Ginter

Alexandre Lessard

Ryan Smith

Jean-François Lalonde

182

12 May 2022

On the Omnipresence of Spurious Local Minima in Certain Neural Network Training Problems

Constructive Approximation (Constr. Approx.), 2022

C. Christof

Julia Kowalczyk

338

23 Feb 2022

Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks

Shaun Li

AI4CE

237

03 Jan 2022

Dynamic Neural Diversification: Path to Computationally Sustainable Neural Networks

Alexander Kovalenko

Pavel Kordík

Magda Friedjungová

142

20 Sep 2021

Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021

Bo Liu

147

25 Feb 2021

Understanding Implicit Regularization in Over-Parameterized Single Index Model

Journal of the American Statistical Association (JASA), 2020

Jianqing Fan

Zhuoran Yang

Mengxin Yu

327

16 Jul 2020

A Generative Neural Network Framework for Automated Software Testing

Leonid Joffe

David J. Clark

177

29 Jun 2020

AutoOD: Automated Outlier Detection via Curiosity-guided Search and Self-imitation Learning

Daochen Zha

210

19 Jun 2020

Intra-Processing Methods for Debiasing Neural Networks

Yash Savani

Colin White

G. NaveenSundar

247

15 Jun 2020

Non-convergence of stochastic gradient descent in the training of deep neural networks

Journal of Complexity (J. Complexity), 2020

Patrick Cheridito

Arnulf Jentzen

Florian Rossmannek

252

12 Jun 2020

Symmetry & critical points for a model shallow neural network

Yossi Arjevani

M. Field

580

23 Mar 2020

Some Geometrical and Topological Properties of DNNs' Decision Boundaries

Bo Liu

Mengya Shen

AAML

246

07 Mar 2020

Lane-Merging Using Policy-based Reinforcement Learning and Post-Optimization

International Conference on Intelligent Transportation Systems (ITSC), 2019

131

06 Mar 2020

Truth or Backpropaganda? An Empirical Investigation of Deep Learning Theory

International Conference on Learning Representations (ICLR), 2019

547

01 Oct 2019

Are deep ResNets provably better than linear predictors?

Neural Information Processing Systems (NeurIPS), 2019

Chulhee Yun

S. Sra

Ali Jadbabaie

437

09 Jul 2019

Visualising Basins of Attraction for the Cross-Entropy and the Squared Error Neural Network Loss Functions

Anna Sergeevna Bosman

A. Engelbrecht

Mardé Helbig

205

08 Jan 2019

Non-attracting Regions of Local Minima in Deep and Wide Neural Networks

Henning Petzka

C. Sminchisescu

281

16 Dec 2018

Intrinsic Geometric Vulnerability of High-Dimensional Artificial Intelligence

Luca Bortolussi

G. Sanguinetti

AAML

259

08 Nov 2018

The loss surface of deep linear networks viewed through the algebraic geometry lens

262

17 Oct 2018

Backtracking gradient descent method for general C^1 functions, with applications to Deep Learning

T. Truong

T. H. Nguyen

182

15 Aug 2018

Learning Dynamics of Linear Denoising Autoencoders

207

14 Jun 2018

Hierarchical clustering with deep Q-learning

Richard Forster

A. Fulop

148

28 May 2018

The Loss Surface of XOR Artificial Neural Networks

341

06 Apr 2018

Spurious Valleys in Two-layer Neural Network Optimization Landscapes

Luca Venturi

Afonso S. Bandeira

Joan Bruna

388

18 Feb 2018

Small nonlinearities in activation functions create bad local minima in neural networks

391

10 Feb 2018

Critical Percolation as a Framework to Analyze the Training of Deep Networks

Zohar Ringel

Rodrigo Andrade de Bem

136

06 Feb 2018

Visualizing the Loss Landscape of Neural Nets

Neural Information Processing Systems (NeurIPS), 2017

Hao Li

723

2,212

28 Dec 2017

Spurious Local Minima are Common in Two-Layer ReLU Neural Networks

International Conference on Machine Learning (ICML), 2017

Itay Safran

Ohad Shamir

525

280

24 Dec 2017

Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study

Peng Xu

Farbod Roosta-Khorasani

Michael W. Mahoney

ODL

230

160

25 Aug 2017

Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information

Peng Xu

Farbod Roosta-Khorasani

Michael W. Mahoney

601

221

23 Aug 2017

Cosmological model discrimination with Deep Learning

207

17 Jul 2017

Deep Semi-Random Features for Nonlinear Function Approximation

570

28 Feb 2017

Depth Creates No Bad Local Minima

Haihao Lu

Kenji Kawaguchi

ODL FAtt

342

125

27 Feb 2017

Exponentially vanishing sub-optimal local minima in multilayer neural networks

International Conference on Learning Representations (ICLR), 2017

Daniel Soudry

Elad Hoffer

430

100

19 Feb 2017

Topology and Geometry of Half-Rectified Network Optimization

C. Freeman

Joan Bruna

824

243

04 Nov 2016