1611.06310
Cited By
Local minima in training of neural networks
19 November 2016
G. Swirszcz
Wojciech M. Czarnecki
Razvan Pascanu
ODL
Papers citing "Local minima in training of neural networks"
42 / 42 papers shown
Fine-Tuning Hybrid Physics-Informed Neural Networks for Vehicle Dynamics Model Estimation
IFAC-PapersOnLine (IFAC-PapersOnLine), 2024
Shiming Fang
Kaiyan Yu
134
7
0
29 Sep 2024
Algebraic Complexity and Neurovariety of Linear Convolutional Networks
Vahid Shahverdi
352
9
0
29 Jan 2024
Black holes and the loss landscape in machine learning
Journal of High Energy Physics (JHEP), 2023
P. Kumar
Taniya Mandal
Swapnamay Mondal
221
2
0
26 Jun 2023
Probabilistic Solar Proxy Forecasting with Neural Network Ensembles
Joshua D. Daniell
P. Mehta
92
10
0
03 Jun 2023
On the existence of optimal shallow feedforward networks with ReLU activation
Steffen Dereich
Sebastian Kassing
243
5
0
06 Mar 2023
On the existence of minimizers in shallow residual ReLU neural network optimization landscapes
SIAM Journal on Numerical Analysis (SINUM), 2023
Steffen Dereich
Arnulf Jentzen
Sebastian Kassing
309
9
0
28 Feb 2023
Special Properties of Gradient Descent with Large Learning Rates
International Conference on Machine Learning (ICML), 2022
Amirkeivan Mohtashami
Martin Jaggi
Sebastian U. Stich
MLT
367
16
0
30 May 2022
Overparameterization Improves StyleGAN Inversion
Yohan Poirier-Ginter
Alexandre Lessard
Ryan Smith
Jean-François Lalonde
182
4
0
12 May 2022
On the Omnipresence of Spurious Local Minima in Certain Neural Network Training Problems
Constructive approximation (Constr. Approx.), 2022
C. Christof
Julia Kowalczyk
338
10
0
23 Feb 2022
Complexity from Adaptive-Symmetries Breaking: Global Minima in the Statistical Mechanics of Deep Neural Networks
Shaun Li
AI4CE
237
1
0
03 Jan 2022
Dynamic Neural Diversification: Path to Computationally Sustainable Neural Networks
Alexander Kovalenko
Pavel Kordík
Magda Friedjungová
142
1
0
20 Sep 2021
Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Bo Liu
147
11
0
25 Feb 2021
Understanding Implicit Regularization in Over-Parameterized Single Index Model
Journal of the American Statistical Association (JASA), 2020
Jianqing Fan
Zhuoran Yang
Mengxin Yu
327
23
0
16 Jul 2020
A Generative Neural Network Framework for Automated Software Testing
Leonid Joffe
David J. Clark
177
2
0
29 Jun 2020
AutoOD: Automated Outlier Detection via Curiosity-guided Search and Self-imitation Learning
Yuening Li
Zhengzhang Chen
Daochen Zha
Kaixiong Zhou
Haifeng Jin
Haifeng Chen
Helen Zhou
210
18
0
19 Jun 2020
Intra-Processing Methods for Debiasing Neural Networks
Yash Savani
Colin White
G. NaveenSundar
247
49
0
15 Jun 2020
Non-convergence of stochastic gradient descent in the training of deep neural networks
Journal of Complexity (J. Complexity), 2020
Patrick Cheridito
Arnulf Jentzen
Florian Rossmannek
252
39
0
12 Jun 2020
Symmetry & critical points for a model shallow neural network
Yossi Arjevani
M. Field
580
15
0
23 Mar 2020
Some Geometrical and Topological Properties of DNNs' Decision Boundaries
Bo Liu
Mengya Shen
AAML
246
3
0
07 Mar 2020
Lane-Merging Using Policy-based Reinforcement Learning and Post-Optimization
International Conference on Intelligent Transportation Systems (ITSC), 2019
Patrick Hart
Leonard Rychly
Alois Knoll
OffRL
131
17
0
06 Mar 2020
Truth or Backpropaganda? An Empirical Investigation of Deep Learning Theory
International Conference on Learning Representations (ICLR), 2019
Micah Goldblum
Jonas Geiping
Avi Schwarzschild
Michael Moeller
Tom Goldstein
547
36
0
01 Oct 2019
Are deep ResNets provably better than linear predictors?
Neural Information Processing Systems (NeurIPS), 2019
Chulhee Yun
S. Sra
Ali Jadbabaie
437
14
0
09 Jul 2019
Visualising Basins of Attraction for the Cross-Entropy and the Squared Error Neural Network Loss Functions
Anna Sergeevna Bosman
A. Engelbrecht
Mardé Helbig
205
83
0
08 Jan 2019
Non-attracting Regions of Local Minima in Deep and Wide Neural Networks
Henning Petzka
C. Sminchisescu
281
12
0
16 Dec 2018
Intrinsic Geometric Vulnerability of High-Dimensional Artificial Intelligence
Luca Bortolussi
G. Sanguinetti
AAML
259
4
0
08 Nov 2018
The loss surface of deep linear networks viewed through the algebraic geometry lens
D. Mehta
Tianran Chen
Tingting Tang
J. Hauenstein
ODL
262
35
0
17 Oct 2018
Backtracking gradient descent method for general C^1 functions, with applications to Deep Learning
T. Truong
T. H. Nguyen
182
10
0
15 Aug 2018
Learning Dynamics of Linear Denoising Autoencoders
Arnu Pretorius
Steve Kroon
Herman Kamper
AI4CE
207
28
0
14 Jun 2018
Hierarchical clustering with deep Q-learning
Richard Forster
A. Fulop
148
0
0
28 May 2018
The Loss Surface of XOR Artificial Neural Networks
D. Mehta
Xiaojun Zhao
Edgar A. Bernal
D. Wales
341
19
0
06 Apr 2018
Spurious Valleys in Two-layer Neural Network Optimization Landscapes
Luca Venturi
Afonso S. Bandeira
Joan Bruna
388
75
0
18 Feb 2018
Small nonlinearities in activation functions create bad local minima in neural networks
Chulhee Yun
S. Sra
Ali Jadbabaie
ODL
391
97
0
10 Feb 2018
Critical Percolation as a Framework to Analyze the Training of Deep Networks
Zohar Ringel
Rodrigo Andrade de Bem
136
3
0
06 Feb 2018
Visualizing the Loss Landscape of Neural Nets
Neural Information Processing Systems (NeurIPS), 2017
Hao Li
Zheng Xu
Gavin Taylor
Christoph Studer
Tom Goldstein
723
2,212
0
28 Dec 2017
Spurious Local Minima are Common in Two-Layer ReLU Neural Networks
International Conference on Machine Learning (ICML), 2017
Itay Safran
Ohad Shamir
525
280
0
24 Dec 2017
Second-Order Optimization for Non-Convex Machine Learning: An Empirical Study
Peng Xu
Farbod Roosta-Khorasani
Michael W. Mahoney
ODL
230
160
0
25 Aug 2017
Newton-Type Methods for Non-Convex Optimization Under Inexact Hessian Information
Peng Xu
Farbod Roosta-Khorasani
Michael W. Mahoney
601
221
0
23 Aug 2017
Cosmological model discrimination with Deep Learning
Jorit Schmelzle
Aurelien Lucchi
T. Kacprzak
A. Amara
R. Sgier
Alexandre Réfrégier
Thomas Hofmann
207
40
0
17 Jul 2017
Deep Semi-Random Features for Nonlinear Function Approximation
Kenji Kawaguchi
Bo Xie
Vikas Verma
Le Song
570
16
0
28 Feb 2017
Depth Creates No Bad Local Minima
Haihao Lu
Kenji Kawaguchi
ODL
FAtt
342
125
0
27 Feb 2017
Exponentially vanishing sub-optimal local minima in multilayer neural networks
International Conference on Learning Representations (ICLR), 2017
Daniel Soudry
Elad Hoffer
430
100
0
19 Feb 2017
Topology and Geometry of Half-Rectified Network Optimization
C. Freeman
Joan Bruna
824
243
0
04 Nov 2016