ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2572
  4. Cited By
Identifying and attacking the saddle point problem in high-dimensional
  non-convex optimization

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

Neural Information Processing Systems (NeurIPS), 2014
10 June 2014
Yann N. Dauphin
Razvan Pascanu
Çağlar Gülçehre
Dong Wang
Surya Ganguli
Yoshua Bengio
    ODL
ArXiv (abs)PDFHTML

Papers citing "Identifying and attacking the saddle point problem in high-dimensional non-convex optimization"

50 / 632 papers shown
On the loss landscape of a class of deep neural networks with no bad
  local valleys
On the loss landscape of a class of deep neural networks with no bad local valleysInternational Conference on Learning Representations (ICLR), 2018
Quynh N. Nguyen
Mahesh Chandra Mukkamala
Matthias Hein
370
89
0
27 Sep 2018
Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview
Nonconvex Optimization Meets Low-Rank Matrix Factorization: An OverviewIEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2018
Yuejie Chi
Yue M. Lu
Yuxin Chen
424
459
0
25 Sep 2018
Second-order Guarantees of Distributed Gradient Algorithms
Second-order Guarantees of Distributed Gradient Algorithms
Amir Daneshmand
G. Scutari
Vyacheslav Kungurtsev
444
63
0
23 Sep 2018
A Deep Learning Framework for Unsupervised Affine and Deformable Image
  Registration
A Deep Learning Framework for Unsupervised Affine and Deformable Image Registration
B. D. de Vos
F. Berendsen
M. Viergever
Hessam Sokooti
Marius Staring
Ivana Isgum
MedIm
222
743
0
17 Sep 2018
Hubless keypoint-based 3D deformable groupwise registration
Hubless keypoint-based 3D deformable groupwise registration
Rémi Agier
S. Valette
R. Kéchichian
L. Fanton
R. Prost
237
11
0
11 Sep 2018
Evaluation of Neural Networks for Image Recognition Applications:
  Designing a 0-1 MILP Model of a CNN to create adversarials
Evaluation of Neural Networks for Image Recognition Applications: Designing a 0-1 MILP Model of a CNN to create adversarials
Lucas Schelkes
HAI
99
1
0
01 Sep 2018
Identifying Implementation Bugs in Machine Learning based Image
  Classifiers using Metamorphic Testing
Identifying Implementation Bugs in Machine Learning based Image Classifiers using Metamorphic Testing
Anurag Dwarakanath
Manish Ahuja
Samarth Sikand
Raghotham M. Rao
Jagadeesh Chandra J. C. Bose
Neville Dubash
Sanjay Podder
VLM
99
188
0
16 Aug 2018
Backtracking gradient descent method for general $C^1$ functions, with
  applications to Deep Learning
Backtracking gradient descent method for general C1C^1C1 functions, with applications to Deep Learning
T. Truong
T. H. Nguyen
167
10
0
15 Aug 2018
On the Analysis of Trajectories of Gradient Descent in the Optimization
  of Deep Neural Networks
On the Analysis of Trajectories of Gradient Descent in the Optimization of Deep Neural Networks
Adepu Ravi Sankar
Vishwak Srinivasan
V. Balasubramanian
67
1
0
21 Jul 2018
Uncertainty and Interpretability in Convolutional Neural Networks for
  Semantic Segmentation of Colorectal Polyps
Uncertainty and Interpretability in Convolutional Neural Networks for Semantic Segmentation of Colorectal PolypsInternational Workshop on Machine Learning for Signal Processing (MLSP), 2018
Kristoffer Wickstrøm
Michael C. Kampffmeyer
Robert Jenssen
UQCV
152
77
0
16 Jul 2018
On the Relation Between the Sharpest Directions of DNN Loss and the SGD
  Step Length
On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length
Stanislaw Jastrzebski
Zachary Kenton
Nicolas Ballas
Asja Fischer
Yoshua Bengio
Amos Storkey
ODL
659
128
0
13 Jul 2018
Geometric Generalization Based Zero-Shot Learning Dataset Infinite
  World: Simple Yet Powerful
Geometric Generalization Based Zero-Shot Learning Dataset Infinite World: Simple Yet Powerful
R. Chidambaram
Michael C. Kampffmeyer
Willie Neiswanger
Xiaodan Liang
T. Lachmann
Eric Xing
96
0
0
10 Jul 2018
Troubling Trends in Machine Learning Scholarship
Troubling Trends in Machine Learning ScholarshipQueue (ACM Queue), 2018
Zachary Chase Lipton
Jacob Steinhardt
232
323
0
09 Jul 2018
The Goldilocks zone: Towards better understanding of neural network loss
  landscapes
The Goldilocks zone: Towards better understanding of neural network loss landscapesAAAI Conference on Artificial Intelligence (AAAI), 2018
Stanislav Fort
Adam Scherlis
203
53
0
06 Jul 2018
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path
  Integrated Differential Estimator
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path Integrated Differential EstimatorNeural Information Processing Systems (NeurIPS), 2018
Cong Fang
C. J. Li
Zhouchen Lin
Tong Zhang
437
639
0
04 Jul 2018
Fuzzy Logic Interpretation of Quadratic Networks
Fuzzy Logic Interpretation of Quadratic Networks
Fenglei Fan
Ge Wang
207
7
0
04 Jul 2018
Trust-Region Algorithms for Training Responses: Machine Learning Methods
  Using Indefinite Hessian Approximations
Trust-Region Algorithms for Training Responses: Machine Learning Methods Using Indefinite Hessian Approximations
Jennifer B. Erway
J. Griffin
Roummel F. Marcia
Riadh Omheni
323
26
0
01 Jul 2018
Algorithms for solving optimization problems arising from deep neural
  net models: smooth problems
Algorithms for solving optimization problems arising from deep neural net models: smooth problems
Vyacheslav Kungurtsev
Tomás Pevný
121
6
0
30 Jun 2018
PCA of high dimensional random walks with comparison to neural network
  training
PCA of high dimensional random walks with comparison to neural network training
J. Antognini
Jascha Narain Sohl-Dickstein
OOD
110
28
0
22 Jun 2018
Finding Local Minima via Stochastic Nested Variance Reduction
Finding Local Minima via Stochastic Nested Variance Reduction
Dongruo Zhou
Pan Xu
Quanquan Gu
194
24
0
22 Jun 2018
Stochastic Nested Variance Reduction for Nonconvex Optimization
Stochastic Nested Variance Reduction for Nonconvex Optimization
Dongruo Zhou
Pan Xu
Quanquan Gu
253
158
0
20 Jun 2018
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Arthur Jacot
Franck Gabriel
Clément Hongler
2.8K
3,667
0
20 Jun 2018
Deep Global-Connected Net With The Generalized Multi-Piecewise ReLU
  Activation in Deep Learning
Deep Global-Connected Net With The Generalized Multi-Piecewise ReLU Activation in Deep Learning
Zhi Chen
P. Ho
79
3
0
19 Jun 2018
Spurious Local Minima of Deep ReLU Neural Networks in the Neural Tangent
  Kernel Regime
Spurious Local Minima of Deep ReLU Neural Networks in the Neural Tangent Kernel Regime
T. Nitta
122
0
0
13 Jun 2018
Full deep neural network training on a pruned weight budget
Full deep neural network training on a pruned weight budget
Maximilian Golub
G. Lemieux
Mieszko Lis
229
30
0
11 Jun 2018
Universal Statistics of Fisher Information in Deep Neural Networks: Mean
  Field Approach
Universal Statistics of Fisher Information in Deep Neural Networks: Mean Field Approach
Ryo Karakida
S. Akaho
S. Amari
FedML
527
162
0
04 Jun 2018
Interpreting Deep Learning: The Machine Learning Rorschach Test?
Interpreting Deep Learning: The Machine Learning Rorschach Test?
Adam S. Charles
AAMLHAIAI4CE
204
9
0
01 Jun 2018
Understanding Generalization and Optimization Performance of Deep CNNs
Understanding Generalization and Optimization Performance of Deep CNNs
Pan Zhou
Jiashi Feng
MLT
228
51
0
28 May 2018
Entropy and mutual information in models of deep neural networks
Entropy and mutual information in models of deep neural networks
Marylou Gabrié
Andre Manoel
Clément Luneau
Jean Barbier
N. Macris
Florent Krzakala
Lenka Zdeborová
224
194
0
24 May 2018
A Two-Stage Subspace Trust Region Approach for Deep Neural Network
  Training
A Two-Stage Subspace Trust Region Approach for Deep Neural Network Training
V. Dudar
Giovanni Chierchia
Émilie Chouzenoux
J. Pesquet
V. Semenov
64
5
0
23 May 2018
Mean Field Theory of Activation Functions in Deep Neural Networks
Mean Field Theory of Activation Functions in Deep Neural Networks
M. Milletarí
Thiparat Chotibut
P. E. Trevisanutto
126
4
0
22 May 2018
Universal discriminative quantum neural networks
Universal discriminative quantum neural networks
Hongxiang Chen
Leonard Wossnig
Simone Severini
Hartmut Neven
Masoud Mohseni
162
89
0
22 May 2018
Deep Learning with Cinematic Rendering: Fine-Tuning Deep Neural Networks
  Using Photorealistic Medical Images
Deep Learning with Cinematic Rendering: Fine-Tuning Deep Neural Networks Using Photorealistic Medical Images
Faisal Mahmood
Richard J. Chen
S. Sudarsky
Daphne Yu
Nicholas J. Durr
MedIm
162
48
0
22 May 2018
Small steps and giant leaps: Minimal Newton solvers for Deep Learning
Small steps and giant leaps: Minimal Newton solvers for Deep Learning
João F. Henriques
Sébastien Ehrhardt
Samuel Albanie
Andrea Vedaldi
ODL
154
23
0
21 May 2018
The global optimum of shallow neural network is attained by ridgelet
  transform
The global optimum of shallow neural network is attained by ridgelet transform
Sho Sonoda
Isao Ishikawa
Masahiro Ikeda
Kei Hagihara
Y. Sawano
Takuo Matsubara
Noboru Murata
167
1
0
19 May 2018
Interpolatron: Interpolation or Extrapolation Schemes to Accelerate
  Optimization for Deep Neural Networks
Interpolatron: Interpolation or Extrapolation Schemes to Accelerate Optimization for Deep Neural Networks
Guangzeng Xie
Yitan Wang
Shuchang Zhou
Zhihua Zhang
72
3
0
17 May 2018
Local Saddle Point Optimization: A Curvature Exploitation Approach
Local Saddle Point Optimization: A Curvature Exploitation Approach
Leonard Adolphs
Hadi Daneshmand
Aurelien Lucchi
Thomas Hofmann
321
112
0
15 May 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Measuring the Intrinsic Dimension of Objective LandscapesInternational Conference on Learning Representations (ICLR), 2018
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
305
479
0
24 Apr 2018
On Gradient-Based Learning in Continuous Games
On Gradient-Based Learning in Continuous Games
Eric Mazumdar
Lillian J. Ratliff
S. Shankar Sastry
299
150
0
16 Apr 2018
The Loss Surface of XOR Artificial Neural Networks
The Loss Surface of XOR Artificial Neural Networks
D. Mehta
Xiaojun Zhao
Edgar A. Bernal
D. Wales
251
19
0
06 Apr 2018
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep
  Learning Models
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models
B. Rouhani
Huili Chen
F. Koushanfar
268
48
0
02 Apr 2018
A Survey on Deep Learning Methods for Robot Vision
A Survey on Deep Learning Methods for Robot Vision
Javier Ruiz-del-Solar
P. Loncomilla
Naiomi Soto
168
64
0
28 Mar 2018
Task Agnostic Continual Learning Using Online Variational Bayes
Task Agnostic Continual Learning Using Online Variational Bayes
Chen Zeno
Itay Golan
Elad Hoffer
Daniel Soudry
CLLFedMLBDL
353
124
0
27 Mar 2018
Information Theoretic Interpretation of Deep learning
Information Theoretic Interpretation of Deep learning
Tianchen Zhao
FAtt
176
2
0
21 Mar 2018
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
Comparing Dynamics: Deep Neural Networks versus Glassy Systems
Carlo Albert
Levent Sagun
Mario Geiger
S. Spigler
Gerard Ben Arous
C. Cammarota
Yann LeCun
Matthieu Wyart
Giulio Biroli
AI4CE
317
124
0
19 Mar 2018
Replica Symmetry Breaking in Bipartite Spin Glasses and Neural Networks
Replica Symmetry Breaking in Bipartite Spin Glasses and Neural Networks
Gavin Hartnett
Edward Parker
Edward Geist
306
24
0
17 Mar 2018
Escaping Saddles with Stochastic Gradients
Escaping Saddles with Stochastic GradientsInternational Conference on Machine Learning (ICML), 2018
Hadi Daneshmand
Jonas Köhler
Aurelien Lucchi
Thomas Hofmann
132
169
0
15 Mar 2018
GossipGraD: Scalable Deep Learning using Gossip Communication based
  Asynchronous Gradient Descent
GossipGraD: Scalable Deep Learning using Gossip Communication based Asynchronous Gradient Descent
J. Daily
Abhinav Vishnu
Charles Siegel
T. Warfel
Vinay C. Amatya
153
99
0
15 Mar 2018
Accelerating Natural Gradient with Higher-Order Invariance
Accelerating Natural Gradient with Higher-Order Invariance
Yang Song
Jiaming Song
Stefano Ermon
189
26
0
04 Mar 2018
Essentially No Barriers in Neural Network Energy Landscape
Essentially No Barriers in Neural Network Energy LandscapeInternational Conference on Machine Learning (ICML), 2018
Felix Dräxler
K. Veschgini
M. Salmhofer
Fred Hamprecht
MoMe
471
484
0
02 Mar 2018
Previous
123...1011121389
Next