ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2572
  4. Cited By
Identifying and attacking the saddle point problem in high-dimensional
  non-convex optimization

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

Neural Information Processing Systems (NeurIPS), 2014
10 June 2014
Yann N. Dauphin
Razvan Pascanu
Çağlar Gülçehre
Dong Wang
Surya Ganguli
Yoshua Bengio
    ODL
ArXiv (abs)PDFHTML

Papers citing "Identifying and attacking the saddle point problem in high-dimensional non-convex optimization"

32 / 632 papers shown
Convergent Learning: Do different neural networks learn the same
  representations?
Convergent Learning: Do different neural networks learn the same representations?
Shouqing Yang
J. Yosinski
Jeff Clune
Hod Lipson
John E. Hopcroft
SSL
309
398
0
24 Nov 2015
On the energy landscape of deep networks
On the energy landscape of deep networks
Pratik Chaudhari
Stefano Soatto
ODL
321
27
0
20 Nov 2015
Online Batch Selection for Faster Training of Neural Networks
Online Batch Selection for Faster Training of Neural Networks
I. Loshchilov
Katharina Eggensperger
ODL
440
328
0
19 Nov 2015
On the interplay of network structure and gradient convergence in deep
  learning
On the interplay of network structure and gradient convergence in deep learning
V. Ithapu
Sathya Ravi
Vikas Singh
367
3
0
17 Nov 2015
On the Quality of the Initial Basin in Overspecified Neural Networks
On the Quality of the Initial Basin in Overspecified Neural Networks
Itay Safran
Ohad Shamir
227
129
0
13 Nov 2015
Reducing the Training Time of Neural Networks by Partitioning
Reducing the Training Time of Neural Networks by Partitioning
C. Miranda
F. J. Zuben
129
8
0
10 Nov 2015
When Are Nonconvex Problems Not Scary?
When Are Nonconvex Problems Not Scary?
Ju Sun
Qing Qu
John N. Wright
306
170
0
21 Oct 2015
$\ell_1$-regularized Neural Networks are Improperly Learnable in
  Polynomial Time
ℓ1\ell_1ℓ1​-regularized Neural Networks are Improperly Learnable in Polynomial Time
Yuchen Zhang
Jason D. Lee
Sai Li
292
103
0
13 Oct 2015
A Primer on Neural Network Models for Natural Language Processing
A Primer on Neural Network Models for Natural Language Processing
Yoav Goldberg
AI4CE
245
1,193
0
02 Oct 2015
What is Holding Back Convnets for Detection?
What is Holding Back Convnets for Detection?German Conference on Pattern Recognition (DAGM), 2015
Bojan Pepik
Rodrigo Benenson
Tobias Ritschel
Bernt Schiele
ObjD
224
65
0
12 Aug 2015
Integrated Inference and Learning of Neural Factors in Structural
  Support Vector Machines
Integrated Inference and Learning of Neural Factors in Structural Support Vector MachinesPattern Recognition (PR), 2015
Rein Houthooft
F. Turck
DRL
114
3
0
03 Aug 2015
Training Very Deep Networks
Training Very Deep NetworksNeural Information Processing Systems (NeurIPS), 2015
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
453
1,726
0
22 Jul 2015
Fairness Constraints: Mechanisms for Fair Classification
Fairness Constraints: Mechanisms for Fair Classification
Muhammad Bilal Zafar
Isabel Valera
Manuel Gomez Rodriguez
Krishna P. Gummadi
FaML
441
50
0
19 Jul 2015
The local convexity of solving systems of quadratic equations
The local convexity of solving systems of quadratic equations
Christopher D. White
Sujay Sanghavi
Rachel A. Ward
247
72
0
25 Jun 2015
Adaptive Normalized Risk-Averting Training For Deep Neural Networks
Adaptive Normalized Risk-Averting Training For Deep Neural NetworksAAAI Conference on Artificial Intelligence (AAAI), 2015
Zhiguang Wang
Tim Oates
J. Lo
226
7
0
08 Jun 2015
Cyclical Learning Rates for Training Neural Networks
Cyclical Learning Rates for Training Neural NetworksIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2015
L. Smith
ODL
740
2,791
0
03 Jun 2015
Learning with hidden variables
Learning with hidden variablesCurrent Opinion in Neurobiology (Curr Opin Neurobiol), 2015
Y. Roudi
Graham Taylor
135
16
0
01 Jun 2015
Saddle-free Hessian-free Optimization
Saddle-free Hessian-free Optimization
Martín Arjovsky
ODL
101
2
0
30 May 2015
A Critical Review of Recurrent Neural Networks for Sequence Learning
A Critical Review of Recurrent Neural Networks for Sequence Learning
Zachary Chase Lipton
John Berkowitz
Charles Elkan
675
2,529
0
29 May 2015
Deep Neural Networks with Random Gaussian Weights: A Universal
  Classification Strategy?
Deep Neural Networks with Random Gaussian Weights: A Universal Classification Strategy?
Raja Giryes
Guillermo Sapiro
A. Bronstein
564
189
0
30 Apr 2015
On Graduated Optimization for Stochastic Non-Convex Problems
On Graduated Optimization for Stochastic Non-Convex Problems
Elad Hazan
Kfir Y. Levy
Shai Shalev-Shwartz
206
121
0
12 Mar 2015
Escaping From Saddle Points --- Online Stochastic Gradient for Tensor
  Decomposition
Escaping From Saddle Points --- Online Stochastic Gradient for Tensor Decomposition
Rong Ge
Furong Huang
Chi Jin
Yang Yuan
343
1,101
0
06 Mar 2015
Equilibrated adaptive learning rates for non-convex optimization
Equilibrated adaptive learning rates for non-convex optimization
Yann N. Dauphin
H. D. Vries
Yoshua Bengio
ODL
204
377
0
15 Feb 2015
Explorations on high dimensional landscapes
Explorations on high dimensional landscapesInternational Conference on Learning Representations (ICLR), 2014
Levent Sagun
V. U. Güney
Gerard Ben Arous
Yann LeCun
255
65
0
20 Dec 2014
FitNets: Hints for Thin Deep Nets
FitNets: Hints for Thin Deep NetsInternational Conference on Learning Representations (ICLR), 2014
Adriana Romero
Nicolas Ballas
Samira Ebrahimi Kahou
Antoine Chassang
C. Gatta
Yoshua Bengio
FedML
1.3K
4,357
0
19 Dec 2014
Qualitatively characterizing neural network optimization problems
Qualitatively characterizing neural network optimization problemsInternational Conference on Learning Representations (ICLR), 2014
Ian Goodfellow
Oriol Vinyals
Andrew M. Saxe
ODL
510
550
0
19 Dec 2014
On the Stability of Deep Networks
On the Stability of Deep NetworksInternational Conference on Learning Representations (ICLR), 2014
Raja Giryes
Guillermo Sapiro
A. Bronstein
243
15
0
18 Dec 2014
Expanded Alternating Optimization of Nonconvex Functions with
  Applications to Matrix Factorization and Penalized Regression
Expanded Alternating Optimization of Nonconvex Functions with Applications to Matrix Factorization and Penalized Regression
W. James Murdoch
Mu Zhu
138
3
0
12 Dec 2014
New insights and perspectives on the natural gradient method
New insights and perspectives on the natural gradient methodJournal of machine learning research (JMLR), 2014
James Martens
ODL
1.2K
708
0
03 Dec 2014
The Loss Surfaces of Multilayer Networks
The Loss Surfaces of Multilayer NetworksInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2014
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
768
1,263
0
30 Nov 2014
Fast, Robust and Non-convex Subspace Recovery
Fast, Robust and Non-convex Subspace Recovery
Gilad Lerman
Tyler Maunu
251
83
0
24 Jun 2014
Knowledge Matters: Importance of Prior Information for Optimization
Knowledge Matters: Importance of Prior Information for OptimizationJournal of machine learning research (JMLR), 2013
Çağlar Gülçehre
Yoshua Bengio
325
170
0
17 Jan 2013
Previous
123...111213