Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

Neural Information Processing Systems (NeurIPS), 2014

10 June 2014

Papers citing "Identifying and attacking the saddle point problem in high-dimensional non-convex optimization"

50 / 632 papers shown

On the loss landscape of a class of deep neural networks with no bad local valleysInternational Conference on Learning Representations (ICLR), 2018

Quynh N. Nguyen

Mahesh Chandra Mukkamala

Matthias Hein

370

27 Sep 2018

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An OverviewIEEE Transactions on Signal Processing (IEEE Trans. Signal Process.), 2018

Yuejie Chi

Yue M. Lu

Yuxin Chen

424

459

25 Sep 2018

Second-order Guarantees of Distributed Gradient Algorithms

Amir Daneshmand

G. Scutari

Vyacheslav Kungurtsev

444

23 Sep 2018

A Deep Learning Framework for Unsupervised Affine and Deformable Image Registration

Ivana Isgum

222

743

17 Sep 2018

Hubless keypoint-based 3D deformable groupwise registration

237

11 Sep 2018

Evaluation of Neural Networks for Image Recognition Applications: Designing a 0-1 MILP Model of a CNN to create adversarials

Lucas Schelkes

HAI

01 Sep 2018

Identifying Implementation Bugs in Machine Learning based Image Classifiers using Metamorphic Testing

Jagadeesh Chandra J. C. Bose

Neville Dubash

Sanjay Podder

VLM

188

16 Aug 2018

Backtracking gradient descent method for general

C^1

functions, with applications to Deep Learning

T. Truong

T. H. Nguyen

167

15 Aug 2018

On the Analysis of Trajectories of Gradient Descent in the Optimization of Deep Neural Networks

Adepu Ravi Sankar

Vishwak Srinivasan

V. Balasubramanian

21 Jul 2018

Uncertainty and Interpretability in Convolutional Neural Networks for Semantic Segmentation of Colorectal PolypsInternational Workshop on Machine Learning for Signal Processing (MLSP), 2018

Kristoffer Wickstrøm

Michael C. Kampffmeyer

Robert Jenssen

UQCV

152

16 Jul 2018

On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length

Stanislaw Jastrzebski

Amos Storkey

659

128

13 Jul 2018

Geometric Generalization Based Zero-Shot Learning Dataset Infinite World: Simple Yet Powerful

R. Chidambaram

Michael C. Kampffmeyer

Willie Neiswanger

Xiaodan Liang

T. Lachmann

Eric Xing

10 Jul 2018

Troubling Trends in Machine Learning ScholarshipQueue (ACM Queue), 2018

Zachary Chase Lipton

Jacob Steinhardt

232

323

09 Jul 2018

The Goldilocks zone: Towards better understanding of neural network loss landscapesAAAI Conference on Artificial Intelligence (AAAI), 2018

Stanislav Fort

Adam Scherlis

203

06 Jul 2018

SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path Integrated Differential EstimatorNeural Information Processing Systems (NeurIPS), 2018

Cong Fang

C. J. Li

Zhouchen Lin

Tong Zhang

437

639

04 Jul 2018

Fuzzy Logic Interpretation of Quadratic Networks

Fenglei Fan

Ge Wang

207

04 Jul 2018

Trust-Region Algorithms for Training Responses: Machine Learning Methods Using Indefinite Hessian Approximations

323

01 Jul 2018

Algorithms for solving optimization problems arising from deep neural net models: smooth problems

Vyacheslav Kungurtsev

Tomás Pevný

121

30 Jun 2018

PCA of high dimensional random walks with comparison to neural network training

J. Antognini

Jascha Narain Sohl-Dickstein

OOD

110

22 Jun 2018

Finding Local Minima via Stochastic Nested Variance Reduction

Dongruo Zhou

Pan Xu

Quanquan Gu

194

22 Jun 2018

Stochastic Nested Variance Reduction for Nonconvex Optimization

Dongruo Zhou

Pan Xu

Quanquan Gu

253

158

20 Jun 2018

Neural Tangent Kernel: Convergence and Generalization in Neural Networks

Arthur Jacot

Franck Gabriel

Clément Hongler

2.8K

3,667

20 Jun 2018

Deep Global-Connected Net With The Generalized Multi-Piecewise ReLU Activation in Deep Learning

Zhi Chen

P. Ho

19 Jun 2018

Spurious Local Minima of Deep ReLU Neural Networks in the Neural Tangent Kernel Regime

T. Nitta

122

13 Jun 2018

Full deep neural network training on a pruned weight budget

Maximilian Golub

G. Lemieux

Mieszko Lis

229

11 Jun 2018

Universal Statistics of Fisher Information in Deep Neural Networks: Mean Field Approach

527

162

04 Jun 2018

Interpreting Deep Learning: The Machine Learning Rorschach Test?

Adam S. Charles

AAML HAI AI4CE

204

01 Jun 2018

Understanding Generalization and Optimization Performance of Deep CNNs

Pan Zhou

Jiashi Feng

MLT

228

28 May 2018

Entropy and mutual information in models of deep neural networks

Lenka Zdeborová

224

194

24 May 2018

A Two-Stage Subspace Trust Region Approach for Deep Neural Network Training

23 May 2018

Mean Field Theory of Activation Functions in Deep Neural Networks

M. Milletarí

Thiparat Chotibut

P. E. Trevisanutto

126

22 May 2018

Universal discriminative quantum neural networks

Hongxiang Chen

Leonard Wossnig

Simone Severini

Hartmut Neven

Masoud Mohseni

162

22 May 2018

Deep Learning with Cinematic Rendering: Fine-Tuning Deep Neural Networks Using Photorealistic Medical Images

162

22 May 2018

Small steps and giant leaps: Minimal Newton solvers for Deep Learning

João F. Henriques

Sébastien Ehrhardt

Samuel Albanie

Andrea Vedaldi

ODL

154

21 May 2018

The global optimum of shallow neural network is attained by ridgelet transform

167

19 May 2018

Interpolatron: Interpolation or Extrapolation Schemes to Accelerate Optimization for Deep Neural Networks

Guangzeng Xie

Yitan Wang

Shuchang Zhou

Zhihua Zhang

17 May 2018

Local Saddle Point Optimization: A Curvature Exploitation Approach

321

112

15 May 2018

Measuring the Intrinsic Dimension of Objective LandscapesInternational Conference on Learning Representations (ICLR), 2018

305

479

24 Apr 2018

On Gradient-Based Learning in Continuous Games

Eric Mazumdar

Lillian J. Ratliff

S. Shankar Sastry

299

150

16 Apr 2018

The Loss Surface of XOR Artificial Neural Networks

251

06 Apr 2018

DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models

B. Rouhani

Huili Chen

F. Koushanfar

268

02 Apr 2018

A Survey on Deep Learning Methods for Robot Vision

Javier Ruiz-del-Solar

P. Loncomilla

Naiomi Soto

168

28 Mar 2018

Task Agnostic Continual Learning Using Online Variational Bayes

353

124

27 Mar 2018

Information Theoretic Interpretation of Deep learning

Tianchen Zhao

FAtt

176

21 Mar 2018

Comparing Dynamics: Deep Neural Networks versus Glassy Systems

317

124

19 Mar 2018

Replica Symmetry Breaking in Bipartite Spin Glasses and Neural Networks

Gavin Hartnett

Edward Parker

Edward Geist

306

17 Mar 2018

Escaping Saddles with Stochastic GradientsInternational Conference on Machine Learning (ICML), 2018

132

169

15 Mar 2018

GossipGraD: Scalable Deep Learning using Gossip Communication based Asynchronous Gradient Descent

153

15 Mar 2018

Accelerating Natural Gradient with Higher-Order Invariance

Yang Song

Jiaming Song

Stefano Ermon

189

04 Mar 2018

Essentially No Barriers in Neural Network Energy LandscapeInternational Conference on Machine Learning (ICML), 2018

471

484

02 Mar 2018