Equilibrated adaptive learning rates for non-convex optimization

15 February 2015

Papers citing "Equilibrated adaptive learning rates for non-convex optimization"

9 / 9 papers shown

Title
SGD for Structured Nonconvex Functions: Learning Rates, Minibatching and Interpolation Robert Mansel Gower Othmane Sebbouh Nicolas Loizou 73 75 0 18 Jun 2020
Train longer, generalize better: closing the generalization gap in large batch training of neural networks Elad Hoffer Itay Hubara Daniel Soudry ODL 146 799 0 24 May 2017
Identifying and attacking the saddle point problem in high-dimensional non-convex optimization Yann N. Dauphin Razvan Pascanu Çağlar Gülçehre Kyunghyun Cho Surya Ganguli Yoshua Bengio ODL 111 1,380 0 10 Jun 2014
Unit Tests for Stochastic Optimization Tom Schaul Ioannis Antonoglou David Silver 48 91 0 20 Dec 2013
Revisiting Natural Gradient for Deep Networks Razvan Pascanu Yoshua Bengio ODL 104 388 0 16 Jan 2013
ADADELTA: An Adaptive Learning Rate Method Matthew D. Zeiler ODL 115 6,619 0 22 Dec 2012
Theano: new features and speed improvements Frédéric Bastien Pascal Lamblin Razvan Pascanu James Bergstra Ian Goodfellow Arnaud Bergeron Nicolas Bouchard David Warde-Farley Yoshua Bengio 83 1,420 0 23 Nov 2012
Estimating the Hessian by Back-propagating Curvature James Martens Ilya Sutskever Kevin Swersky 73 80 0 27 Jun 2012
Krylov Subspace Descent for Deep Learning Oriol Vinyals Daniel Povey ODL 60 148 0 18 Nov 2011