L4: Practical loss-based stepsize adaptation for deep learning

14 February 2018
Michal Rolínek
Georg Martius
    ODL
arXiv:1802.05074

Papers citing "L4: Practical loss-based stepsize adaptation for deep learning"

30 / 30 papers shown
An Adaptive Stochastic Gradient Method with Non-negative Gauss-Newton Stepsizes
Antonio Orvieto
Lin Xiao
05 Jul 2024

Stochastic Polyak Step-sizes and Momentum: Convergence Guarantees and Practical Performance
Dimitris Oikonomou
Nicolas Loizou
06 Jun 2024

Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker Conditions
Neural Information Processing Systems (NeurIPS), 2023
S. Choudhury
Eduard A. Gorbunov
Nicolas Loizou
27 Feb 2023

QLABGrad: a Hyperparameter-Free and Convergence-Guaranteed Scheme for Deep Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Minghan Fu
Fang-Xiang Wu
ODL
01 Feb 2023

Making SGD Parameter-Free
Annual Conference Computational Learning Theory (COLT), 2022
Y. Carmon
Oliver Hinder
04 May 2022

Amortized Proximal Optimization
Neural Information Processing Systems (NeurIPS), 2022
Juhan Bae
Paul Vicol
Jeff Z. HaoChen
Roger C. Grosse
ODL
28 Feb 2022

A Stochastic Bundle Method for Interpolating Networks
Alasdair Paren
Leonard Berrada
Rudra P. K. Poudel
M. P. Kumar
29 Jan 2022

Stochastic Mirror Descent: Convergence Analysis and Adaptive Variants via the Mirror Stochastic Polyak Stepsize
Ryan D'Orazio
Nicolas Loizou
I. Laradji
Ioannis Mitliagkas
28 Oct 2021

Using a one dimensional parabolic model of the full-batch loss to estimate learning rates during training
Max Mutschler
Kevin Laube
A. Zell
ODL
31 Aug 2021

KOALA: A Kalman Optimization Algorithm with Loss Adaptivity
A. Davtyan
Sepehr Sameni
L. Cerkezi
Givi Meishvili
Adam Bielski
Paolo Favaro
ODL
07 Jul 2021

LRTuner: A Learning Rate Tuner for Deep Neural Networks
Nikhil Iyer
V. Thejas
Nipun Kwatra
Ramachandran Ramjee
Muthian Sivathanu
ODL
30 May 2021

Empirically explaining SGD from a line search perspective
International Conference on Artificial Neural Networks (ICANN), 2021
Max Mutschler
A. Zell
ODL, LRM
31 Mar 2021

How to decay your learning rate
Aitor Lewkowycz
23 Mar 2021

A Probabilistically Motivated Learning Rate Adaptation for Stochastic Optimization
Filip de Roos
Carl Jidling
A. Wills
Thomas B. Schon
Philipp Hennig
22 Feb 2021

Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering
Ricky T. Q. Chen
Dami Choi
Lukas Balles
David Duvenaud
Philipp Hennig
ODL
09 Nov 2020

A straightforward line search approach on the expected empirical loss for stochastic deep learning problems
Max Mutschler
A. Zell
02 Oct 2020

Adaptive Hierarchical Hyper-gradient Descent
Renlong Jie
Junbin Gao
A. Vasnev
Minh-Ngoc Tran
17 Aug 2020

MLR-SNet: Transferable LR Schedules for Heterogeneous Tasks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Jun Shu
Yanwen Zhu
Qian Zhao
Zongben Xu
Deyu Meng
29 Jul 2020

Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Robin M. Schmidt
Frank Schneider
Philipp Hennig
ODL
03 Jul 2020

SGD for Structured Nonconvex Functions: Learning Rates, Minibatching and Interpolation
Robert Mansel Gower
Othmane Sebbouh
Nicolas Loizou
18 Jun 2020

AdaS: Adaptive Scheduling of Stochastic Gradients
Mahdi S. Hosseini
Konstantinos N. Plataniotis
ODL
11 Jun 2020

Generalized Reinforcement Meta Learning for Few-Shot Optimization
R. Anantha
S. Pulman
Srinivas Chappidi
OffRL
04 May 2020

Stochastic Polyak Step-size for SGD: An Adaptive Learning Rate for Fast Convergence
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Nicolas Loizou
Sharan Vaswani
I. Laradji
Damien Scieur
24 Feb 2020

Training Neural Networks for and by Interpolation
International Conference on Machine Learning (ICML), 2019
Leonard Berrada
Andrew Zisserman
M. P. Kumar
3DH
13 Jun 2019

Painless Stochastic Gradient: Interpolation, Line-Search, and Convergence Rates
Neural Information Processing Systems (NeurIPS), 2019
Sharan Vaswani
Aaron Mishkin
I. Laradji
Mark Schmidt
Gauthier Gidel
Damien Scieur
ODL
24 May 2019

Parabolic Approximation Line Search for DNNs
Max Mutschler
A. Zell
ODL
28 Mar 2019

DeepOBS: A Deep Learning Optimizer Benchmark Suite
Frank Schneider
Lukas Balles
Philipp Hennig
ODL
13 Mar 2019

LOSSGRAD: automatic learning rate in gradient descent
B. Wójcik
Lukasz Maziarka
Jacek Tabor
ODL
20 Feb 2019

Collaborative Sampling in Generative Adversarial Networks
Yuejiang Liu
Parth Kothari
Alexandre Alahi
TTA
02 Feb 2019

Step Size Matters in Deep Learning
Kamil Nar
S. Shankar Sastry
22 May 2018