Less Regret via Online Conditioning

25 February 2010

Papers citing "Less Regret via Online Conditioning"

50 / 50 papers shown

Improved Convergence in Parameter-Agnostic Error Feedback through Momentum

18 Nov 2025

AdaGrad Meets Muon: Adaptive Stepsizes for Orthogonal Updates

Minxin Zhang

Yuxuan Liu

Hayden Schaeffer

191

03 Sep 2025

ASGO: Adaptive Structured Gradient Optimization

430

26 Mar 2025

Structured Preconditioners in Adaptive Optimization: A Unified Analysis

274

13 Mar 2025

Bandit and Delayed Feedback in Online Structured Prediction

327

26 Feb 2025

Efficiently Solving Discounted MDPs with Predictions on Transition Matrices

Lixing Lyu

Jiashuo Jiang

Wang Chi Cheung

275

24 Feb 2025

Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints

Alberto Maté

Mariella Dimiccoli

AI4TS

293

27 Dec 2024

Tilted Sharpness-Aware Minimization

Tian Li

Wanrong Zhu

J. Bilmes

313

30 Oct 2024

Large Batch Analysis for Adagrad Under Anisotropic Smoothness

Yuxing Liu

Boyao Wang

Tong Zhang

256

21 Jun 2024

Low-Resource Machine Translation through the Lens of Personalized Federated LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Chris Biemann

186

18 Jun 2024

An Equivalence Between Static and Dynamic Regret Minimization

Andrew Jacobsen

Francesco Orabona

254

03 Jun 2024

Directional Smoothness and Gradient Methods: Convergence and Adaptivity

423

06 Mar 2024

Revisiting Convergence of AdaGrad with Relaxed Assumptions

Yusu Hong

Junhong Lin

275

21 Feb 2024

AdAdaGrad: Adaptive Batch Size Schemes for Adaptive Gradient Methods

382

17 Feb 2024

AdaBatchGrad: Combining Adaptive Batch Size and Adaptive Step Size

Alexander Gasnikov

206

07 Feb 2024

On Convergence of Adam for Stochastic Optimization under Relaxed AssumptionsNeural Information Processing Systems (NeurIPS), 2024

Yusu Hong

Junhong Lin

405

06 Feb 2024

Parameter-Agnostic Optimization under Relaxed SmoothnessInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Florian Hübler

Junchi Yang

Xiang Li

Niao He

262

06 Nov 2023

High Probability Convergence of Adam Under Unbounded Gradients and Affine Variance Noise

Yusu Hong

Junhong Lin

212

03 Nov 2023

Improved Algorithms for Adversarial Bandits with Unbounded Losses

Mingyu Chen

Xuezhou Zhang

218

03 Oct 2023

Adaptive SGD with Polyak stepsize and Line-search: Robust Convergence and Variance ReductionNeural Information Processing Systems (NeurIPS), 2023

Xiao-Yan Jiang

Sebastian U. Stich

242

11 Aug 2023

Normalized Gradients for All

Francesco Orabona

206

10 Aug 2023

Online Inventory Problems: Beyond the i.i.d. Setting with Online Convex OptimizationNeural Information Processing Systems (NeurIPS), 2023

150

12 Jul 2023

Prodigy: An Expeditiously Adaptive Parameter-Free LearnerInternational Conference on Machine Learning (ICML), 2023

Konstantin Mishchenko

Aaron Defazio

ODL

420

106

09 Jun 2023

Generalized Implicit Follow-The-Regularized-LeaderInternational Conference on Machine Learning (ICML), 2023

Keyi Chen

Francesco Orabona

FedML

171

31 May 2023

Parameter-free projected gradient descent

Evgenii Chzhen

Christophe Giraud

Jean-Michel Poggi

224

31 May 2023

SignSVRG: fixing SignSGD via variance reduction

Evgenii Chzhen

S. Schechtman

235

22 May 2023

Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive MethodsNeural Information Processing Systems (NeurIPS), 2023

Junchi Yang

Xiang Li

Ilyas Fatkhullin

Niao He

218

21 May 2023

Beyond Uniform Smoothness: A Stopped Analysis of Adaptive SGDAnnual Conference Computational Learning Theory (COLT), 2023

Matthew Faw

Litu Rout

Constantine Caramanis

Sanjay Shakkottai

311

13 Feb 2023

$Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster $\text{L}$-/$\text{L}^\natural$-Convex Function Minimization$

Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster

\text{L}

\text{L}^\natural

-Convex Function MinimizationInternational Conference on Machine Learning (ICML), 2023

Shinsaku Sakaue

Taihei Oki

192

02 Feb 2023

Learning-Rate-Free Learning by D-AdaptationInternational Conference on Machine Learning (ICML), 2023

Aaron Defazio

Konstantin Mishchenko

476

107

18 Jan 2023

Multi-Agent Reinforcement Learning with Reward DelaysConference on Learning for Dynamics & Control (L4DC), 2022

Yuyang Zhang

Runyu Zhang

Yu Gu

Na Li

243

02 Dec 2022

Differentially Private Adaptive Optimization with Delayed PreconditionersInternational Conference on Learning Representations (ICLR), 2022

253

01 Dec 2022

Adaptive Stochastic Variance Reduction for Non-convex Finite-Sum MinimizationNeural Information Processing Systems (NeurIPS), 2022

257

03 Nov 2022

Nest Your Adaptive Algorithm for Parameter-Agnostic Nonconvex Minimax OptimizationNeural Information Processing Systems (NeurIPS), 2022

Junchi Yang

Xiang Li

Niao He

ODL

259

01 Jun 2022

Convergence of First-Order Methods for Constrained Nonconvex Optimization with Dependent DataInternational Conference on Machine Learning (ICML), 2022

Ahmet Alacaoglu

Hanbaek Lyu

267

29 Mar 2022

The Power of Adaptivity in SGD: Self-Tuning Step Sizes with Unbounded Gradients and Affine VarianceAnnual Conference Computational Learning Theory (COLT), 2022

Matthew Faw

Isidoros Tziotis

Constantine Caramanis

Aryan Mokhtari

Sanjay Shakkottai

Rachel A. Ward

229

11 Feb 2022

Online Learning to Transport via the Minimal Selection PrincipleAnnual Conference Computational Learning Theory (COLT), 2022

185

09 Feb 2022

Cooperative Online Learning in Stochastic and Adversarial MDPsInternational Conference on Machine Learning (ICML), 2022

Tal Lancewicki

Aviv A. Rosenberg

Yishay Mansour

290

31 Jan 2022

On the Convergence of mSGD and AdaGrad for Stochastic OptimizationInternational Conference on Learning Representations (ICLR), 2022

Ruinan Jin

Yu Xing

Xingkang He

124

26 Jan 2022

Stochastic Mirror Descent: Convergence Analysis and Adaptive Variants via the Mirror Stochastic Polyak Stepsize

Ryan DÓrazio

Nicolas Loizou

I. Laradji

Ioannis Mitliagkas

422

28 Oct 2021

On the Last Iterate Convergence of Momentum MethodsInternational Conference on Algorithmic Learning Theory (ALT), 2021

Xiaoyun Li

Mingrui Liu

Francesco Orabona

317

13 Feb 2021

Black-Box Reductions for Parameter-free Online Learning in Banach Spaces

Ashok Cutkosky

Francesco Orabona

249

164

17 Feb 2018

Training Deep Networks without Learning Rates Through Coin Betting

Francesco Orabona

Tatiana Tommasi

ODL

237

22 May 2017

Scale-Free Online Learning

Francesco Orabona

D. Pál

255

114

08 Jan 2016

Scale-Free Algorithms for Online Linear Optimization

Francesco Orabona

D. Pál

ODL

206

19 Feb 2015

Simultaneous Model Selection and Optimization through Parameter-free Stochastic LearningNeural Information Processing Systems (NeurIPS), 2014

Francesco Orabona

282

106

15 Jun 2014

A Survey of Algorithms and Analysis for Adaptive Online Learning

H. B. McMahan

FedML

262

14 Mar 2014

Large-Scale Learning with Less RAM via RandomizationInternational Conference on Machine Learning (ICML), 2013

101

19 Mar 2013

A Unified View of Regularized Dual Averaging and Mirror Descent with Implicit Updates

H. B. McMahan

249

16 Sep 2010

Adaptive Bound Optimization for Online Convex OptimizationAnnual Conference Computational Learning Theory (COLT), 2010

H. B. McMahan

Matthew J. Streeter

ODL

342

411

26 Feb 2010