On the optimality of the Hedge algorithm in the stochastic regime

v1v2v3 (latest)

On the optimality of the Hedge algorithm in the stochastic regime

5 September 2018

Jaouad Mourtada

Stéphane Gaïffas

ArXiv (abs)PDF HTML

Papers citing "On the optimality of the Hedge algorithm in the stochastic regime"

19 / 19 papers shown

Title
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond Christoph Dann Chen-Yu Wei Julian Zimmert 73 24 0 20 Feb 2023
Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model Chengju Chen Canzhe Zhao Shuai Li 37 5 0 12 Jul 2022
A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback Saeed Masoudian Julian Zimmert Yevgeny Seldin 71 20 0 29 Jun 2022
Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality T. V. Marinov M. Mohri Julian Zimmert 114 6 0 20 Jun 2022
Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds Shinji Ito Taira Tsuchiya Junya Honda AAML 43 17 0 14 Jun 2022
A Regret-Variance Trade-Off in Online Learning Dirk van der Hoeven Nikita Zhivotovskiy Nicolò Cesa-Bianchi 56 7 0 06 Jun 2022
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs Chloé Rouyer Dirk van der Hoeven Nicolò Cesa-Bianchi Yevgeny Seldin 90 17 0 01 Jun 2022
Online Learning with Bounded Recall Jon Schneider Kiran Vodrahalli 68 1 0 28 May 2022
On Optimal Robustness to Adversarial Corruption in Online Decision Problems Shinji Ito 77 22 0 22 Sep 2021
Contextual Games: Multi-Agent Learning with Side Information Pier Giuseppe Sessa Ilija Bogunovic Andreas Krause Maryam Kamgarpour 88 21 0 13 Jul 2021
Best-Case Lower Bounds in Online Learning Cristóbal Guzmán Nishant A. Mehta Ali Mortazavi 17 1 0 23 Jun 2021
The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition Tiancheng Jin Longbo Huang Haipeng Luo 84 42 0 08 Jun 2021
Sequential Ski Rental Problem Anant Shah A. Rajkumar 24 3 0 13 Apr 2021
Multiplicative Reweighting for Robust Neural Network Optimization Noga Bar Tomer Koren Raja Giryes OOD NoLa 83 9 0 24 Feb 2021
Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment Bingshan Hu Zhiming Huang Nishant A. Mehta Nidhi Hegde FedML 62 1 0 16 Feb 2021
MetaGrad: Adaptation using Multiple Learning Rates in Online Learning T. Erven Wouter M. Koolen Dirk van der Hoeven ODL 107 23 0 12 Feb 2021
Prediction with Corrupted Expert Advice I Zaghloul Amir Idan Attias Tomer Koren Roi Livni Yishay Mansour 69 40 0 24 Feb 2020
Learning The Best Expert Efficiently Daron Anderson D. Leith 34 1 0 11 Nov 2019
Thompson Sampling for Adversarial Bit Prediction Yuval Lewi Haim Kaplan Yishay Mansour 16 2 0 21 Jun 2019