Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1809.01382
Cited By
v1
v2
v3 (latest)
On the optimality of the Hedge algorithm in the stochastic regime
5 September 2018
Jaouad Mourtada
Stéphane Gaïffas
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"On the optimality of the Hedge algorithm in the stochastic regime"
19 / 19 papers shown
Title
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
Christoph Dann
Chen-Yu Wei
Julian Zimmert
73
24
0
20 Feb 2023
Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model
Chengju Chen
Canzhe Zhao
Shuai Li
37
5
0
12 Jul 2022
A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback
Saeed Masoudian
Julian Zimmert
Yevgeny Seldin
71
20
0
29 Jun 2022
Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality
T. V. Marinov
M. Mohri
Julian Zimmert
114
6
0
20 Jun 2022
Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds
Shinji Ito
Taira Tsuchiya
Junya Honda
AAML
43
17
0
14 Jun 2022
A Regret-Variance Trade-Off in Online Learning
Dirk van der Hoeven
Nikita Zhivotovskiy
Nicolò Cesa-Bianchi
56
7
0
06 Jun 2022
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs
Chloé Rouyer
Dirk van der Hoeven
Nicolò Cesa-Bianchi
Yevgeny Seldin
90
17
0
01 Jun 2022
Online Learning with Bounded Recall
Jon Schneider
Kiran Vodrahalli
68
1
0
28 May 2022
On Optimal Robustness to Adversarial Corruption in Online Decision Problems
Shinji Ito
77
22
0
22 Sep 2021
Contextual Games: Multi-Agent Learning with Side Information
Pier Giuseppe Sessa
Ilija Bogunovic
Andreas Krause
Maryam Kamgarpour
88
21
0
13 Jul 2021
Best-Case Lower Bounds in Online Learning
Cristóbal Guzmán
Nishant A. Mehta
Ali Mortazavi
17
1
0
23 Jun 2021
The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition
Tiancheng Jin
Longbo Huang
Haipeng Luo
84
42
0
08 Jun 2021
Sequential Ski Rental Problem
Anant Shah
A. Rajkumar
24
3
0
13 Apr 2021
Multiplicative Reweighting for Robust Neural Network Optimization
Noga Bar
Tomer Koren
Raja Giryes
OOD
NoLa
83
9
0
24 Feb 2021
Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment
Bingshan Hu
Zhiming Huang
Nishant A. Mehta
Nidhi Hegde
FedML
62
1
0
16 Feb 2021
MetaGrad: Adaptation using Multiple Learning Rates in Online Learning
T. Erven
Wouter M. Koolen
Dirk van der Hoeven
ODL
107
23
0
12 Feb 2021
Prediction with Corrupted Expert Advice
I Zaghloul Amir
Idan Attias
Tomer Koren
Roi Livni
Yishay Mansour
69
40
0
24 Feb 2020
Learning The Best Expert Efficiently
Daron Anderson
D. Leith
34
1
0
11 Nov 2019
Thompson Sampling for Adversarial Bit Prediction
Yuval Lewi
Haim Kaplan
Yishay Mansour
16
2
0
21 Jun 2019
1