A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
Annual Conference on Computational Learning Theory (COLT), 2023
arXiv:2302.09739 · 20 February 2023
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Papers citing "A Blackbox Approach to Best of Both Worlds in Bandits and Beyond"

26 papers
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
Shinji Ito, Kevin Jamieson, Haipeng Luo, Arnab Maiti, Taira Tsuchiya
20 Oct 2025
FLEET: Formal Language-Grounded Scheduling for Heterogeneous Robot Teams
Corban G. Rivera, Grayson Byrd, Meghan Booker, Bethany Kemp, Allison Gaines, Emma Holmes, James Uplinger, Celso M. De Melo, D. Handelman
08 Oct 2025
Efficient Best-of-Both-Worlds Algorithms for Contextual Combinatorial Semi-Bandits
Mengmeng Li, Philipp Schneider, Jelisaveta Aleksić, Daniel Kuhn
26 Aug 2025
Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
Jongyeong Lee, Junya Honda, Shinji Ito, Min-hwan Oh
26 Aug 2025
A Near-optimal, Scalable and Parallelizable Framework for Stochastic Bandits Robust to Adversarial Corruptions and Beyond
Zicheng Hu, Cheng Chen
11 Feb 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk, Jung-hun Kim
31 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
International Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei, Christoph Dann, Julian Zimmert
31 Dec 2024
How Does Variance Shape the Regret in Contextual Bandits?
Neural Information Processing Systems (NeurIPS), 2024
Zeyu Jia, Jian Qian, Alexander Rakhlin, Chen-Yu Wei
16 Oct 2024
Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification
Neural Information Processing Systems (NeurIPS), 2024
Haolin Liu, Artin Tajdini, Andrew Wagenmaker, Chen-Yu Wei
10 Oct 2024
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs
International Conference on Learning Representations (ICLR), 2024
Yu Chen, Jiatai Huang, Yan Dai, Longbo Huang
04 Oct 2024
A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $Θ(T^{2/3})$ and its Application to Best-of-Both-Worlds
Taira Tsuchiya, Shinji Ito
30 May 2024
LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits
Masahiro Kato, Shinji Ito
05 Mar 2024
Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds
Shinji Ito, Taira Tsuchiya, Junya Honda
01 Mar 2024
Information Capacity Regret Bounds for Bandits with Mediator Feedback
Khaled Eldowa, Nicolò Cesa-Bianchi, Alberto Maria Metelli, Marcello Restelli
15 Feb 2024
Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring
Taira Tsuchiya, Shinji Ito, Junya Honda
13 Feb 2024
Efficient Contextual Bandits with Uninformed Feedback Graphs
Mengxiao Zhang, Yuheng Zhang, Haipeng Luo, Paul Mineiro
12 Feb 2024
Best-of-Both-Worlds Linear Contextual Bandits
Masahiro Kato, Shinji Ito
27 Dec 2023
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits
Yuko Kuroki, Alberto Rumi, Taira Tsuchiya, Fabio Vitale, Nicolò Cesa-Bianchi
24 Dec 2023
Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback
International Conference on Learning Representations (ICLR), 2023
Haolin Liu, Chen-Yu Wei, Julian Zimmert
17 Oct 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Neural Information Processing Systems (NeurIPS), 2023
Haolin Liu, Chen-Yu Wei, Julian Zimmert
02 Sep 2023
On Interpolating Experts and Multi-Armed Bandits
International Conference on Machine Learning (ICML), 2023
Houshuang Chen, Yuchen He, Chihao Zhang
14 Jul 2023
Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds
Neural Information Processing Systems (NeurIPS), 2023
Taira Tsuchiya, Shinji Ito, Junya Honda
26 May 2023
On the Minimax Regret for Online Learning with Feedback Graphs
Neural Information Processing Systems (NeurIPS), 2023
Khaled Eldowa, Emmanuel Esposito, Tommaso Cesari, Nicolò Cesa-Bianchi
24 May 2023
Implicitly normalized forecaster with clipping for linear and non-linear heavy-tailed multi-armed bandits
Computational Management Science (CMS), 2023
Yuriy Dorn, Kornilov Nikita, N. Kutuzov, A. Nazin, Eduard A. Gorbunov, Alexander Gasnikov
11 May 2023
Accelerated Rates between Stochastic and Adversarial Online Convex Optimization
Sarah Sachs, Hédi Hadiji, T. Erven, Cristóbal Guzmán
06 Mar 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds
Annual Conference on Computational Learning Theory (COLT), 2023
Shinji Ito, Kei Takemura
24 Feb 2023