Regret Minimization in Heavy-Tailed Bandits

Annual Conference Computational Learning Theory (COLT), 2021

7 February 2021

Papers citing "Regret Minimization in Heavy-Tailed Bandits"

20 / 20 papers shown

Learning When Not to Learn: Risk-Sensitive Abstention in Bandits with Unbounded Rewards

Sarah Liaw

Benjamin Plaut

210

16 Oct 2025

Robust Batched Bandits

137

04 Oct 2025

Optimal e-value testing for properly constrained hypotheses

Eugenio Clerico

217

30 Dec 2024

Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

Ambrus Tamás

Szabolcs Szentpéteri

Balázs Csanád Csáji

219

09 Jun 2024

Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noiseAdaptive Agents and Multi-Agent Systems (AAMAS), 2024

269

10 Feb 2024

(ε, u)

-Adaptive Regret Minimization in Heavy-Tailed Bandits

Gianmarco Genalti

Lupo Marsigli

Nicola Gatti

Alberto Maria Metelli

308

04 Oct 2023

Nash Regret Guarantees for Linear BanditsNeural Information Processing Systems (NeurIPS), 2023

Ayush Sawarni

Soumybrata Pal

Siddharth Barman

352

03 Oct 2023

CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic CorruptionInternational Conference on Algorithmic Learning Theory (ALT), 2023

Shubhada Agrawal

Timothée Mathieu

D. Basu

Odalric-Ambrym Maillard

291

28 Sep 2023

Allocating Divisible Resources on Arms with Unknown and Random RewardsAnnual Conference Computational Learning Theory (COLT), 2023

Yi Xiong

Siyuan Li

257

28 Jun 2023

Optimal Best-Arm Identification in Bandits with Access to Offline Data

Shubhada Agrawal

Sandeep Juneja

Karthikeyan Shanmugam

A. Suggala

327

15 Jun 2023

Differentially Private Episodic Reinforcement Learning with Heavy-tailed RewardsInternational Conference on Machine Learning (ICML), 2023

445

01 Jun 2023

Regret Distribution in Stochastic Bandits: Optimal Trade-off between Expectation and Tail Risk

D. Simchi-Levi

Zeyu Zheng

Feng Zhu

167

10 Apr 2023

A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms

Dorian Baudry

Kazuya Suzuki

Junya Honda

281

10 Mar 2023

Optimality of Thompson Sampling with Noninformative Priors for Pareto BanditsInternational Conference on Machine Learning (ICML), 2023

365

03 Feb 2023

Non-Asymptotic Analysis of a UCB-based Top Two AlgorithmNeural Information Processing Systems (NeurIPS), 2022

Marc Jourdan

Rémy Degenne

501

11 Oct 2022

Multi-Armed Bandits with Self-Information RewardsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022

Nir Weinberger

M. Yemini

149

06 Sep 2022

Top Two Algorithms RevisitedNeural Information Processing Systems (NeurIPS), 2022

353

13 Jun 2022

Catoni-style confidence sequences for heavy-tailed mean estimationStochastic Processes and their Applications (SPA), 2022

Hongjian Wang

Aaditya Ramdas

937

02 Feb 2022

Regret Minimization in Isotonic, Heavy-Tailed Contextual Bandits via Adaptive Confidence Bands

S. Chatterjee

Subhabrata Sen

OffRL

214

19 Oct 2021

Optimal Best-Arm Identification Methods for Tail-Risk Measures

Shubhada Agrawal

Wouter M. Koolen

Sandeep Juneja

318

17 Aug 2020