A Closer Look at Small-loss Bounds for Bandits with Graph Feedback

Annual Conference Computational Learning Theory (COLT), 2020

2 February 2020

Papers citing "A Closer Look at Small-loss Bounds for Bandits with Graph Feedback"

22 / 22 papers shown

Data-Dependent Regret Bounds for Constrained MABs

Gianmarco Genalti

Francesco Emanuele Stradi

Matteo Castiglioni

A. Marchesi

N. Gatti

485

26 May 2025

Online Two-Sided Markets: Many Buyers Enhance Learning

Anna Lunghi

Matteo Castiglioni

A. Marchesi

290

03 Mar 2025

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

International Conference on Learning Representations (ICLR), 2024

Yu Chen

Jiatai Huang

Yan Dai

Longbo Huang

466

04 Oct 2024

Graph Neural Thompson Sampling

Shuang Wu

Arash A. Amini

420

15 Jun 2024

Incentive-compatible Bandits: Importance Weighting No More

Julian Zimmert

T. V. Marinov

300

10 May 2024

Efficient Contextual Bandits with Uninformed Feedback Graphs

250

12 Feb 2024

Online Network Source Optimization with Graph-Kernel MAB

Laura Toni

P. Frossard

355

07 Jul 2023

Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression

International Conference on Machine Learning (ICML), 2023

Junfan Li

Shizhong Liao

262

14 Jun 2023

Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds

Neural Information Processing Systems (NeurIPS), 2023

Taira Tsuchiya

Shinji Ito

Junya Honda

330

26 May 2023

Practical Contextual Bandits with Feedback Graphs

Neural Information Processing Systems (NeurIPS), 2023

406

17 Feb 2023

Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs

International Conference on Algorithmic Learning Theory (ALT), 2022

257

04 Oct 2022

Regret Minimization and Convergence to Equilibria in General-sum Markov Games

International Conference on Machine Learning (ICML), 2022

498

28 Jul 2022

Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality

Neural Information Processing Systems (NeurIPS), 2022

T. V. Marinov

M. Mohri

Julian Zimmert

263

20 Jun 2022

Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits

Annual Conference Computational Learning Theory (COLT), 2022

267

12 Feb 2022

Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

Journal of Machine Learning Research (JMLR), 2021

434

29 Dec 2021

Best-of-All-Worlds Bounds for Online Learning with Feedback Graphs

Liad Erez

Tomer Koren

169

20 Jul 2021

The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition

Neural Information Processing Systems (NeurIPS), 2021

Tiancheng Jin

Longbo Huang

Haipeng Luo

280

08 Jun 2021

Understanding Bandits with Graph Feedback

Neural Information Processing Systems (NeurIPS), 2021

Houshuang Chen

Zengfeng Huang

Shuai Li

Chihao Zhang

220

29 May 2021

Adversarial Linear Contextual Bandits with Graph-Structured Side Observations

AAAI Conference on Artificial Intelligence (AAAI), 2020

329

10 Dec 2020

Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition

Liyu Chen

Haipeng Luo

Chen-Yu Wei

638

07 Dec 2020

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

Neural Information Processing Systems (NeurIPS), 2020

399

14 Jun 2020

Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition

Neural Information Processing Systems (NeurIPS), 2020

Tiancheng Jin

Haipeng Luo

347

10 Jun 2020