Tight Bounds for Bandit Combinatorial Optimization

Annual Conference Computational Learning Theory (COLT), 2017

24 February 2017

Papers citing "Tight Bounds for Bandit Combinatorial Optimization"

16 / 16 papers shown

Instance-Dependent Regret Bounds for Nonstochastic Linear Partial Monitoring

Federico Di Gennaro

Khaled Eldowa

Nicolò Cesa-Bianchi

157

22 Oct 2025

On the Universal Near Optimality of Hedge in Combinatorial Settings

170

20 Oct 2025

Efficient Near-Optimal Algorithm for Online Shortest Paths in Directed Acyclic Graphs with Bandit Feedback Against Adaptive AdversariesAnnual Conference Computational Learning Theory (COLT), 2025

778

01 Apr 2025

Adversarial Combinatorial Semi-bandits with Graph Feedback

Yuxiao Wen

563

26 Feb 2025

$No-Regret M${}^{\natural}$-Concave Function Maximization: Stochastic Bandit Algorithms and Hardness of Adversarial Full-Information Setting$

No-Regret M

{}^{\natural}

-Concave Function Maximization: Stochastic Bandit Algorithms and Hardness of Adversarial Full-Information Setting

Taihei Oki

Shinsaku Sakaue

387

21 May 2024

Information Capacity Regret Bounds for Bandits with Mediator Feedback

Khaled Eldowa

Nicolò Cesa-Bianchi

Alberto Maria Metelli

Marcello Restelli

261

15 Feb 2024

Sum-max Submodular BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

215

10 Nov 2023

On the Minimax Regret for Online Learning with Feedback GraphsNeural Information Processing Systems (NeurIPS), 2023

249

24 May 2023

Sampling Equilibria: Fast No-Regret Learning in Structured GamesACM-SIAM Symposium on Discrete Algorithms (SODA), 2022

635

26 Jan 2022

DART: aDaptive Accept RejecT for non-linear top-K subset identification

228

16 Nov 2020

Preference-based Reinforcement Learning with Finite-Time Guarantees

Aarti Singh

338

16 Jun 2020

Unifying mirror descent and dual averagingMathematical programming (Math. Program.), 2019

A. Juditsky

Joon Kwon

Eric Moulines

385

30 Oct 2019

Top-k Combinatorial Bandits with Full-Bandit FeedbackInternational Conference on Algorithmic Learning Theory (ALT), 2019

Idan Rejwan

Yishay Mansour

359

28 May 2019

Bandit Principal Component Analysis

W. Kotłowski

Gergely Neu

212

08 Feb 2019

Learning to Route Efficiently with End-to-End Feedback: The Value of Networked Structure

Ruihao Zhu

E. Modiano

245

24 Oct 2018

Exponential Weights on the Hypercube in Polynomial Time

Sudeep Raja Putta

Abhishek Shetty

169

12 Jun 2018