Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits

Neural Information Processing Systems (NeurIPS), 2016

1 June 2016

Papers citing "Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits"

28 / 28 papers shown

LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits

Masahiro Kato

Shinji Ito

597

05 Mar 2024

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual BanditsNeural Information Processing Systems (NeurIPS), 2023

Haolin Liu

Chen-Yu Wei

Julian Zimmert

301

02 Sep 2023

Incentivizing High-Quality Content in Online Recommender Systems

420

13 Jun 2023

Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability

Aadirupa Saha

A. Krishnamurthy

326

24 Nov 2021

Contextual Games: Multi-Agent Learning with Side Information

240

13 Jul 2021

Boosting for Online Convex OptimizationInternational Conference on Machine Learning (ICML), 2021

Elad Hazan

Karan Singh

OffRL

183

18 Feb 2021

Adversarial Linear Contextual Bandits with Graph-Structured Side ObservationsAAAI Conference on Artificial Intelligence (AAAI), 2020

328

10 Dec 2020

Taking a hint: How to leverage loss predictors in contextual bandits?Annual Conference Computational Learning Theory (COLT), 2020

Chen-Yu Wei

Haipeng Luo

Alekh Agarwal

366

04 Mar 2020

Beyond UCB: Optimal and Efficient Contextual Bandits with Regression OraclesInternational Conference on Machine Learning (ICML), 2020

Dylan J. Foster

Alexander Rakhlin

677

240

12 Feb 2020

Reinforcement Learning in Factored MDPs: Oracle-Efficient Algorithms and Tighter Regret Bounds for the Non-Episodic Setting

Ziping Xu

Ambuj Tewari

332

06 Feb 2020

Efficient and Robust Algorithms for Adversarial Linear Contextual BanditsAnnual Conference Computational Learning Theory (COLT), 2020

Gergely Neu

Julia Olkhovskaya

481

01 Feb 2020

Fair Contextual Multi-Armed Bandits: Theory and ExperimentsConference on Uncertainty in Artificial Intelligence (UAI), 2019

267

13 Dec 2019

Online Pricing with Reserve Price Constraint for Personal Data MarketsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2019

179

28 Nov 2019

OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual banditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2019

Niladri S. Chatterji

Vidya Muthukumar

Peter L. Bartlett

274

24 May 2019

Introduction to Multi-Armed Bandits

Aleksandrs Slivkins

1.6K

1,218

15 Apr 2019

Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case

241

06 Feb 2019

A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free

374

143

03 Feb 2019

Adversarial Bandits with Knapsacks

Nicole Immorlica

Karthik Abinav Sankararaman

Robert Schapire

Aleksandrs Slivkins

797

133

28 Nov 2018

Contextual bandits with surrogate losses: Margin bounds and efficient algorithmsNeural Information Processing Systems (NeurIPS), 2018

Dylan J. Foster

A. Krishnamurthy

401

28 Jun 2018

Online Learning via the Differential Privacy Lens

341

27 Nov 2017

Disagreement-Based Combinatorial Pure Exploration: Sample Complexity Bounds and an Efficient Algorithm

Tongyi Cao

A. Krishnamurthy

240

21 Nov 2017

Small-loss bounds for online learning with partial information

Thodoris Lykouris

Karthik Sridharan

Éva Tardos

349

09 Nov 2017

Efficient Contextual Bandits in Non-stationary Worlds

394

152

05 Aug 2017

Adversarial Ranking for Language GenerationNeural Information Processing Systems (NeurIPS), 2017

421

349

31 May 2017

$Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret$

Efficient Online Bandit Multiclass Learning with

\tilde{O}(\sqrt{T})

RegretInternational Conference on Machine Learning (ICML), 2017

A. Beygelzimer

Francesco Orabona

Chicheng Zhang

293

25 Feb 2017

Corralling a Band of Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2016

458

170

19 Dec 2016

Oracle-Efficient Online Learning and Auction Design

Jennifer Wortman Vaughan

243

05 Nov 2016

Risk-Aware Algorithms for Adversarial Contextual Bandits

Wen Sun

Debadeepta Dey

Ashish Kapoor

173

17 Oct 2016