v1v2v3v4 (latest)

Asymptotically Optimal Information-Directed Sampling

Annual Conference Computational Learning Theory (COLT), 2020

11 November 2020

Papers citing "Asymptotically Optimal Information-Directed Sampling"

23 / 23 papers shown

Optimal and Practical Batched Linear Bandit Algorithm

Sanghoon Yu

Min-hwan Oh

359

11 Jul 2025

An Optimistic Algorithm for online CMDPS with Anytime Adversarial Constraints

262

28 May 2025

Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

Qiaosheng Zhang

Chenjia Bai

Shuyue Hu

Zhen Wang

Xuelong Li

337

30 Apr 2024

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

Ahmadreza Moradipari

M. Pedramfar

Modjtaba Shokrian Zini

Vaneet Aggarwal

339

30 Oct 2023

Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and ApplicationsJournal of machine learning research (JMLR), 2023

Johannes Kirschner

Tor Lattimore

Andreas Krause

308

07 Feb 2023

On the Complexity of Representation Learning in Contextual Linear BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Andrea Tirinzoni

Matteo Pirotta

A. Lazaric

258

19 Dec 2022

Risk-aware linear bandits with convex lossInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Patrick Saux

Odalric-Ambrym Maillard

274

15 Sep 2022

Multi-Armed Bandits with Self-Information RewardsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022

Nir Weinberger

M. Yemini

149

06 Sep 2022

Non-Stationary Dynamic Pricing Via Actor-Critic Information-Directed Pricing

P. Liu

ChiHua Wang

Henghsiu Tsai

227

19 Aug 2022

On the Complexity of Adversarial Decision MakingNeural Information Processing Systems (NeurIPS), 2022

292

27 Jun 2022

Regret Bounds for Information-Directed Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

Botao Hao

Tor Lattimore

OffRL

307

09 Jun 2022

Contextual Information-Directed SamplingInternational Conference on Machine Learning (ICML), 2022

Botao Hao

Tor Lattimore

Chao Qin

425

22 May 2022

The price of unfairness in linear bandits with biased feedbackNeural Information Processing Systems (NeurIPS), 2022

339

18 Mar 2022

Truncated LinUCB for Stochastic Linear Bandits

Yanglei Song

Meng zhou

580

23 Feb 2022

Minimax Regret for Partial Monitoring: Infinite Outcomes and Rustichini's RegretAnnual Conference Computational Learning Theory (COLT), 2022

Tor Lattimore

201

22 Feb 2022

Dealing With Misspecification In Fixed-Confidence Linear Top-m IdentificationNeural Information Processing Systems (NeurIPS), 2021

Clémence Réda

Andrea Tirinzoni

Rémy Degenne

241

02 Nov 2021

The Value of Information When Deciding What to Learn

Dilip Arumugam

Benjamin Van Roy

199

26 Oct 2021

Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification

James A. Grant

David S. Leslie

296

29 Sep 2021

Information Directed Sampling for Sparse Linear BanditsNeural Information Processing Systems (NeurIPS), 2021

Botao Hao

Tor Lattimore

Wei Deng

264

29 May 2021

Bias-Robust Bayesian Optimization via Dueling BanditsInternational Conference on Machine Learning (ICML), 2021

Johannes Kirschner

Andreas Krause

323

25 May 2021

Reinforcement Learning, Bit by Bit

691

06 Mar 2021

An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General ConstraintsNeural Information Processing Systems (NeurIPS), 2021

415

10 Feb 2021

Experimental Design for Regret Minimization in Linear BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Andrew Wagenmaker

Julian Katz-Samuels

Kevin Jamieson

446

01 Nov 2020