
Variational inference for the multi-armed contextual bandit

10 September 2017

Iñigo Urteaga

C. Wiggins


Papers citing "Variational inference for the multi-armed contextual bandit"

21 / 21 papers shown

EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning

Asian Conference on Machine Learning (ACML), 2025

299

17 Jan 2025

Stabilizing the Kumaraswamy Distribution

Max Wasserman

Gonzalo Mateos

BDL

220

01 Oct 2024

Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits

Ziyi Huang

Henry Lam

Haofeng Zhang

383

20 Jun 2024

Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits

296

08 Feb 2024

Improving sample efficiency of high dimensional Bayesian optimization with MCMC

197

05 Jan 2024

VITS : Variational Inference Thompson Sampling for contextual bandits

International Conference on Machine Learning (ICML), 2023

Pierre Clavier

Tom Huix

Alain Durmus

369

19 Jul 2023

Multiplier Bootstrap-based Exploration

International Conference on Machine Learning (ICML), 2023

201

03 Feb 2023

Mixed-Effect Thompson Sampling

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

352

30 May 2022

Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

216

24 Mar 2022

An Analysis of Ensemble Sampling

Neural Information Processing Systems (NeurIPS), 2022

381

02 Mar 2022

Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network

Yun-Da Tsai

Shou-De Lin

176

17 Feb 2022

Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework

Neural Information Processing Systems (NeurIPS), 2022

366

31 Jan 2022

Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification

James A. Grant

David S. Leslie

246

29 Sep 2021

Thompson Sampling with a Mixture Prior

International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

249

10 Jun 2021

Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems

International Conference on Machine Learning (ICML), 2020

Ole J. Mengshoel

200

09 Jul 2020

On Thompson Sampling with Langevin Algorithms

International Conference on Machine Learning (ICML), 2020

Eric Mazumdar

251

23 Feb 2020

On Thompson Sampling for Smoother-than-Lipschitz Bandits

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020

James A. Grant

David S. Leslie

285

08 Jan 2020

Thompson Sampling with Approximate Inference

My Phan

Yasin Abbasi-Yadkori

Justin Domke

151

14 Aug 2019

Scalable Thompson Sampling via Optimal Transport

Ruiyi Zhang

Zheng Wen

Changyou Chen

Lawrence Carin

212

19 Feb 2019

Thompson Sampling for Noncompliant Bandits

Andrew Stirn

Tony Jebara

03 Dec 2018

Nonparametric Gaussian Mixture Models for the Multi-Armed Bandit

Iñigo Urteaga

C. Wiggins

294

08 Aug 2018