Langevin Monte Carlo for Contextual Bandits

Langevin Monte Carlo for Contextual Bandits

22 June 2022

Eric Mazumdar

Kamyar Azizzadenesheli

Anima Anandkumar

Papers citing "Langevin Monte Carlo for Contextual Bandits"

10 / 10 papers shown

Title
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits H. Bui Enrique Mallada Anqi Liu 97 0 0 08 Nov 2024
Stabilizing the Kumaraswamy Distribution Max Wasserman Gonzalo Mateos BDL 44 0 0 01 Oct 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent Yingru Li Jiawei Xu Lei Han Zhi-Quan Luo BDL OffRL 20 6 0 05 Feb 2024
Zero-Inflated Bandits Haoyu Wei Runzhe Wan Lei Shi Rui Song 42 0 0 25 Dec 2023
VITS : Variational Inference Thompson Sampling for contextual bandits Pierre Clavier Tom Huix Alain Durmus 25 3 0 19 Jul 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo Haque Ishfaq Qingfeng Lan Pan Xu A. R. Mahmood Doina Precup Anima Anandkumar Kamyar Azizzadenesheli BDL OffRL 26 20 0 29 May 2023
Hamiltonian Monte Carlo for efficient Gaussian sampling: long and random steps Simon Apers S. Gribling Dániel Szilágyi 28 10 0 26 Sep 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Ziyi Huang H. Lam A. Meisami Haofeng Zhang 34 4 0 31 Jan 2022
Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling Difan Zou Pan Xu Quanquan Gu 38 35 0 19 Oct 2020
Stochastic Linear Contextual Bandits with Diverse Contexts Weiqiang Wu Jing Yang Cong Shen 45 13 0 05 Mar 2020