ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.11254
  4. Cited By
Langevin Monte Carlo for Contextual Bandits

Langevin Monte Carlo for Contextual Bandits

22 June 2022
Pan Xu
Hongkai Zheng
Eric Mazumdar
Kamyar Azizzadenesheli
Anima Anandkumar
ArXivPDFHTML

Papers citing "Langevin Monte Carlo for Contextual Bandits"

10 / 10 papers shown
Title
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
97
0
0
08 Nov 2024
Stabilizing the Kumaraswamy Distribution
Stabilizing the Kumaraswamy Distribution
Max Wasserman
Gonzalo Mateos
BDL
44
0
0
01 Oct 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice
  via HyperAgent
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
20
6
0
05 Feb 2024
Zero-Inflated Bandits
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
42
0
0
25 Dec 2023
VITS : Variational Inference Thompson Sampling for contextual bandits
VITS : Variational Inference Thompson Sampling for contextual bandits
Pierre Clavier
Tom Huix
Alain Durmus
25
3
0
19 Jul 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning
  via Langevin Monte Carlo
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
26
20
0
29 May 2023
Hamiltonian Monte Carlo for efficient Gaussian sampling: long and random
  steps
Hamiltonian Monte Carlo for efficient Gaussian sampling: long and random steps
Simon Apers
S. Gribling
Dániel Szilágyi
28
10
0
26 Sep 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error:
  An Enhanced Bayesian Upper Confidence Bound Framework
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework
Ziyi Huang
H. Lam
A. Meisami
Haofeng Zhang
34
4
0
31 Jan 2022
Faster Convergence of Stochastic Gradient Langevin Dynamics for
  Non-Log-Concave Sampling
Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling
Difan Zou
Pan Xu
Quanquan Gu
38
35
0
19 Oct 2020
Stochastic Linear Contextual Bandits with Diverse Contexts
Stochastic Linear Contextual Bandits with Diverse Contexts
Weiqiang Wu
Jing Yang
Cong Shen
45
13
0
05 Mar 2020
1