Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.08803
Cited By
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
15 June 2023
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning"
2 / 2 papers shown
Title
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
89
0
0
29 Apr 2025
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
105
99
0
15 Oct 2019
1