Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

15 June 2023

Papers citing "Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning"

2 / 2 papers shown

Title
Toward Efficient Exploration by Large Language Model Agents Dilip Arumugam Thomas L. Griffiths LLMAG 89 0 0 29 Apr 2025
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Chen-Yu Wei Mehdi Jafarnia-Jahromi Haipeng Luo Hiteshi Sharma R. Jain 105 99 0 15 Oct 2019