ResearchTrend.AI
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

15 June 2023
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
    OffRL

Papers citing "Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning"

2 / 2 papers shown
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
89
0
0
29 Apr 2025
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
105
99
0
15 Oct 2019