Probabilistic Inference in Reinforcement Learning Done Right
arXiv: 2311.13294

22 November 2023
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
    BDL
    OffRL

Papers citing "Probabilistic Inference in Reinforcement Learning Done Right"

7 / 7 papers shown
Toward Efficient Exploration by Large Language Model Agents
Dilip Arumugam
Thomas L. Griffiths
LLMAG
87
0
0
29 Apr 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
V. Cevher
74
0
0
27 Feb 2025
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
24
47
0
06 Oct 2023
Fast Rates for Maximum Entropy Exploration
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Pierre Perrault
Yunhao Tang
Michal Valko
Pierre Menard
31
17
0
14 Mar 2023
On the connection between Bregman divergence and value in regularized Markov decision processes
Brendan O'Donoghue
OffRL
9
2
0
21 Oct 2022
Regret Bounds for Information-Directed Reinforcement Learning
Botao Hao
Tor Lattimore
OffRL
26
17
0
09 Jun 2022
UCB Momentum Q-learning: Correcting the bias without forgetting
Pierre Menard
O. D. Domingues
Xuedong Shang
Michal Valko
72
40
0
01 Mar 2021