Probabilistic Inference in Reinforcement Learning Done Right

22 November 2023

Papers citing "Probabilistic Inference in Reinforcement Learning Done Right"

Title | Authors | Tag | Counts | Date
Toward Efficient Exploration by Large Language Model Agents | Dilip Arumugam, Thomas L. Griffiths | LLMAG | 87 0 0 | 29 Apr 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic | Stefano Viel, Luca Viano, V. Cevher | - | 74 0 0 | 27 Feb 2025
Confronting Reward Model Overoptimization with Constrained RLHF | Ted Moskovitz, Aaditya K. Singh, DJ Strouse, T. Sandholm, Ruslan Salakhutdinov, Anca D. Dragan, Stephen Marcus McAleer | - | 24 47 0 | 06 Oct 2023
Fast Rates for Maximum Entropy Exploration | D. Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Rémi Munos, A. Naumov, Pierre Perrault, Yunhao Tang, Michal Valko, Pierre Menard | - | 31 17 0 | 14 Mar 2023
On the connection between Bregman divergence and value in regularized Markov decision processes | Brendan O'Donoghue | OffRL | 9 2 0 | 21 Oct 2022
Regret Bounds for Information-Directed Reinforcement Learning | Botao Hao, Tor Lattimore | OffRL | 26 17 0 | 09 Jun 2022
UCB Momentum Q-learning: Correcting the bias without forgetting | Pierre Menard, O. D. Domingues, Xuedong Shang, Michal Valko | - | 72 40 0 | 01 Mar 2021