Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.11340
Cited By
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
18 May 2023
Wenhao Ding
Tong Che
Ding Zhao
Marco Pavone
BDL
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models"
8 / 8 papers shown
Title
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
16
0
0
16 Jun 2024
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Anji Liu
Guy Van den Broeck
Yitao Liang
OffRL
34
1
0
31 Oct 2023
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Bin Cui
Ming-Hsuan Yang
DiffM
MedIm
224
1,296
0
02 Sep 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
212
832
0
12 Oct 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
212
413
0
16 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,944
0
04 May 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
143
1,624
0
02 Feb 2020
1