Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning
with Energy-based Models

Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models

18 May 2023

Ding Zhao

Papers citing "Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models"

8 / 8 papers shown

Title
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions Kai Xu Farid Tajaddodianfar Ben Allison 16 0 0 16 Jun 2024
A Tractable Inference Perspective of Offline RL Xuejie Liu Anji Liu Guy Van den Broeck Yitao Liang OffRL 34 1 0 31 Oct 2023
Diffusion Models: A Comprehensive Survey of Methods and Applications Ling Yang Zhilong Zhang Yingxia Shao Shenda Hong Runsheng Xu Yue Zhao Wentao Zhang Bin Cui Ming-Hsuan Yang DiffM MedIm 224 1,296 0 02 Sep 2022
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 303 11,881 0 04 Mar 2022
Offline Reinforcement Learning with Implicit Q-Learning Ilya Kostrikov Ashvin Nair Sergey Levine OffRL 212 832 0 12 Oct 2021
COMBO: Conservative Offline Model-Based Policy Optimization Tianhe Yu Aviral Kumar Rafael Rafailov Aravind Rajeswaran Sergey Levine Chelsea Finn OffRL 212 413 0 16 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems Sergey Levine Aviral Kumar George Tucker Justin Fu OffRL GP 329 1,944 0 04 May 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey B. R. Kiran Ibrahim Sobh V. Talpaert Patrick Mannion A. A. Sallab S. Yogamani P. Pérez 143 1,624 0 02 Feb 2020