$f$ -Policy Gradients: A General Framework for Goal Conditioned RL using $f$ -Divergences

10 October 2023

Papers citing "$f$-Policy Gradients: A General Framework for Goal Conditioned RL using $f$-Divergences"

4 / 4 papers shown

Title
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning Caleb Chuck Fan Feng Carl Qi Chang Shi Siddhant Agarwal Amy Zhang S. Niekum 35 0 0 06 May 2025
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 303 11,730 0 04 Mar 2022
Reward (Mis)design for Autonomous Driving W. B. Knox A. Allievi Holger Banzhaf Felix Schmitt Peter Stone 67 112 0 28 Apr 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation OpenAI OpenAI Matthias Plappert Raul Sampedro Tao Xu Ilge Akkaya ... Hyeonwoo Noh Lilian Weng Qiming Yuan Casey Chu Wojciech Zaremba SSL 60 76 0 13 Jan 2021