Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.06794
Cited By
f
f
f
-Policy Gradients: A General Framework for Goal Conditioned RL using
f
f
f
-Divergences
10 October 2023
Siddhant Agarwal
Ishan Durugkar
Peter Stone
Amy Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"$f$-Policy Gradients: A General Framework for Goal Conditioned RL using $f$-Divergences"
4 / 4 papers shown
Title
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
38
0
0
06 May 2025
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Reward (Mis)design for Autonomous Driving
W. B. Knox
A. Allievi
Holger Banzhaf
Felix Schmitt
Peter Stone
67
112
0
28 Apr 2021
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
60
76
0
13 Jan 2021
1