Transductive Off-policy Proximal Policy Optimization

Transductive Off-policy Proximal Policy Optimization

6 June 2024

Junliang Xing

Papers citing "Transductive Off-policy Proximal Policy Optimization"

10 / 10 papers shown

Title
Mildly Conservative Q-Learning for Offline Reinforcement Learning Jiafei Lyu Xiaoteng Ma Xiu Li Zongqing Lu OffRL 42 106 0 09 Jun 2022
Generalized Proximal Policy Optimization with Sample Reuse James Queeney I. Paschalidis Christos G. Cassandras OffRL 94 48 0 29 Oct 2021
Conservative Q-Learning for Offline Reinforcement Learning Aviral Kumar Aurick Zhou George Tucker Sergey Levine OffRL OnRL 71 1,780 0 08 Jun 2020
Off-Policy Actor-Critic with Shared Experience Replay Simon Schmitt Matteo Hessel Karen Simonyan OffRL 37 68 0 25 Sep 2019
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target J. F. Hernandez-Garcia R. Sutton 27 61 0 22 Jan 2019
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 89 1,584 0 05 Feb 2018
Deep Reinforcement Learning that Matters Peter Henderson Riashat Islam Philip Bachman Joelle Pineau Doina Precup David Meger OffRL 85 1,940 0 19 Sep 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 179 18,685 0 20 Jul 2017
Constrained Policy Optimization Joshua Achiam David Held Aviv Tamar Pieter Abbeel 68 1,313 0 30 May 2017
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 97 13,174 0 09 Sep 2015