2406.03894
Cited By
Transductive Off-policy Proximal Policy Optimization
6 June 2024
Yaozhong Gan
Renye Yan
Xiaoyang Tan
Zhe Wu
Junliang Xing
OffRL
Papers citing "Transductive Off-policy Proximal Policy Optimization"
10 / 10 papers shown
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
42
106
0
09 Jun 2022
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
94
48
0
29 Oct 2021
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
71
1,780
0
08 Jun 2020
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
37
68
0
25 Sep 2019
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target
J. F. Hernandez-Garcia
R. Sutton
27
61
0
22 Jan 2019
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
89
1,584
0
05 Feb 2018
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
85
1,940
0
19 Sep 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
179
18,685
0
20 Jul 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
68
1,313
0
30 May 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
97
13,174
0
09 Sep 2015