2009.04416
Phasic Policy Gradient
International Conference on Machine Learning (ICML), 2020
9 September 2020
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
Papers citing "Phasic Policy Gradient"
50 / 99 papers shown
Blindfolded Experts Generalize Better: Insights from Robotic Manipulation and Videogames
E. Zisselman
Mirco Mutti
Shelly Francis-Meretzki
Elisei Shafer
Aviv Tamar
OffRL
200
0
0
28 Oct 2025
Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks
Jason Gardner
Ayan Dutta
Swapnoneel Roy
O. P. Kreidl
Ladislau Bölöni
158
2
0
05 Sep 2025
Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient
Zhongzhu Zhou
Yibo Yang
Ziyan Chen
Fengxiang Bie
Haojun Xia
Xiaoxia Wu
Robert Wu
Ben Athiwaratkun
Bernard Ghanem
Shuaiwen Leon Song
226
0
0
02 Sep 2025
Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
Yi Ma
Hongyao Tang
Chenjun Xiao
Yaodong Yang
Wei Wei
Jianye Hao
Jiye Liang
OffRL
236
0
0
05 Aug 2025
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
221
1
0
02 Aug 2025
Adaptive Network Security Policies via Belief Aggregation and Rollout
Kim Hammar
Yuchao Li
Tansu Alpcan
Emil C. Lupu
Dimitri P. Bertsekas
339
6
0
21 Jul 2025
Relative Entropy Pathwise Policy Optimization
C. Voelcker
Axel Brunnbauer
Marcel Hussing
Michal Nauman
Pieter Abbeel
Eric Eaton
Radu Grosu
Amir-massoud Farahmand
Igor Gilitschenski
491
1
0
15 Jul 2025
The Actor-Critic Update Order Matters for PPO in Federated Reinforcement Learning
Zhijie Xie
Shenghui Song
277
0
0
02 Jun 2025
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Tao Wang
Ruipeng Zhang
Sicun Gao
OffRL
245
4
0
25 May 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
IEEE International Conference on Data Engineering (ICDE), 2025
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRL
AI4TS
283
0
0
27 Apr 2025
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
376
1
0
07 Apr 2025
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Samuel Garcin
Trevor A. McInroe
Pablo Samuel Castro
Prakash Panangaden
Christopher G. Lucas
David Abel
Stefano V. Albrecht
417
6
0
08 Mar 2025
Pre-Trained Video Generative Models as World Simulators
Haoran He
Yang Zhang
Guanbin Li
Zhihao Xu
Ling Pan
VGen
472
29
0
10 Feb 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Jianfeng Dong
Wenjun Zeng
OffRL
979
1
0
22 Jan 2025
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
581
6
0
06 Nov 2024
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
International Conference on Learning Representations (ICLR), 2024
Thomas P Cannon
Özgür Simsek
AI4CE
266
0
0
05 Nov 2024
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach
Neural Information Processing Systems (NeurIPS), 2024
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
188
3
0
17 Oct 2024
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale
Andrew Jesson
Yiding Jiang
OffRL
331
6
0
13 Oct 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Neural Information Processing Systems (NeurIPS), 2024
Hongyao Tang
Glen Berseth
OffRL
363
11
0
07 Sep 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
132
0
0
18 Jul 2024
Pretraining-finetuning Framework for Efficient Co-design: A Case Study on Quadruped Robot Parkour
Ci Chen
Jiyu Yu
Haojian Lu
Hongbo Gao
R. Xiong
Yue Wang
374
6
0
09 Jul 2024
Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks
Babak Badnava
Jacob Chakareski
Morteza Hashemi
277
3
0
03 Jul 2024
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede
Felix Kaubek
M. Spaan
Wendelin Bohmer
321
0
0
12 Jun 2024
Representation Learning For Efficient Deep Multi-Agent Reinforcement Learning
Dom Huh
Prasant Mohapatra
283
4
0
05 Jun 2024
Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Riqiang Gao
Florin-Cristian Ghesu
Simon Arberet
Shahab Basiri
Esa Kuusela
Martin Kraus
Dorin Comaniciu
A. Kamen
AI4CE
179
4
0
03 Jun 2024
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
285
1
0
17 Mar 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
312
23
0
05 Feb 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
356
9
0
24 Jan 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
IEEE Transactions on Evolutionary Computation (IEEE Trans. Evol. Comput.), 2024
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
416
63
0
22 Jan 2024
A Survey Analyzing Generalization in Deep Reinforcement Learning
Ezgi Korkmaz
OffRL
341
10
0
04 Jan 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
422
58
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
557
9
0
13 Dec 2023
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
K.R. Zentner
Ujjwal Puri
Zhehui Huang
Gaurav Sukhatme
OffRL
249
0
0
08 Dec 2023
DGMem: Learning Visual Navigation Policy without Any Labels by Dynamic Graph Memory
Wenzhe Cai
Teng Wang
Guangran Cheng
Lele Xu
Changyin Sun
397
5
0
30 Nov 2023
C-Procgen: Empowering Procgen with Controllable Contexts
Zhenxiong Tan
Kaixin Wang
Xinchao Wang
262
2
0
13 Nov 2023
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks
Neural Information Processing Systems (NeurIPS), 2023
Ryan Sullivan
Akarsh Kumar
Shengyi Huang
John P. Dickerson
Joseph Suárez
OffRL
227
8
0
26 Oct 2023
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Jiayu Chen
Zelai Xu
Yunfei Li
Chao Yu
Jiaming Song
Huazhong Yang
Fei Fang
Yu Wang
Yi Wu
332
7
0
07 Oct 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
260
4
0
28 Sep 2023
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
239
0
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Information Fusion (Inf. Fusion), 2023
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
279
29
0
22 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Neural Information Processing Systems (NeurIPS), 2023
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
454
12
0
19 Sep 2023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
International Conference on Learning Representations (ICLR), 2023
Hao Sun
Alihan Huyuk
M. Schaar
OffRL
LRM
342
46
0
13 Sep 2023
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning
Neural Information Processing Systems (NeurIPS), 2023
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
446
8
0
07 Jul 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
International Conference on Machine Learning (ICML), 2023
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
158
10
0
23 Jun 2023
Explore to Generalize in Zero-Shot RL
Neural Information Processing Systems (NeurIPS), 2023
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
405
23
0
05 Jun 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
240
5
0
07 May 2023
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Shanchuan Wan
Yujin Tang
Yingtao Tian
Tomoyuki Kaneko
OffRL
184
8
0
21 Apr 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Neurocomputing (Neurocomputing), 2023
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRL
OnRL
240
9
0
18 Apr 2023
CFlowNets: Continuous Control with Generative Flow Networks
International Conference on Learning Representations (ICLR), 2023
Yinchuan Li
Shuang Luo
Haozhi Wang
Jianye Hao
343
25
0
04 Mar 2023
Scaling laws for single-agent reinforcement learning
Jacob Hilton
Jie Tang
John Schulman
357
36
0
31 Jan 2023