ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.04416
  4. Cited By
Phasic Policy Gradient

Phasic Policy Gradient

International Conference on Machine Learning (ICML), 2020
9 September 2020
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Phasic Policy Gradient"

50 / 99 papers shown
Blindfolded Experts Generalize Better: Insights from Robotic Manipulation and Videogames
Blindfolded Experts Generalize Better: Insights from Robotic Manipulation and Videogames
E. Zisselman
Mirco Mutti
Shelly Francis-Meretzki
Elisei Shafer
Aviv Tamar
OffRL
200
0
0
28 Oct 2025
Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks
Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks
Jason Gardner
Ayan Dutta
Swapnoneel Roy
O. P. Kreidl
Ladislau Bölöni
158
2
0
05 Sep 2025
Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient
Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient
Zhongzhu Zhou
Yibo Yang
Ziyan Chen
Fengxiang Bie
Haojun Xia
Xiaoxia Wu
Robert Wu
Ben Athiwaratkun
Bernard Ghanem
Shuaiwen Leon Song
226
0
0
02 Sep 2025
Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
Yi Ma
Hongyao Tang
Chenjun Xiao
Yaodong Yang
Wei Wei
Jianye Hao
Jiye Liang
OffRL
236
0
0
05 Aug 2025
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
221
1
0
02 Aug 2025
Adaptive Network Security Policies via Belief Aggregation and Rollout
Adaptive Network Security Policies via Belief Aggregation and Rollout
Kim Hammar
Yuchao Li
Tansu Alpcan
Emil C. Lupu
Dimitri P. Bertsekas
339
6
0
21 Jul 2025
Relative Entropy Pathwise Policy Optimization
Relative Entropy Pathwise Policy Optimization
C. Voelcker
Axel Brunnbauer
Marcel Hussing
Michal Nauman
Pieter Abbeel
Eric Eaton
Radu Grosu
Amir-massoud Farahmand
Igor Gilitschenski
491
1
0
15 Jul 2025
The Actor-Critic Update Order Matters for PPO in Federated Reinforcement Learning
The Actor-Critic Update Order Matters for PPO in Federated Reinforcement Learning
Zhijie Xie
Shenghui Song
277
0
0
02 Jun 2025
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Tao Wang
Ruipeng Zhang
Sicun Gao
OffRL
245
4
0
25 May 2025
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement Learning
BQSched: A Non-intrusive Scheduler for Batch Concurrent Queries via Reinforcement LearningIEEE International Conference on Data Engineering (ICDE), 2025
Chenhao Xu
Chunyu Chen
Jinglin Peng
Jiannan Wang
Jun Gao
OffRLAI4TS
283
0
0
27 Apr 2025
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
376
1
0
07 Apr 2025
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Studying the Interplay Between the Actor and Critic Representations in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Samuel Garcin
Trevor A. McInroe
Pablo Samuel Castro
Prakash Panangaden
Christopher G. Lucas
David Abel
Stefano V. Albrecht
417
6
0
08 Mar 2025
Pre-Trained Video Generative Models as World Simulators
Pre-Trained Video Generative Models as World Simulators
Haoran He
Yang Zhang
Guanbin Li
Zhihao Xu
Ling Pan
VGen
472
29
0
10 Feb 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Jianfeng Dong
Wenjun Zeng
OffRL
979
1
0
22 Jan 2025
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
581
6
0
06 Nov 2024
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
Accelerating Task Generalisation with Multi-Level Skill HierarchiesInternational Conference on Learning Representations (ICLR), 2024
Thomas P Cannon
Özgür Simsek
AI4CE
266
0
0
05 Nov 2024
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive
  Approach
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive ApproachNeural Information Processing Systems (NeurIPS), 2024
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
188
3
0
17 Oct 2024
Improving Generalization on the ProcGen Benchmark with Simple
  Architectural Changes and Scale
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale
Andrew Jesson
Yiding Jiang
OffRL
331
6
0
13 Oct 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of
  Value and Policy Churn
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy ChurnNeural Information Processing Systems (NeurIPS), 2024
Hongyao Tang
Glen Berseth
OffRL
363
11
0
07 Sep 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy
  Gradient Methods
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
132
0
0
18 Jul 2024
Pretraining-finetuning Framework for Efficient Co-design: A Case Study
  on Quadruped Robot Parkour
Pretraining-finetuning Framework for Efficient Co-design: A Case Study on Quadruped Robot Parkour
Ci Chen
Jiyu Yu
Haojian Lu
Hongbo Gao
R. Xiong
Yue Wang
374
6
0
09 Jul 2024
Multi-Task Decision-Making for Multi-User 360 Video Processing over
  Wireless Networks
Multi-Task Decision-Making for Multi-User 360 Video Processing over Wireless Networks
Babak Badnava
Jacob Chakareski
Morteza Hashemi
277
3
0
03 Jul 2024
Explore-Go: Leveraging Exploration for Generalisation in Deep
  Reinforcement Learning
Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning
Max Weltevrede
Felix Kaubek
M. Spaan
Wendelin Bohmer
321
0
0
12 Jun 2024
Representation Learning For Efficient Deep Multi-Agent Reinforcement
  Learning
Representation Learning For Efficient Deep Multi-Agent Reinforcement Learning
Dom Huh
Prasant Mohapatra
283
4
0
05 Jun 2024
Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Riqiang Gao
Florin-Cristian Ghesu
Simon Arberet
Shahab Basiri
Esa Kuusela
Martin Kraus
Dorin Comaniciu
A. Kamen
AI4CE
179
4
0
03 Jun 2024
Phasic Diversity Optimization for Population-Based Reinforcement
  Learning
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
285
1
0
17 Mar 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement
  Learning
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
312
23
0
05 Feb 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning:
  Theory, Algorithms and Implementations
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
356
9
0
24 Jan 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A
  Comprehensive Survey on Hybrid Algorithms
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid AlgorithmsIEEE Transactions on Evolutionary Computation (IEEE Trans. Evol. Comput.), 2024
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
416
63
0
22 Jan 2024
A Survey Analyzing Generalization in Deep Reinforcement Learning
A Survey Analyzing Generalization in Deep Reinforcement Learning
Ezgi Korkmaz
OffRL
341
10
0
04 Jan 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
422
58
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRLOOD
557
9
0
13 Dec 2023
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
K.R. Zentner
Ujjwal Puri
Zhehui Huang
Gaurav Sukhatme
OffRL
249
0
0
08 Dec 2023
DGMem: Learning Visual Navigation Policy without Any Labels by Dynamic
  Graph Memory
DGMem: Learning Visual Navigation Policy without Any Labels by Dynamic Graph Memory
Wenzhe Cai
Teng Wang
Guangran Cheng
Lele Xu
Changyin Sun
397
5
0
30 Nov 2023
C-Procgen: Empowering Procgen with Controllable Contexts
C-Procgen: Empowering Procgen with Controllable Contexts
Zhenxiong Tan
Kaixin Wang
Xinchao Wang
262
2
0
13 Nov 2023
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3
  Tricks
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 TricksNeural Information Processing Systems (NeurIPS), 2023
Ryan Sullivan
Akarsh Kumar
Shengyi Huang
John P. Dickerson
Joseph Suárez
OffRL
227
8
0
26 Oct 2023
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with
  Subgame Curriculum Learning
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Jiayu Chen
Zelai Xu
Yunfei Li
Chao Yu
Jiaming Song
Huazhong Yang
Fei Fang
Yu Wang
Yi Wu
332
7
0
07 Oct 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
RLLTE: Long-Term Evolution Project of Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
260
4
0
28 Sep 2023
Diagnosing and exploiting the computational demands of videos games for
  deep reinforcement learning
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
239
0
0
22 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic ManipulationInformation Fusion (Inf. Fusion), 2023
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
279
29
0
22 Sep 2023
Guide Your Agent with Adaptive Multimodal Rewards
Guide Your Agent with Adaptive Multimodal RewardsNeural Information Processing Systems (NeurIPS), 2023
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
454
12
0
19 Sep 2023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse
  RL
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RLInternational Conference on Learning Representations (ICLR), 2023
Hao Sun
Alihan Huyuk
M. Schaar
OffRLLRM
342
46
0
13 Sep 2023
Discovering Hierarchical Achievements in Reinforcement Learning via
  Contrastive Learning
Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive LearningNeural Information Processing Systems (NeurIPS), 2023
Seungyong Moon
Junyoung Yeom
Bumsoo Park
Hyun Oh Song
OffRL
446
8
0
07 Jul 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
Correcting discount-factor mismatch in on-policy policy gradient methodsInternational Conference on Machine Learning (ICML), 2023
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
158
10
0
23 Jun 2023
Explore to Generalize in Zero-Shot RL
Explore to Generalize in Zero-Shot RLNeural Information Processing Systems (NeurIPS), 2023
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
405
23
0
05 Jun 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Truncating Trajectories in Monte Carlo Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
240
5
0
07 May 2023
DEIR: Efficient and Robust Exploration through
  Discriminative-Model-Based Episodic Intrinsic Rewards
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic RewardsInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Shanchuan Wan
Yujin Tang
Yingtao Tian
Tomoyuki Kaneko
OffRL
184
8
0
21 Apr 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally
  Generated Environments
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated EnvironmentsNeurocomputing (Neurocomputing), 2023
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRLOnRL
240
9
0
18 Apr 2023
CFlowNets: Continuous Control with Generative Flow Networks
CFlowNets: Continuous Control with Generative Flow NetworksInternational Conference on Learning Representations (ICLR), 2023
Yinchuan Li
Shuang Luo
Haozhi Wang
Jianye Hao
343
25
0
04 Mar 2023
Scaling laws for single-agent reinforcement learning
Scaling laws for single-agent reinforcement learning
Jacob Hilton
Jie Tang
John Schulman
357
36
0
31 Jan 2023
12
Next
Page 1 of 2