Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,421 papers shown
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022
Yameng Zhang
Long Bai
Li Liu
Hongliang Ren
Max Q.-H. Meng
198
10
0
18 May 2023
Actor-Critic Methods using Physics-Informed Neural Networks: Control of a 1D PDE Model for Fluid-Cooled Battery Packs
Amartya Mukherjee
Jun Liu
126
2
0
18 May 2023
Lyapunov-Driven Deep Reinforcement Learning for Edge Inference Empowered by Reconfigurable Intelligent Surfaces
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Kyriakos Stylianopoulos
Mattia Merluzzi
P. Lorenzo
G. C. Alexandropoulos
208
11
0
18 May 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
189
13
0
18 May 2023
A Unified Framework for Integrating Semantic Communication and AI-Generated Content in Metaverse
IEEE Network (IEEE Netw.), 2023
Yi-Lan Lin
Zhipeng Gao
Hongyang Du
Dusit Niyato
Jiawen Kang
Abbas Jamalipour
X. Shen
158
30
0
18 May 2023
Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction
Shulu Chen
A. Evans
Marc Brittain
Peng Wei
133
27
0
17 May 2023
SLiC-HF: Sequence Likelihood Calibration with Human Feedback
Yao-Min Zhao
Rishabh Joshi
Tianqi Liu
Misha Khalman
Mohammad Saleh
Peter J. Liu
233
375
0
17 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Neural Information Processing Systems (NeurIPS), 2023
Hanna Ziesche
Leonel Rozo
201
9
0
17 May 2023
Multi-Agent Reinforcement Learning: Methods, Applications, Visionary Prospects, and Challenges
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Ziyuan Zhou
Guanjun Liu
Ying-Si Tang
299
34
0
17 May 2023
Model-based Validation as Probabilistic Inference
Conference on Learning for Dynamics & Control (L4DC), 2023
Harrison Delecki
Anthony Corso
Mykel J. Kochenderfer
126
8
0
17 May 2023
Coagent Networks: Generalized and Scaled
James E. Kostas
Scott M. Jordan
Yash Chandak
Georgios Theocharous
Dhawal Gupta
Martha White
Bruno Castro da Silva
Philip S. Thomas
OffRL
99
0
0
16 May 2023
Addressing computational challenges in physical system simulations with machine learning
S. Ahamed
M. Uddin
AI4CE
180
2
0
16 May 2023
Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection
Journal of Supercomputing (JS), 2023
Amin Sarihi
Ahmad Patooghy
Peter Jamieson
Abdel-Hameed A. Badawy
190
14
0
16 May 2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
Jiaming Ji
Jiayi Zhou
Borong Zhang
Juntao Dai
Xuehai Pan
Ruiyang Sun
Weidong Huang
Yiran Geng
Mickel Liu
Yaodong Yang
OffRL
283
79
0
16 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
Neural Information Processing Systems (NeurIPS), 2023
Toru Lin
Allan Jabri
OffRL
260
7
0
15 May 2023
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Afra Feyza Akyürek
Ekin Akyürek
Aman Madaan
Ashwin Kalyan
Peter Clark
Derry Wijaya
Niket Tandon
ALM
KELM
299
122
0
15 May 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Neural Information Processing Systems (NeurIPS), 2023
Han Zhong
Tong Zhang
282
37
0
15 May 2023
AcroMonk: A Minimalist Underactuated Brachiating Robot
IEEE Robotics and Automation Letters (RA-L), 2023
M. Javadi
Daniel Harnack
P. Stocco
Shivesh Kumar
S. Vyas
Daniel Pizzutilo
Frank Kirchner
158
14
0
15 May 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
International Conference on Machine Learning (ICML), 2023
Tal Lancewicki
Aviv A. Rosenberg
Dmitry Sotnikov
186
3
0
13 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
230
5
0
13 May 2023
Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms
Jinyang Jiang
Jiaqiao Hu
Yijie Peng
160
5
0
12 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
276
4
0
11 May 2023
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm
Ukjo Hwang
Songnam Hong
242
1
0
11 May 2023
Neural Lyapunov Control for Discrete-Time Systems
Neural Information Processing Systems (NeurIPS), 2023
Junlin Wu
Andrew Clark
Y. Kantaros
Yevgeniy Vorobeychik
327
37
0
11 May 2023
GFlowNets with Human Feedback
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
AI4CE
172
5
0
11 May 2023
Perpetual Humanoid Control for Real-time Simulated Avatars
IEEE International Conference on Computer Vision (ICCV), 2023
Zhengyi Luo
Jinkun Cao
Alexander Winkler
Kris Kitani
Weipeng Xu
482
189
0
10 May 2023
HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning
Wireless Network Security (WiSec), 2023
Chong Guan
Heting Liu
Guohong Cao
Sencun Zhu
T. L. La Porta
81
16
0
10 May 2023
Towards Scalable Adaptive Learning with Graph Neural Networks and Reinforcement Learning
Educational Data Mining (EDM), 2023
Jean Vassoyan
Jill-Jênn Vie
Pirmin Lemberger
GNN
67
8
0
10 May 2023
Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning
Physical Review Applied (Phys. Rev. Appl.), 2023
V. P. Su
ChunJun Cao
Hong-Ye Hu
Y. Yanay
C. Tahan
Brian Swingle
159
24
0
10 May 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Frontiers in Neuroscience (Front. Neurosci.), 2023
Xiyun Li
Ziyi Ni
Jingqing Ruan
Linghui Meng
Jing Shi
Tielin Zhang
Bo Xu
222
8
0
10 May 2023
Assessment of Reinforcement Learning Algorithms for Nuclear Power Plant Fuel Optimization
Paul Seurin
K. Shirvan
267
14
0
09 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Homayoon Farrahi
Rupam Mahmood
172
5
0
09 May 2023
DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated Objects
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Bao
Helin Xu
Yuzhe Qin
Xiaolong Wang
227
59
0
09 May 2023
Fine-tuning Language Models with Generative Adversarial Reward Modelling
Z. Yu
Lau Jia Jaw
Zhang Hui
Bryan Kian Hsiang Low
ALM
219
6
0
09 May 2023
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning
Adam Michalski
Filippos Christianos
Stefano V. Albrecht
117
6
0
09 May 2023
Safe Deep RL for Intraoperative Planning of Pedicle Screw Placement
Yu Ao
H. Esfandiari
F. Carrillo
Yarden As
Mazda Farshad
Benjamin Grewe
Andreas Krause
Philipp Fuernstahl
211
1
0
09 May 2023
Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Runqing Wang
G. Wang
Jian Sun
Fang Deng
Jie Chen
246
92
0
09 May 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
International Conference on Learning Representations (ICLR), 2023
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Sijin Yu
Jiangcheng Zhu
Hao Wang
Shutao Xia
206
23
0
09 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
199
7
0
08 May 2023
Enhancing Knowledge Graph Construction Using Large Language Models
Milena Trajanoska
Riste Stojanov
D. Trajanov
170
75
0
08 May 2023
Adaptive Learning Path Navigation Based on Knowledge Tracing and Reinforcement Learning
Jyun-Yi Chen
Saeed Saeedvand
I-Wei Lai
65
10
0
08 May 2023
Generalized Universal Domain Adaptation with Generative Flow Networks
ACM Multimedia (ACM MM), 2023
Didi Zhu
Yinchuan Li
Yunfeng Shao
Jianye Hao
Leilei Gan
Kun Kuang
Jun Xiao
Chao Wu
AI4CE
OOD
250
24
0
08 May 2023
DeformerNet: Learning Bimanual Manipulation of 3D Deformable Objects
Bao Thach
Brian Y. Cho
Shing-Hei Ho
Tucker Hermans
Alan Kuntz
274
6
0
08 May 2023
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
Letian Wang
Jie Liu
Hao Shao
Wenshuo Wang
Ruobing Chen
Y. Liu
Steven L. Waslander
239
47
0
08 May 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
174
5
0
07 May 2023
Explaining RL Decisions with Trajectories
International Conference on Learning Representations (ICLR), 2023
Shripad Deshmukh
Arpan Dasgupta
Balaji Krishnamurthy
Nan Jiang
Chirag Agarwal
Georgios Theocharous
J. Subramanian
OffRL
204
7
0
06 May 2023
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular Procedures: A Systematic Review
IEEE Transactions on Robotics (TRO), 2023
Ameya Pore
Zhen Li
Diego Dall'Alba
A. Hernansanz
Elena De Momi
A. Menciassi
Alicia Casals Gelpí
J. Dankelman
Paolo Fiorini
E. V. Poorten
159
49
0
06 May 2023
Reducing Idleness in Financial Cloud Services via Multi-objective Evolutionary Reinforcement Learning based Load Balancer
Science China Information Sciences (Sci China Inf Sci), 2023
Peng Yang
Laoming Zhang
Haifeng Liu
Guiying Li
171
47
0
05 May 2023
Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects
Kehui Tan
Tianqi Pang
Chenyou Fan
Song Yu
160
19
0
05 May 2023
Composite Motion Learning with Task Control
ACM Transactions on Graphics (TOG), 2023
Pei Xu
Xiumin Shang
Victor Zordan
Ioannis Karamouzas
238
42
0
05 May 2023