1707.06347
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 11,421 papers shown
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022
Yameng Zhang
Long Bai
Li Liu
Hongliang Ren
Max Q.-H. Meng
198
10
0
18 May 2023
Actor-Critic Methods using Physics-Informed Neural Networks: Control of a 1D PDE Model for Fluid-Cooled Battery Packs
Amartya Mukherjee
Jun Liu
126
2
0
18 May 2023
Lyapunov-Driven Deep Reinforcement Learning for Edge Inference Empowered by Reconfigurable Intelligent Surfaces
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Kyriakos Stylianopoulos
Mattia Merluzzi
P. Lorenzo
G. C. Alexandropoulos
208
11
0
18 May 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
189
13
0
18 May 2023
A Unified Framework for Integrating Semantic Communication and AI-Generated Content in Metaverse
IEEE Network (IEEE Netw.), 2023
Yi-Lan Lin
Zhipeng Gao
Hongyang Du
Dusit Niyato
Jiawen Kang
Abbas Jamalipour
X. Shen
158
30
0
18 May 2023
Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction
Shulu Chen
A. Evans
Marc Brittain
Peng Wei
133
27
0
17 May 2023
SLiC-HF: Sequence Likelihood Calibration with Human Feedback
Yao-Min Zhao
Rishabh Joshi
Tianqi Liu
Misha Khalman
Mohammad Saleh
Peter J. Liu
233
375
0
17 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Neural Information Processing Systems (NeurIPS), 2023
Hanna Ziesche
Leonel Rozo
201
9
0
17 May 2023
Multi-Agent Reinforcement Learning: Methods, Applications, Visionary Prospects, and Challenges
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Ziyuan Zhou
Guanjun Liu
Ying-Si Tang
299
34
0
17 May 2023
Model-based Validation as Probabilistic Inference
Conference on Learning for Dynamics & Control (L4DC), 2023
Harrison Delecki
Anthony Corso
Mykel J. Kochenderfer
126
8
0
17 May 2023
Coagent Networks: Generalized and Scaled
James E. Kostas
Scott M. Jordan
Yash Chandak
Georgios Theocharous
Dhawal Gupta
Martha White
Bruno Castro da Silva
Philip S. Thomas
OffRL
99
0
0
16 May 2023
Addressing computational challenges in physical system simulations with machine learning
S. Ahamed
M. Uddin
AI4CE
180
2
0
16 May 2023
Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection
Journal of Supercomputing (JS), 2023
Amin Sarihi
Ahmad Patooghy
Peter Jamieson
Abdel-Hameed A. Badawy
190
14
0
16 May 2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
Jiaming Ji
Jiayi Zhou
Borong Zhang
Juntao Dai
Xuehai Pan
Ruiyang Sun
Weidong Huang
Yiran Geng
Mickel Liu
Yaodong Yang
OffRL
283
79
0
16 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
Neural Information Processing Systems (NeurIPS), 2023
Toru Lin
Allan Jabri
OffRL
260
7
0
15 May 2023
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Afra Feyza Akyürek
Ekin Akyürek
Aman Madaan
Ashwin Kalyan
Peter Clark
Derry Wijaya
Niket Tandon
ALM
KELM
299
122
0
15 May 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Neural Information Processing Systems (NeurIPS), 2023
Han Zhong
Tong Zhang
282
37
0
15 May 2023
AcroMonk: A Minimalist Underactuated Brachiating Robot
IEEE Robotics and Automation Letters (RA-L), 2023
M. Javadi
Daniel Harnack
P. Stocco
Shivesh Kumar
S. Vyas
Daniel Pizzutilo
Frank Kirchner
158
14
0
15 May 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
International Conference on Machine Learning (ICML), 2023
Tal Lancewicki
Aviv A. Rosenberg
Dmitry Sotnikov
186
3
0
13 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
230
5
0
13 May 2023
Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms
Jinyang Jiang
Jiaqiao Hu
Yijie Peng
160
5
0
12 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
276
4
0
11 May 2023
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm
Ukjo Hwang
Songnam Hong
242
1
0
11 May 2023
Neural Lyapunov Control for Discrete-Time Systems
Neural Information Processing Systems (NeurIPS), 2023
Junlin Wu
Andrew Clark
Y. Kantaros
Yevgeniy Vorobeychik
327
37
0
11 May 2023
GFlowNets with Human Feedback
Yinchuan Li
Shuang Luo
Yunfeng Shao
Jianye Hao
AI4CE
172
5
0
11 May 2023
Perpetual Humanoid Control for Real-time Simulated Avatars
IEEE International Conference on Computer Vision (ICCV), 2023
Zhengyi Luo
Jinkun Cao
Alexander Winkler
Kris Kitani
Weipeng Xu
482
189
0
10 May 2023
HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning
Wireless Network Security (WiSec), 2023
Chong Guan
Heting Liu
Guohong Cao
Sencun Zhu
T. L. La Porta
81
16
0
10 May 2023
Towards Scalable Adaptive Learning with Graph Neural Networks and Reinforcement Learning
Educational Data Mining (EDM), 2023
Jean Vassoyan
Jill-Jênn Vie
Pirmin Lemberger
GNN
67
8
0
10 May 2023
Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning
Physical Review Applied (Phys. Rev. Appl.), 2023
V. P. Su
ChunJun Cao
Hong-Ye Hu
Y. Yanay
C. Tahan
Brian Swingle
159
24
0
10 May 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Frontiers in Neuroscience (Front. Neurosci.), 2023
Xiyun Li
Ziyi Ni
Jingqing Ruan
Linghui Meng
Jing Shi
Tielin Zhang
Bo Xu
222
8
0
10 May 2023
Assessment of Reinforcement Learning Algorithms for Nuclear Power Plant Fuel Optimization
Paul Seurin
K. Shirvan
267
14
0
09 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
IEEE International Joint Conference on Neural Networks (IJCNN), 2023
Homayoon Farrahi
Rupam Mahmood
172
5
0
09 May 2023
DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated Objects
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Bao
Helin Xu
Yuzhe Qin
Xiaolong Wang
227
59
0
09 May 2023
Fine-tuning Language Models with Generative Adversarial Reward Modelling
Z. Yu
Lau Jia Jaw
Zhang Hui
Bryan Kian Hsiang Low
ALM
219
6
0
09 May 2023
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning
Adam Michalski
Filippos Christianos
Stefano V. Albrecht
117
6
0
09 May 2023
Safe Deep RL for Intraoperative Planning of Pedicle Screw Placement
Yu Ao
H. Esfandiari
F. Carrillo
Yarden As
Mazda Farshad
Benjamin Grewe
Andreas Krause
Philipp Fuernstahl
211
1
0
09 May 2023
Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Runqing Wang
G. Wang
Jian Sun
Fang Deng
Jie Chen
246
92
0
09 May 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
International Conference on Learning Representations (ICLR), 2023
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Sijin Yu
Jiangcheng Zhu
Hao Wang
Shutao Xia
206
23
0
09 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
199
7
0
08 May 2023
Enhancing Knowledge Graph Construction Using Large Language Models
Milena Trajanoska
Riste Stojanov
D. Trajanov
170
75
0
08 May 2023
Adaptive Learning Path Navigation Based on Knowledge Tracing and Reinforcement Learning
Jyun-Yi Chen
Saeed Saeedvand
I-Wei Lai
65
10
0
08 May 2023
Generalized Universal Domain Adaptation with Generative Flow Networks
ACM Multimedia (ACM MM), 2023
Didi Zhu
Yinchuan Li
Yunfeng Shao
Jianye Hao
Leilei Gan
Kun Kuang
Jun Xiao
Chao Wu
AI4CE
OOD
250
24
0
08 May 2023
DeformerNet: Learning Bimanual Manipulation of 3D Deformable Objects
Bao Thach
Brian Y. Cho
Shing-Hei Ho
Tucker Hermans
Alan Kuntz
274
6
0
08 May 2023
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
Letian Wang
Jie Liu
Hao Shao
Wenshuo Wang
Ruobing Chen
Y. Liu
Steven L. Waslander
239
47
0
08 May 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
174
5
0
07 May 2023
Explaining RL Decisions with Trajectories
International Conference on Learning Representations (ICLR), 2023
Shripad Deshmukh
Arpan Dasgupta
Balaji Krishnamurthy
Nan Jiang
Chirag Agarwal
Georgios Theocharous
J. Subramanian
OffRL
204
7
0
06 May 2023
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular Procedures: A Systematic Review
IEEE Transactions on Robotics (TRO), 2023
Ameya Pore
Zhen Li
Diego Dall'Alba
A. Hernansanz
Elena De Momi
A. Menciassi
Alicia Casals Gelpí
J. Dankelman
Paolo Fiorini
E. V. Poorten
159
49
0
06 May 2023
Reducing Idleness in Financial Cloud Services via Multi-objective Evolutionary Reinforcement Learning based Load Balancer
Science China Information Sciences (Sci China Inf Sci), 2023
Peng Yang
Laoming Zhang
Haifeng Liu
Guiying Li
171
47
0
05 May 2023
Towards Applying Powerful Large AI Models in Classroom Teaching: Opportunities, Challenges and Prospects
Kehui Tan
Tianqi Pang
Chenyou Fan
Song Yu
160
19
0
05 May 2023
Composite Motion Learning with Task Control
ACM Transactions on Graphics (TOG), 2023
Pei Xu
Xiumin Shang
Victor Zordan
Ioannis Karamouzas
238
42
0
05 May 2023