Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,424 papers shown
Verifiable Reinforcement Learning via Policy Extraction
Osbert Bastani
Yewen Pu
Armando Solar-Lezama
OffRL
332
371
0
22 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
294
271
0
21 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
221
11
0
20 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
207
70
0
20 May 2018
Deep Dynamical Modeling and Control of Unsteady Fluid Flows
Jeremy Morton
F. Witherden
A. Jameson
Mykel J. Kochenderfer
AI4CE
215
180
0
18 May 2018
Learning Time-Sensitive Strategies in Space Fortress
Akshat Agarwal
Ryan Hope
Katia Sycara
203
0
0
17 May 2018
Task Agnostic Robust Learning on Corrupt Outputs by Correlation-Guided Mixture Density Networks
Sungjoon Choi
Sanghoon Hong
Kyungjae Lee
Sungbin Lim
OOD
254
8
0
16 May 2018
GAN Q-learning
T. Doan
Bogdan Mazoure
Clare Lyle
OOD, OffRL
257
19
0
13 May 2018
Policy Optimization with Second-Order Advantage Information
Jiajin Li
Baoxiang Wang
152
7
0
09 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
325
43
0
09 May 2018
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Yu-Jhe Li
Hsin-Yu Chang
Yu-Jing Lin
Po-Wei Wu
Y. Wang
GAN
88
6
0
05 May 2018
Decoupling Dynamics and Reward for Transfer Learning
Amy Zhang
Harsh Satija
Joelle Pineau
OOD
215
75
0
27 Apr 2018
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
206
45
0
27 Apr 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Jie Tan
Tingnan Zhang
Erwin Coumans
Atil Iscen
Yunfei Bai
Danijar Hafner
Steven Bohez
Vincent Vanhoucke
386
900
0
27 Apr 2018
Distributed Distributional Deterministic Policy Gradients
International Conference on Learning Representations (ICLR), 2018
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
261
520
0
23 Apr 2018
Vehicle Communication Strategies for Simulated Highway Driving
Cinjon Resnick
I. Kulikov
Dong Wang
Jason Weston
161
7
0
19 Apr 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
112
26
0
17 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng
Junhyuk Oh
Satinder Singh
278
224
0
17 Apr 2018
Rafiki: Machine Learning as an Analytics Service System
Wei Wang
Sheng Wang
Jinyang Gao
Meihui Zhang
Gang Chen
Teck Khim Ng
Beng Chin Ooi
248
122
0
17 Apr 2018
Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
A. H. Qureshi
Yutaka Nakamura
Yuichiro Yoshikawa
H. Ishiguro
105
64
0
14 Apr 2018
Reinforcement Learning for UAV Attitude Control
W. Koch
R. Mancuso
R. West
Azer Bestavros
135
438
0
11 Apr 2018
Gotta Learn Fast: A New Benchmark for Generalization in RL
Alex Nichol
Vicki Pfau
Christopher Hesse
Oleg Klimov
John Schulman
VLM, OffRL
187
183
0
10 Apr 2018
Latent Space Policies for Hierarchical Reinforcement Learning
Tuomas Haarnoja
Kristian Hartikainen
Pieter Abbeel
Sergey Levine
BDL
189
202
0
09 Apr 2018
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Xue Bin Peng
Pieter Abbeel
Sergey Levine
M. van de Panne
AI4CE
470
557
0
08 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
270
158
0
06 Apr 2018
Information Maximizing Exploration with a Latent Dynamics Model
Trevor Barron
Oliver Obst
H. B. Amor
97
3
0
04 Apr 2018
Renewal Monte Carlo: Renewal theory based reinforcement learning
Jayakumar Subramanian
Aditya Mahajan
74
12
0
03 Apr 2018
StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning
Youssef Attia El Hili
Yuanheng Zhu
Dongbin Zhao
221
182
0
03 Apr 2018
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
183
146
0
02 Apr 2018
Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Zhewei Huang
Shuchang Zhou
...
Sean F. Carroll
Jennifer Hicks
Sergey Levine
M. Salathé
Scott L. Delp
188
95
0
02 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
167
66
0
31 Mar 2018
Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system
Kendall Lowrey
S. Kolev
Jeremy Dao
Aravind Rajeswaran
E. Todorov
146
61
0
28 Mar 2018
Long short-term memory and learning-to-learn in networks of spiking neurons
G. Bellec
Darjan Salaj
Anand Subramoney
Robert Legenstein
Wolfgang Maass
507
543
0
26 Mar 2018
Neuronal Circuit Policies
Mathias Lechner
Ramin M. Hasani
Radu Grosu
67
12
0
22 Mar 2018
Automated Curriculum Learning by Rewarding Temporally Rare Events
Niels Justesen
S. Risi
OffRL
146
20
0
19 Mar 2018
Simple random search provides a competitive approach to reinforcement learning
Horia Mania
Aurelia Guy
Benjamin Recht
201
330
0
19 Mar 2018
Feedback Control For Cassie With Deep Reinforcement Learning
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018
Zhaoming Xie
Glen Berseth
Patrick Clary
J. Hurst
M. van de Panne
243
199
0
15 Mar 2018
Learning to Explore with Meta-Policy Gradient
International Conference on Machine Learning (ICML), 2018
Tianbing Xu
Qiang Liu
Bo Pan
Jian Peng
149
54
0
13 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Neural Networks (NN), 2018
Olivier Sigaud
F. Stulp
310
76
0
13 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
IEEE Communications Surveys and Tutorials (COMST), 2018
Chaoyun Zhang
P. Patras
Hamed Haddadi
357
1,428
0
12 Mar 2018
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL, OnRL
149
141
0
07 Mar 2018
Transfer Learning with Neural AutoML
Catherine Wong
N. Houlsby
Yifeng Lu
Andrea Gesmundo
263
118
0
07 Mar 2018
Discontinuity-Sensitive Optimal Control Learning by Mixture of Experts
Gao Tang
Kris K. Hauser
99
15
0
07 Mar 2018
Smoothed Action Value Functions for Learning Gaussian Policies
Ofir Nachum
Mohammad Norouzi
George Tucker
Dale Schuurmans
258
30
0
06 Mar 2018
Some Considerations on Learning to Explore via Meta-Reinforcement Learning
Bradly C. Stadie
Ge Yang
Rein Houthooft
Xi Chen
Yan Duan
Yuhuai Wu
Pieter Abbeel
Ilya Sutskever
LRM
192
122
0
03 Mar 2018
Deep Reinforcement Learning for Join Order Enumeration
Ryan Marcus
Olga Papaemmanouil
284
252
0
28 Feb 2018
Model-Ensemble Trust-Region Policy Optimization
International Conference on Learning Representations (ICLR), 2018
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
289
474
0
28 Feb 2018
Computational Theories of Curiosity-Driven Learning
Pierre-Yves Oudeyer
198
71
0
28 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
International Conference on Machine Learning (ICML), 2018
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
296
137
0
27 Feb 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
377
334
0
26 Feb 2018