ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms
v1v2 (latest)

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 11,422 papers shown
A DIRT-T Approach to Unsupervised Domain Adaptation
A DIRT-T Approach to Unsupervised Domain Adaptation
Rui Shu
Hung Bui
Hirokazu Narui
Stefano Ermon
187
653
0
23 Feb 2018
Verifying Controllers Against Adversarial Examples with Bayesian
  Optimization
Verifying Controllers Against Adversarial Examples with Bayesian Optimization
Shromona Ghosh
Felix Berkenkamp
G. Ranade
S. Qadeer
Ashish Kapoor
AAML
163
46
0
23 Feb 2018
Structured Control Nets for Deep Reinforcement Learning
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
154
44
0
22 Feb 2018
Clipped Action Policy Gradient
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
126
40
0
21 Feb 2018
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Nick Haber
Damian Mrowca
Li Fei-Fei
Daniel L. K. Yamins
LRM
189
122
0
21 Feb 2018
Fourier Policy Gradients
Fourier Policy Gradients
M. Fellows
K. Ciosek
Shimon Whiteson
164
15
0
19 Feb 2018
Learning High-level Representations from Demonstrations
Learning High-level Representations from Demonstrations
Garrett Andersen
Peter Vrancx
Haitham Bou-Ammar
125
4
0
19 Feb 2018
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement
  Learning Algorithms
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
302
165
0
14 Feb 2018
Evolved Policy Gradients
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
400
235
0
13 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
220
138
0
13 Feb 2018
Learning Robust and Adaptive Real-World Continuous Control Using Simulation and Transfer Learning
M. Ferguson
K. Law
75
3
0
13 Feb 2018
Hierarchical Learning for Modular Robots
Hierarchical Learning for Modular Robots
R. Kojcev
Nora Etxezarreta
Alejandro Hernández
Víctor Mayoral
112
4
0
12 Feb 2018
Towards self-adaptable robots: from programming to training machines
Towards self-adaptable robots: from programming to training machines
Víctor Mayoral
R. Kojcev
Nora Etxezarreta
Alejandro Hernández
I. Zamalloa
91
5
0
12 Feb 2018
VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control
VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control
Jingwei Zhang
L. Tai
Peng Yun
Yufeng Xiong
Ming-Yuan Liu
Joschka Boedecker
Wolfram Burgard
509
132
0
01 Feb 2018
Learning Symmetric and Low-energy Locomotion
Learning Symmetric and Low-energy Locomotion
Wenhao Yu
Greg Turk
Chenxi Liu
357
207
0
24 Jan 2018
An Empirical Analysis of Proximal Policy Optimization with
  Kronecker-factored Natural Gradients
An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients
Jiaming Song
Yuhuai Wu
70
2
0
17 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
108
0
0
11 Jan 2018
Expected Policy Gradients for Reinforcement Learning
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
296
60
0
10 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games
  in 21 minutes
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
200
37
0
09 Jan 2018
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal
  Demonstrations
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
Xingyu Wang
Diego Klabjan
114
44
0
07 Jan 2018
Jointly Learning to Construct and Control Agents using Deep
  Reinforcement Learning
Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning
Charles B. Schaff
David Yunis
Ayan Chakrabarti
Matthew R. Walter
263
119
0
04 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
2.5K
10,124
0
04 Jan 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function
  Approximation
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
293
25
0
29 Dec 2017
Boosting the Actor with Dual Critic
Boosting the Actor with Dual CriticInternational Conference on Learning Representations (ICLR), 2017
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
152
46
0
29 Dec 2017
RLlib: Abstractions for Distributed Reinforcement Learning
RLlib: Abstractions for Distributed Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2017
Eric Liang
Richard Liaw
Philipp Moritz
Robert Nishihara
Roy Fox
Ken Goldberg
Joseph E. Gonzalez
Michael I. Jordan
Ion Stoica
OffRLAI4CE
305
178
0
26 Dec 2017
Safe Policy Improvement with Baseline Bootstrapping
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche
P. Trichelair
Rémi Tachet des Combes
OffRL
384
214
0
19 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative
  for Training Deep Neural Networks for Reinforcement Learning
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
438
735
0
18 Dec 2017
Ray: A Distributed Framework for Emerging AI Applications
Ray: A Distributed Framework for Emerging AI Applications
Philipp Moritz
Robert Nishihara
Stephanie Wang
Alexey Tumanov
Richard Liaw
...
Melih Elibol
Zongheng Yang
William Paul
Sai Li
Ion Stoica
GNN
490
1,496
0
16 Dec 2017
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Peter Henderson
T. Doan
Riashat Islam
David Meger
BDL
120
13
0
06 Dec 2017
A Deeper Look at Experience Replay
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRLVLM
370
306
0
04 Dec 2017
Progressive Neural Architecture Search
Progressive Neural Architecture Search
Chenxi Liu
Barret Zoph
Maxim Neumann
Jonathon Shlens
Wei Hua
Li Li
Li Fei-Fei
Alan Yuille
Jonathan Huang
Kevin Patrick Murphy
499
2,097
0
02 Dec 2017
Time Limits in Reinforcement Learning
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
288
175
0
01 Dec 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in
  Continuous Control
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
145
11
0
30 Nov 2017
Learnings Options End-to-End for Continuous Action Tasks
Learnings Options End-to-End for Continuous Action Tasks
Martin Klissarov
Pierre-Luc Bacon
J. Harb
Doina Precup
194
59
0
30 Nov 2017
Automating Vehicles by Deep Reinforcement Learning using Task Separation
  with Hill Climbing
Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing
M. Plessen
94
13
0
29 Nov 2017
Cascade Attribute Learning Network
Cascade Attribute Learning Network
Zhuo Xu
Haonan Chang
Masayoshi Tomizuka
91
4
0
24 Nov 2017
Action Branching Architectures for Deep Reinforcement Learning
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
255
299
0
24 Nov 2017
Run, skeleton, run: skeletal model in a physics-based simulation
Run, skeleton, run: skeletal model in a physics-based simulation
Mikhail Pavlov
Sergey Kolesnikov
Sergey Plis
AI4CE
203
15
0
18 Nov 2017
Worm-level Control through Search-based Reinforcement Learning
Worm-level Control through Search-based Reinforcement Learning
Mathias Lechner
Radu Grosu
Ramin M. Hasani
31
5
0
09 Nov 2017
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
M. Raghu
A. Irpan
Jacob Andreas
Robert D. Kleinberg
Quoc V. Le
Jon M. Kleinberg
496
29
0
07 Nov 2017
Policy Optimization by Genetic Distillation
Policy Optimization by Genetic Distillation
Tanmay Gangwani
Jian-wei Peng
189
19
0
03 Nov 2017
Automata-Guided Hierarchical Reinforcement Learning for Skill
  Composition
Automata-Guided Hierarchical Reinforcement Learning for Skill Composition
Xiao Li
Yao Ma
C. Belta
51
5
0
31 Oct 2017
Action-depedent Control Variates for Policy Optimization via Stein's
  Identity
Action-depedent Control Variates for Policy Optimization via Stein's Identity
Hao Liu
Yihao Feng
Yi Mao
Dengyong Zhou
Jian-wei Peng
Qiang Liu
277
4
0
30 Oct 2017
Transfer Learning to Learn with Multitask Neural Model Search
Transfer Learning to Learn with Multitask Neural Model Search
Catherine Wong
Andrea Gesmundo
88
5
0
30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep
  Reinforcement Learning
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
345
31
0
28 Oct 2017
Meta Learning Shared Hierarchies
Meta Learning Shared HierarchiesInternational Conference on Learning Representations (ICLR), 2017
Kevin Frans
Jonathan Ho
Xi Chen
Pieter Abbeel
John Schulman
200
372
0
26 Oct 2017
Deep Imitation Learning for Complex Manipulation Tasks from Virtual
  Reality Teleoperation
Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation
Tianhao Zhang
Zoe McCarthy
Owen Jow
Dennis Lee
Xi Chen
Ken Goldberg
Pieter Abbeel
SSL
418
724
0
12 Oct 2017
Emergent Complexity via Multi-Agent Competition
Emergent Complexity via Multi-Agent Competition
Trapit Bansal
J. Pachocki
Szymon Sidor
Ilya Sutskever
Igor Mordatch
295
418
0
10 Oct 2017
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive
  Environments
Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
Maruan Al-Shedivat
Trapit Bansal
Yuri Burda
Ilya Sutskever
Igor Mordatch
Pieter Abbeel
CLL
219
367
0
10 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on
  Rough Terrain Challenge
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge
Doo Re Song
Chuanyu Yang
C. McGreavy
Zhibin Li
370
32
0
08 Oct 2017
Previous
123...227228229
Next