ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 7,146 papers shown
Title
Assessing Transferability from Simulation to Reality for Reinforcement
  Learning
Assessing Transferability from Simulation to Reality for Reinforcement Learning
Fabio Muratore
Michael Gienger
Jan Peters
19
61
0
10 Jul 2019
Deep Lagrangian Networks for end-to-end learning of energy-based control
  for under-actuated systems
Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems
M. Lutter
Kim D. Listmann
Jan Peters
PINN
26
71
0
10 Jul 2019
Graph Policy Gradients for Large Scale Robot Control
Graph Policy Gradients for Large Scale Robot Control
Arbaaz Khan
Ekaterina V. Tolstaya
Alejandro Ribeiro
Vijay Kumar
10
93
0
08 Jul 2019
Data Efficient Reinforcement Learning for Legged Robots
Data Efficient Reinforcement Learning for Legged Robots
Yuxiang Yang
Ken Caluwaerts
Atil Iscen
Tingnan Zhang
Jie Tan
Vikas Sindhwani
33
139
0
08 Jul 2019
A Review of Robot Learning for Manipulation: Challenges,
  Representations, and Algorithms
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Oliver Kroemer
S. Niekum
George Konidaris
41
356
0
06 Jul 2019
Integration of Imitation Learning using GAIL and Reinforcement Learning
  using Task-achievement Rewards via Probabilistic Graphical Model
Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model
Akira Kinose
T. Taniguchi
35
20
0
03 Jul 2019
Reasoning and Generalization in RL: A Tool Use Perspective
Reasoning and Generalization in RL: A Tool Use Perspective
Sam Wenke
D. Saunders
Mike Qiu
J. Fleming
OffRL
LRM
29
5
0
03 Jul 2019
Co-training for Policy Learning
Co-training for Policy Learning
Jialin Song
Ravi Lanka
Yisong Yue
M. Ono
OffRL
18
19
0
03 Jul 2019
Generalizing from a few environments in safety-critical reinforcement
  learning
Generalizing from a few environments in safety-critical reinforcement learning
Zachary Kenton
Angelos Filos
Owain Evans
Y. Gal
24
16
0
02 Jul 2019
Modified Actor-Critics
Modified Actor-Critics
Erinc Merdivan
S. Hanke
Matthieu Geist
24
2
0
02 Jul 2019
Continual Learning for Robotics: Definition, Framework, Learning
  Strategies, Opportunities and Challenges
Continual Learning for Robotics: Definition, Framework, Learning Strategies, Opportunities and Challenges
Timothée Lesort
Vincenzo Lomonaco
Andrei Stoian
Davide Maltoni
David Filliat
Natalia Díaz Rodríguez
CLL
35
248
0
29 Jun 2019
Demonstration-Guided Deep Reinforcement Learning of Control Policies for
  Dexterous Human-Robot Interaction
Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction
Sammy Christen
Stefan Stevšić
Otmar Hilliges
35
24
0
27 Jun 2019
Hyp-RL : Hyperparameter Optimization by Reinforcement Learning
Hyp-RL : Hyperparameter Optimization by Reinforcement Learning
H. Jomaa
Josif Grabocka
Lars Schmidt-Thieme
25
65
0
27 Jun 2019
Deep Active Learning with Adaptive Acquisition
Deep Active Learning with Adaptive Acquisition
Manuel Haussmann
Fred Hamprecht
M. Kandemir
22
41
0
27 Jun 2019
From self-tuning regulators to reinforcement learning and back again
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutiere
Anders Rantzer
Stephen Tu
29
88
0
27 Jun 2019
Learning Data Augmentation Strategies for Object Detection
Learning Data Augmentation Strategies for Object Detection
Barret Zoph
E. D. Cubuk
Golnaz Ghiasi
Nayeon Lee
Jonathon Shlens
Quoc V. Le
39
523
0
26 Jun 2019
Reinforcement Learning with Competitive Ensembles of
  Information-Constrained Primitives
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Anirudh Goyal
Shagun Sodhani
Jonathan Binas
Xue Bin Peng
Sergey Levine
Yoshua Bengio
24
47
0
25 Jun 2019
Optimistic Proximal Policy Optimization
Optimistic Proximal Policy Optimization
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
15
4
0
25 Jun 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally
  Optimal Policy
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
30
108
0
25 Jun 2019
Modern Deep Reinforcement Learning Algorithms
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. Dýakonov
OffRL
29
39
0
24 Jun 2019
Proximal Distilled Evolutionary Reinforcement Learning
Proximal Distilled Evolutionary Reinforcement Learning
Cristian Bodnar
Ben Day
Pietro Lio
36
71
0
24 Jun 2019
Bias Correction of Learned Generative Models using Likelihood-Free
  Importance Weighting
Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Aditya Grover
Jiaming Song
Alekh Agarwal
Kenneth Tran
Ashish Kapoor
Eric Horvitz
Stefano Ermon
26
123
0
23 Jun 2019
Learning Belief Representations for Imitation Learning in POMDPs
Learning Belief Representations for Imitation Learning in POMDPs
Tanmay Gangwani
Joel Lehman
Qiang Liu
Jian Peng
29
36
0
22 Jun 2019
Entropic Risk Measure in Policy Search
Entropic Risk Measure in Policy Search
David Nass
Boris Belousov
Jan Peters
32
27
0
21 Jun 2019
Variable Impedance Control in End-Effector Space: An Action Space for
  Reinforcement Learning in Contact-Rich Tasks
Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks
Roberto Martín-Martín
Michelle A. Lee
Rachel Gardner
Silvio Savarese
Jeannette Bohg
Animesh Garg
33
194
0
20 Jun 2019
Exploring Model-based Planning with Policy Networks
Exploring Model-based Planning with Policy Networks
Tingwu Wang
Jimmy Ba
46
148
0
20 Jun 2019
Unsupervised Learning of Object Keypoints for Perception and Control
Unsupervised Learning of Object Keypoints for Perception and Control
Tejas D. Kulkarni
Ankush Gupta
Catalin Ionescu
Sebastian Borgeaud
Malcolm Reynolds
Andrew Zisserman
Volodymyr Mnih
SSL
OCL
14
194
0
19 Jun 2019
Unsupervised State Representation Learning in Atari
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
44
254
0
19 Jun 2019
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single
  Observed Demonstration
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration
Brahma S. Pavse
F. Torabi
Josiah P. Hanna
Garrett A. Warnell
Peter Stone
32
33
0
18 Jun 2019
Is the Policy Gradient a Gradient?
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
8
57
0
17 Jun 2019
Learning-Driven Exploration for Reinforcement Learning
Learning-Driven Exploration for Reinforcement Learning
Muhammad Usama
D. Chang
35
10
0
17 Jun 2019
Reinforcement Learning Driven Heuristic Optimization
Reinforcement Learning Driven Heuristic Optimization
Qingpeng Cai
W. Hang
Azalia Mirhoseini
George Tucker
Jingtao Wang
Wei Wei
33
26
0
16 Jun 2019
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Alexander C. Li
Carlos Florensa
I. Clavera
Pieter Abbeel
29
72
0
13 Jun 2019
Deep Reinforcement Learning for Cyber Security
Deep Reinforcement Learning for Cyber Security
Thanh Thi Nguyen
Vijay Janapa Reddi
OffRL
AI4CE
18
315
0
13 Jun 2019
Curriculum Learning for Cumulative Return Maximization
Curriculum Learning for Cumulative Return Maximization
Francesco Foglino
Christiano Coletto Christakou
Ricardo Luna Gutierrez
Matteo Leonetti
34
9
0
13 Jun 2019
Meta-Learning via Learned Loss
Meta-Learning via Learned Loss
Sarah Bechtle
Artem Molchanov
Yevgen Chebotar
Edward Grefenstette
Ludovic Righetti
Gaurav Sukhatme
Franziska Meier
28
110
0
12 Jun 2019
Search on the Replay Buffer: Bridging Planning and Reinforcement
  Learning
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
37
287
0
12 Jun 2019
Fast Task Inference with Variational Intrinsic Successor Features
Fast Task Inference with Variational Intrinsic Successor Features
Steven Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
49
151
0
12 Jun 2019
NAS-FCOS: Fast Neural Architecture Search for Object Detection
NAS-FCOS: Fast Neural Architecture Search for Object Detection
Ning Wang
Yang Gao
Hao Chen
Peng Wang
Zhi Tian
Chunhua Shen
Yanning Zhang
ObjD
31
192
0
11 Jun 2019
Learning Powerful Policies by Using Consistent Dynamics Model
Learning Powerful Policies by Using Consistent Dynamics Model
Shagun Sodhani
Anirudh Goyal
T. Deleu
Yoshua Bengio
Sergey Levine
Jian Tang
OffRL
19
5
0
11 Jun 2019
Learning to Score Behaviors for Guided Policy Optimization
Learning to Score Behaviors for Guided Policy Optimization
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
A. Choromańska
K. Choromanski
Michael I. Jordan
36
38
0
11 Jun 2019
Exploration via Hindsight Goal Generation
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
40
86
0
10 Jun 2019
Self-Supervised Exploration via Disagreement
Self-Supervised Exploration via Disagreement
Deepak Pathak
Dhiraj Gandhi
Abhinav Gupta
SSL
40
375
0
10 Jun 2019
Exploiting the Sign of the Advantage Function to Learn Deterministic
  Policies in Continuous Domains
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains
Matthieu Zimmer
Paul Weng
24
7
0
10 Jun 2019
Ego-Pose Estimation and Forecasting as Real-Time PD Control
Ego-Pose Estimation and Forecasting as Real-Time PD Control
Ye Yuan
Kris Kitani
EgoV
33
123
0
07 Jun 2019
Global Optimality Guarantees For Policy Gradient Methods
Global Optimality Guarantees For Policy Gradient Methods
Jalaj Bhandari
Daniel Russo
44
186
0
05 Jun 2019
Proximal Reliability Optimization for Reinforcement Learning
Proximal Reliability Optimization for Reinforcement Learning
Narendra Patwardhan
Zequn Wang
18
0
0
03 Jun 2019
An Empirical Study on Hyperparameters and their Interdependence for RL
  Generalization
An Empirical Study on Hyperparameters and their Interdependence for RL Generalization
Xingyou Song
Yilun Du
Jacob Jackson
AI4CE
29
8
0
02 Jun 2019
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial
  Robot Visual Navigation
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation
Srivatsan Krishnan
Behzad Boroujerdian
William Fu
Aleksandra Faust
Vijay Janapa Reddi
33
36
0
02 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum
  Linear Quadratic Games
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kai Zhang
Zhuoran Yang
Tamer Basar
37
125
0
31 May 2019
Previous
123...137138139...141142143
Next