ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 7,146 papers shown
Title
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to
  Multiple Quadrotors
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
Artem Molchanov
Tao Chen
Wolfgang Hönig
James A. Preiss
Nora Ayanian
Gaurav Sukhatme
24
107
0
11 Mar 2019
Learning to Paint With Model-based Deep Reinforcement Learning
Learning to Paint With Model-based Deep Reinforcement Learning
Zhewei Huang
Wen Heng
Shuchang Zhou
GAN
26
153
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy
  Critics
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
26
17
0
11 Mar 2019
Adaptive Power System Emergency Control using Deep Reinforcement
  Learning
Adaptive Power System Emergency Control using Deep Reinforcement Learning
Qiuhua Huang
Renke Huang
Weituo Hao
Jie Tan
Rui Fan
Zhenyu Huang
22
270
0
09 Mar 2019
Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered
  Scenes
Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes
Bohan Wu
Iretiayo Akinola
Peter K. Allen
22
34
0
08 Mar 2019
Provably Robust Blackbox Optimization for Reinforcement Learning
Provably Robust Blackbox Optimization for Reinforcement Learning
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
Deepali Jain
Yuxiang Yang
Atil Iscen
Jasmine Hsu
Vikas Sindhwani
21
5
0
07 Mar 2019
Using Natural Language for Reward Shaping in Reinforcement Learning
Using Natural Language for Reward Shaping in Reinforcement Learning
Prasoon Goyal
S. Niekum
Raymond J. Mooney
LM&Ro
46
176
0
05 Mar 2019
Deep Active Localization
Deep Active Localization
S. Gottipati
K. Seo
Dhaivat Bhatt
Vincent Mai
Krishna Murthy Jatavallabhula
Liam Paull
21
37
0
05 Mar 2019
Episodic Learning with Control Lyapunov Functions for Uncertain Robotic
  Systems
Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems
Andrew J. Taylor
Victor D. Dorobantu
Hoang Minh Le
Yisong Yue
Aaron D. Ames
117
78
0
04 Mar 2019
Sim-to-Real Transfer for Biped Locomotion
Sim-to-Real Transfer for Biped Locomotion
Wenhao Yu
Visak C. V. Kumar
Greg Turk
Chenxi Liu
12
114
0
04 Mar 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Zhou Fan
Ruilong Su
Weinan Zhang
Yong Yu
19
134
0
04 Mar 2019
Efficient Reinforcement Learning for StarCraft by Abstract Forward
  Models and Transfer Learning
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning
Ruo-Ze Liu
Haifeng Guo
Xiaozhong Ji
Yang Yu
Zhen-Jia Pang
Zitai Xiao
Yuzhou Wu
Tong Lu
OffRL
19
13
0
02 Mar 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention
  across Neural Network Layers
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
30
2
0
27 Feb 2019
Neural Packet Classification
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
43
120
0
27 Feb 2019
Design of intentional backdoors in sequential models
Design of intentional backdoors in sequential models
Zhaoyuan Yang
N. Iyer
Johan Reimann
Nurali Virani
SILM
AAML
25
38
0
26 Feb 2019
Cooperative Learning of Disjoint Syntax and Semantics
Cooperative Learning of Disjoint Syntax and Semantics
Serhii Havrylov
Germán Kruszewski
Armand Joulin
18
48
0
25 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
22
48
0
19 Feb 2019
Neural-encoding Human Experts' Domain Knowledge to Warm Start
  Reinforcement Learning
Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning
Andrew Silva
Matthew C. Gombolay
OffRL
32
20
0
15 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy
  Observations
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
30
9
0
15 Feb 2019
Learn a Prior for RHEA for Better Online Planning
Learn a Prior for RHEA for Better Online Planning
Xinyao Tong
W. Liu
Bin Li
OffRL
64
0
0
14 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
19
9
0
14 Feb 2019
Deep Reinforcement Learning from Policy-Dependent Human Feedback
Deep Reinforcement Learning from Policy-Dependent Human Feedback
Dilip Arumugam
Jun Ki Lee
S. Saskin
Michael L. Littman
28
94
0
12 Feb 2019
VERIFAI: A Toolkit for the Design and Analysis of Artificial
  Intelligence-Based Systems
VERIFAI: A Toolkit for the Design and Analysis of Artificial Intelligence-Based Systems
T. Dreossi
Daniel J. Fremont
Shromona Ghosh
Edward J. Kim
H. Ravanbakhsh
Marcell Vazquez-Chanlatte
Sanjit A. Seshia
18
29
0
12 Feb 2019
Artificial Intelligence for Prosthetics - challenge solutions
Artificial Intelligence for Prosthetics - challenge solutions
L. Kidzinski
Carmichael F. Ong
Sharada Mohanty
Jennifer Hicks
Sean F. Carroll
...
E. Tumer
J. Watson
M. Salathé
Sergey Levine
Scott L. Delp
20
40
0
07 Feb 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco M. Garcia
Philip S. Thomas
24
38
0
03 Feb 2019
Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy
  Reinforcement Learning
Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning
Kyungjae Lee
Sungyub Kim
Sungbin Lim
Sungjoon Choi
Songhwai Oh
27
28
0
31 Jan 2019
Improving Evolutionary Strategies with Generative Neural Networks
Improving Evolutionary Strategies with Generative Neural Networks
Louis Faury
Clément Calauzènes
Olivier Fercoq
Syrine Krichene
27
12
0
31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
363
0
30 Jan 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
119
0
29 Jan 2019
Lyapunov-based Safe Policy Optimization for Continuous Control
Lyapunov-based Safe Policy Optimization for Continuous Control
Yinlam Chow
Ofir Nachum
Aleksandra Faust
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
33
244
0
28 Jan 2019
Designing a Multi-Objective Reward Function for Creating Teams of
  Robotic Bodyguards Using Deep Reinforcement Learning
Designing a Multi-Objective Reward Function for Creating Teams of Robotic Bodyguards Using Deep Reinforcement Learning
Hassam Sheikh
Ladislau Bölöni
20
3
0
28 Jan 2019
The Assistive Multi-Armed Bandit
The Assistive Multi-Armed Bandit
Lawrence Chan
Dylan Hadfield-Menell
S. Srinivasa
Anca Dragan
14
36
0
24 Jan 2019
Ablation Studies in Artificial Neural Networks
Ablation Studies in Artificial Neural Networks
Richard Meyes
Melanie Lu
Constantin Waubert de Puiseau
Tobias Meisen
16
211
0
24 Jan 2019
Distillation Strategies for Proximal Policy Optimization
Distillation Strategies for Proximal Policy Optimization
Sam Green
C. Vineyard
Ç. Koç
27
8
0
23 Jan 2019
Hierarchical Reinforcement Learning for Multi-agent MOBA Game
Hierarchical Reinforcement Learning for Multi-agent MOBA Game
Zhijian Zhang
Haozheng Li
Lu Zhang
Tianyin Zheng
Ting Zhang
Xiong Hao
Xiaoxin Chen
Min Chen
Fangxu Xiao
Wei Zhou
17
15
0
23 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
24
7
0
23 Jan 2019
Neuroflight: Next Generation Flight Control Firmware
Neuroflight: Next Generation Flight Control Firmware
W. Koch
R. Mancuso
Azer Bestavros
41
29
0
19 Jan 2019
On-Policy Trust Region Policy Optimisation with Replay Buffers
On-Policy Trust Region Policy Optimisation with Replay Buffers
D. Kangin
N. Pugeault
OffRL
19
3
0
18 Jan 2019
Imitation-Regularized Offline Learning
Imitation-Regularized Offline Learning
Yifei Ma
Yu Wang
Balakrishnan
Balakrishnan Narayanaswamy
OffRL
27
22
0
15 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep
  Reinforcement Learning
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
Ameer Haj-Ali
Qijing Huang
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
29
36
0
15 Jan 2019
Multi-Objective Reinforced Evolution in Mobile Neural Architecture
  Search
Multi-Objective Reinforced Evolution in Mobile Neural Architecture Search
Xiangxiang Chu
Bo Zhang
Ruijun Xu
Hailong Ma
36
98
0
04 Jan 2019
A Theoretical Analysis of Deep Q-Learning
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
46
597
0
01 Jan 2019
Mid-Level Visual Representations Improve Generalization and Sample
  Efficiency for Learning Visuomotor Policies
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies
Alexander Sax
Bradley Emi
Amir Zamir
Leonidas Guibas
Silvio Savarese
Jitendra Malik
SSL
49
16
0
31 Dec 2018
Learn to Interpret Atari Agents
Learn to Interpret Atari Agents
Zhao Yang
S. Bai
Li Zhang
Philip Torr
22
28
0
29 Dec 2018
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
54
433
0
26 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for
  Model-based Control
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
29
5
0
24 Dec 2018
TD-Regularized Actor-Critic Methods
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
32
31
0
19 Dec 2018
Hierarchical Macro Strategy Model for MOBA Game AI
Hierarchical Macro Strategy Model for MOBA Game AI
Bin Wu
Qiang Fu
Jing Liang
Peng-fei Qu
Xiaoqian Li
Liang Wang
Wei Liu
Wei Yang
Yongsheng Liu
34
63
0
19 Dec 2018
Learning Montezuma's Revenge from a Single Demonstration
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
44
136
0
08 Dec 2018
Communication-Efficient Policy Gradient Methods for Distributed
  Reinforcement Learning
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning
Tianyi Chen
Kai Zhang
G. Giannakis
Tamer Basar
OffRL
29
41
0
07 Dec 2018
Previous
123...139140141142143
Next