ResearchTrend.AI
Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL

Papers citing "Proximal Policy Optimization Algorithms"

50 / 7,399 papers shown
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
30
72
0
29 Apr 2019
Deep Neuroevolution of Recurrent and Discrete World Models
S. Risi
Kenneth O. Stanley
OCL
27
53
0
28 Apr 2019
Arbitrage of Energy Storage in Electricity Markets with Deep Reinforcement Learning
Hanchen Xu
Xiao Li
Xiangyu Zhang
Junbo Zhang
20
26
0
28 Apr 2019
How You Act Tells a Lot: Privacy-Leakage Attack on Deep Reinforcement Learning
Xinlei Pan
Weiyao Wang
Xiaoshuai Zhang
Yue Liu
Jinfeng Yi
D. Song
MIACV
77
26
0
24 Apr 2019
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
32
71
0
24 Apr 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
30
263
0
20 Apr 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
Sungjin Lee
Qi Zhu
Ryuichi Takanobu
Xiang Li
Yaoqin Zhang
...
Jinchao Li
Baolin Peng
Xiujun Li
Minlie Huang
Jianfeng Gao
VLM
29
110
0
18 Apr 2019
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
31
12
0
17 Apr 2019
Energy-Efficient Slithering Gait Exploration for a Snake-like Robot based on Reinforcement Learning
Zhenshan Bing
Christian Lemke
Zhuangyi Jiang
Kai-Qi Huang
Alois Knoll
16
17
0
16 Apr 2019
Multi-Objective Autonomous Braking System using Naturalistic Dataset
Rafael Vasquez
Bilal Farooq
11
10
0
15 Apr 2019
Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control
Chen Liang
Weihong Wang
Zhenghua Liu
Chao Lai
Benchun Zhou
24
28
0
15 Apr 2019
Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments
Kaleigh Clary
Emma Tosch
John Foley
David D. Jensen
9
17
0
12 Apr 2019
Knowledge Flow: Improve Upon Your Teachers
Iou-Jen Liu
Jian-wei Peng
Alex Schwing
21
62
0
11 Apr 2019
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey
Yoshiharu Sato
OffRL
24
32
0
10 Apr 2019
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Thomas W. Anthony
Robert Nishihara
Philipp Moritz
Tim Salimans
John Schulman
37
30
0
07 Apr 2019
Reinforcement Learning with Attention that Works: A Self-Supervised Approach
Anthony Manchin
Ehsan Abbasnejad
Anton Van Den Hengel
30
60
0
06 Apr 2019
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
19
1
0
05 Apr 2019
Architecture Search of Dynamic Cells for Semantic Video Segmentation
Vladimir Nekrasov
Hao Chen
Chunhua Shen
Ian Reid
39
21
0
04 Apr 2019
PaintBot: A Reinforcement Learning Approach for Natural Media Painting
Biao Jia
Chen Fang
Jonathan Brandt
Byungmoon Kim
Tianyi Zhou
25
15
0
03 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
35
25
0
03 Apr 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
48
1,382
0
02 Apr 2019
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
Gautham Vasan
James Bergstra
35
28
0
27 Mar 2019
Generalized Off-Policy Actor-Critic
Shangtong Zhang
Wendelin Bohmer
Shimon Whiteson
OffRL
CML
30
43
0
27 Mar 2019
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
Linnan Wang
Yiyang Zhao
Yuu Jinnai
Yuandong Tian
Rodrigo Fonseca
BDL
25
95
0
26 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
49
28
0
25 Mar 2019
Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors
Fang-I Hsiao
Jui-Hsuan Kuo
Min Sun
OffRL
21
14
0
25 Mar 2019
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based Algorithms on Mobile Robots
Tingguang Li
Danny Ho
Chenming Li
Delong Zhu
Chaoqun Wang
Max Meng
3DV
27
55
0
23 Mar 2019
TTR-Based Reward for Reinforcement Learning with Implicit Model Priors
Xubo Lyu
Mo Chen
OffRL
14
3
0
23 Mar 2019
Iterative Reinforcement Learning Based Design of Dynamic Locomotion Skills for Cassie
Zhaoming Xie
Patrick Clary
Jeremy Dao
Pedro Morais
J. Hurst
M. van de Panne
23
67
0
22 Mar 2019
Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder
Heecheol Kim
Masanori Yamada
Kosuke Miyoshi
Hiroshi Yamakawa
DRL
21
6
0
22 Mar 2019
Hindsight Generative Adversarial Imitation Learning
N. Liu
Tao Lu
Yinghao Cai
Boyao Li
Shuo Wang
44
6
0
19 Mar 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
19
45
0
18 Mar 2019
Adaptive Variance for Changing Sparse-Reward Environments
Xingyu Lin
Pengsheng Guo
Carlos Florensa
David Held
41
6
0
15 Mar 2019
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
Artem Molchanov
Tao Chen
Wolfgang Hönig
James A. Preiss
Nora Ayanian
Gaurav Sukhatme
29
107
0
11 Mar 2019
Learning to Paint With Model-based Deep Reinforcement Learning
Zhewei Huang
Wen Heng
Shuchang Zhou
GAN
56
153
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
31
17
0
11 Mar 2019
Adaptive Power System Emergency Control using Deep Reinforcement Learning
Qiuhua Huang
Renke Huang
Weituo Hao
Jie Tan
Rui Fan
Zhenyu Huang
27
270
0
09 Mar 2019
Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes
Bohan Wu
Iretiayo Akinola
Peter K. Allen
22
34
0
08 Mar 2019
Provably Robust Blackbox Optimization for Reinforcement Learning
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
Deepali Jain
Yuxiang Yang
Atil Iscen
Jasmine Hsu
Vikas Sindhwani
26
5
0
07 Mar 2019
Distributed Policy Learning Based Random Access for Diversified QoS Requirements
Zhiyuan Jiang
Sheng Zhou
Z. Niu
4
13
0
06 Mar 2019
Using Natural Language for Reward Shaping in Reinforcement Learning
Prasoon Goyal
S. Niekum
Raymond J. Mooney
LM&Ro
46
177
0
05 Mar 2019
Deep Active Localization
S. Gottipati
K. Seo
Dhaivat Bhatt
Vincent Mai
Krishna Murthy Jatavallabhula
Liam Paull
26
37
0
05 Mar 2019
Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems
Andrew J. Taylor
Victor D. Dorobantu
Hoang Minh Le
Yisong Yue
Aaron D. Ames
117
78
0
04 Mar 2019
Sim-to-Real Transfer for Biped Locomotion
Wenhao Yu
Visak C. V. Kumar
Greg Turk
Chenxi Liu
17
115
0
04 Mar 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Zhou Fan
Ruilong Su
Weinan Zhang
Yong Yu
21
134
0
04 Mar 2019
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning
Ruo-Ze Liu
Haifeng Guo
Xiaozhong Ji
Yang Yu
Zhen-Jia Pang
Zitai Xiao
Yuzhou Wu
Tong Lu
OffRL
24
13
0
02 Mar 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
33
2
0
27 Feb 2019
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
48
120
0
27 Feb 2019
Design of intentional backdoors in sequential models
Zhaoyuan Yang
N. Iyer
Johan Reimann
Nurali Virani
SILM
AAML
28
38
0
26 Feb 2019
Cooperative Learning of Disjoint Syntax and Semantics
Serhii Havrylov
Germán Kruszewski
Armand Joulin
23
48
0
25 Feb 2019