Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06347
Cited By
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 7,399 papers shown
Title
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
52
278
0
13 May 2020
Smooth Exploration for Robotic Reinforcement Learning
Antonin Raffin
Jens Kober
F. Stulp
37
57
0
12 May 2020
Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Yash Satsangi
Sungsu Lim
Shimon Whiteson
F. Oliehoek
Martha White
32
15
0
11 May 2020
Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation
Xinting Huang
Jianzhong Qi
Yu Sun
Rui Zhang
OffRL
71
18
0
09 May 2020
CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion
Ying-Sheng Luo
Jonathan Hans Soeseno
Trista Pei-chun Chen
Wei-Chao Chen
20
15
0
07 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
46
19
0
06 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
67
456
0
02 May 2020
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
20
649
0
30 Apr 2020
Plan-Space State Embeddings for Improved Reinforcement Learning
Max Pflueger
Gaurav Sukhatme
17
1
0
30 Apr 2020
Actor-Critic Reinforcement Learning for Control with Stability Guarantee
Minghao Han
Lixian Zhang
Jun Wang
Wei Pan
18
107
0
29 Apr 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
36
131
0
28 Apr 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
52
355
0
27 Apr 2020
Reinforcement Learning Generalization with Surprise Minimization
Jerry Zikun Chen
OOD
29
19
0
26 Apr 2020
Self-Paced Deep Reinforcement Learning
Pascal Klink
Carlo DÉramo
Jan Peters
Joni Pajarinen
ODL
50
54
0
24 Apr 2020
OF-VO: Efficient Navigation among Pedestrians Using Commodity Sensors
Jing Liang
Yi-Ling Qiao
Tianrui Guan
Tianyi Zhou
18
13
0
23 Apr 2020
Qd-tree: Learning Data Layouts for Big Data Analytics
Zongheng Yang
Badrish Chandramouli
Chi Wang
J. Gehrke
Yinan Li
U. F. Minhas
P. Larson
Donald Kossmann
R. Acharya
21
93
0
22 Apr 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
36
38
0
22 Apr 2020
Policy Gradient from Demonstration and Curiosity
Jie Chen
Wenjun Xu
24
11
0
22 Apr 2020
Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
Zheng Zhang
Lizi Liao
Xiaoyan Zhu
Tat-Seng Chua
Zitao Liu
Yan Huang
Minlie Huang
LLMAG
35
20
0
21 Apr 2020
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion
Aditi Mavalankar
OffRL
40
7
0
17 Apr 2020
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning
Wenhao Li
Bo Jin
Xiangfeng Wang
Junchi Yan
H. Zha
30
21
0
17 Apr 2020
Network-principled deep generative models for designing drug combinations as graph sets
Mostafa Karimi
Arman Hasanzadeh
Yang Shen
GNN
38
31
0
16 Apr 2020
lamBERT: Language and Action Learning Using Multimodal BERT
Kazuki Miyazawa
Tatsuya Aoki
Takato Horii
Takayuki Nagai
SSL
LM&Ro
47
12
0
15 Apr 2020
Reinforcement Learning Approach to Vibration Compensation for Dynamic Feed Drive Systems
Ralf Gulde
Marc Tuscher
A. Csiszar
O. Riedel
A. Verl
AI4CE
19
1
0
14 Apr 2020
Adversarial Evaluation of Autonomous Vehicles in Lane-Change Scenarios
Baiming Chen
Xiang Chen
Qiong Wu
Liang-Sheng Li
AAML
22
94
0
14 Apr 2020
Certifiable Robustness to Adversarial State Uncertainty in Deep Reinforcement Learning
Michael Everett
Bjorn Lutjens
Jonathan P. How
AAML
27
41
0
11 Apr 2020
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
161
1,944
0
11 Apr 2020
Residual Policy Learning for Shared Autonomy
Charles B. Schaff
Matthew R. Walter
30
40
0
10 Apr 2020
State-Only Imitation Learning for Dexterous Manipulation
Ilija Radosavovic
Xiaolong Wang
Lerrel Pinto
Jitendra Malik
OffRL
24
122
0
07 Apr 2020
An Application of Deep Reinforcement Learning to Algorithmic Trading
Thibaut Théate
D. Ernst
AIFin
19
163
0
07 Apr 2020
A Deep Ensemble Multi-Agent Reinforcement Learning Approach for Air Traffic Control
Supriyo Ghosh
Sean Laguna
Shiau Hong Lim
L. Wynter
Hasan A. Poonawala
35
14
0
03 Apr 2020
Action Space Shaping in Deep Reinforcement Learning
Anssi Kanervisto
Christian Scheller
Ville Hautamaki
37
81
0
02 Apr 2020
Learning Agile Robotic Locomotion Skills by Imitating Animals
Xue Bin Peng
Erwin Coumans
Tingnan Zhang
T. Lee
Jie Tan
Sergey Levine
41
499
0
02 Apr 2020
A New Challenge: Approaching Tetris Link with AI
Matthias Muller-Brockhausen
Mike Preuss
Aske Plaat
11
2
0
01 Apr 2020
Learning to Ask Medical Questions using Reinforcement Learning
Uri Shaham
Tom Zahavy
C. Caraballo
S. Mahajan
D. Massey
H. Krumholz
OOD
29
1
0
31 Mar 2020
Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation
Mariko Isogawa
Ye Yuan
Matthew O'Toole
Kris Kitani
3DH
30
61
0
31 Mar 2020
Robotic Table Tennis with Model-Free Reinforcement Learning
Wenbo Gao
L. Graesser
K. Choromanski
Xingyou Song
N. Lazić
Pannag R Sanketi
Vikas Sindhwani
Navdeep Jaitly
24
45
0
31 Mar 2020
Controlling Rayleigh-Bénard convection via Reinforcement Learning
Gerben Beintema
Alessandro Corbetta
Luca Biferale
F. Toschi
AI4CE
38
79
0
31 Mar 2020
Leverage the Average: an Analysis of KL Regularization in RL
Nino Vieillard
Tadashi Kozuno
B. Scherrer
Olivier Pietquin
Rémi Munos
Matthieu Geist
27
43
0
31 Mar 2020
Continual Learning with Node-Importance based Adaptive Group Sparse Regularization
Sangwon Jung
Hongjoon Ahn
Sungmin Cha
Taesup Moon
CLL
25
120
0
30 Mar 2020
When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey
Chongzhen Zhang
Jianrui Wang
Gary G. Yen
Chaoqiang Zhao
Qiyu Sun
Yang Tang
Feng Qian
Jürgen Kurths
AAML
51
20
0
29 Mar 2020
Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping
Dan-xu Zhang
Colleen P. Bailey
11
12
0
28 Mar 2020
A Survey of Deep Learning for Scientific Discovery
M. Raghu
Erica Schmidt
OOD
AI4CE
60
122
0
26 Mar 2020
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Jiale Zhi
Rui Wang
Jeff Clune
Kenneth O. Stanley
OffRL
35
12
0
25 Mar 2020
PADS: Policy-Adapted Sampling for Visual Similarity Learning
Karsten Roth
Timo Milbich
Bjorn Ommer
34
49
0
24 Mar 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
48
122
0
24 Mar 2020
Multi-Agent Reinforcement Learning for Problems with Combined Individual and Team Reward
Hassam Sheikh
Ladislau Bölöni
47
36
0
24 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Yue Liu
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
61
262
0
19 Mar 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
37
126
0
19 Mar 2020
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
39
111
0
18 Mar 2020
Previous
1
2
3
...
136
137
138
...
146
147
148
Next