1604.06778
Benchmarking Deep Reinforcement Learning for Continuous Control
22 April 2016
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
Papers citing
"Benchmarking Deep Reinforcement Learning for Continuous Control"
50 / 348 papers shown
Title
Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
M. D. Bernardo
21
7
0
11 Dec 2021
Learning over All Stabilizing Nonlinear Controllers for a Partially-Observed Linear System
Ruigang Wang
Nicholas H. Barbara
Max Revay
I. Manchester
27
16
0
08 Dec 2021
Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction
Yulin Zhang
William Macke
Jiaxun Cui
Daniel Urieli
Peter Stone
31
8
0
03 Dec 2021
MAMRL: Exploiting Multi-agent Meta Reinforcement Learning in WAN Traffic Engineering
Shan Sun
M. Kiran
Wei Ren
38
2
0
30 Nov 2021
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia
X. Zhou
OffRL
36
79
0
22 Nov 2021
Visual Goal-Directed Meta-Learning with Contextual Planning Networks
Corban G. Rivera
D. Handelman
43
0
0
18 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
19
9
0
17 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
45
93
0
04 Nov 2021
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies
Tim Seyde
Igor Gilitschenski
Wilko Schwarting
Bartolomeo Stellato
Martin Riedmiller
Markus Wulfmeier
Daniela Rus
28
44
0
03 Nov 2021
OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning
Qiang Liu
Nakjung Choi
Tao Han
OffRL
32
29
0
02 Nov 2021
Context Meta-Reinforcement Learning via Neuromodulation
Eseoghene Ben-Iwhiwhu
Jeffery Dick
Nicholas A. Ketz
Praveen K. Pilly
Andrea Soltoggio
OffRL
56
12
0
30 Oct 2021
Generalized Proximal Policy Optimization with Sample Reuse
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
42
47
0
29 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Tung M. Luu
Chang D. Yoo
23
8
0
28 Oct 2021
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples
Daniel Felipe Ordoñez Apraez
Antonio Agudo
Francesc Moreno-Noguer
Mario Martin
44
8
0
28 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
28
58
0
26 Oct 2021
Hierarchical Skills for Efficient Exploration
Jonas Gehring
Gabriel Synnaeve
Andreas Krause
Nicolas Usunier
28
40
0
20 Oct 2021
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning
Yang Shu
Zhangjie Cao
Jing Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
38
11
0
14 Oct 2021
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
J. Kuba
Ruiqing Chen
Muning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
49
231
0
23 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns
Prasanth Buddareddygari
Travis Zhang
Yezhou Yang
Yi Ren
AAML
37
13
0
16 Sep 2021
Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning
Zhehua Zhou
Ozgur S. Oguz
Yi Ren
M. Leibold
M. Buss
OffRL
22
0
0
10 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
37
80
0
01 Sep 2021
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey
Richard Dazeley
Peter Vamplew
Francisco Cruz
32
60
0
20 Aug 2021
Settling the Variance of Multi-Agent Policy Gradients
J. Kuba
Muning Wen
Yaodong Yang
Linghui Meng
Shangding Gu
Haifeng Zhang
D. Mguni
Jun Wang
26
59
0
19 Aug 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
134
190
0
12 Aug 2021
Reinforcement Learning for Intelligent Healthcare Systems: A Comprehensive Survey
A. Abdellatif
N. Mhaisen
Z. Chkirbene
Amr M. Mohamed
A. Erbad
Mohsen Guizani
OffRL
AI4TS
25
21
0
05 Aug 2021
Fake News and Phishing Detection Using a Machine Learning Trained Expert System
Benjamin Fitzpatrick
X. Liang
Jeremy Straub
34
6
0
04 Aug 2021
Tianshou: a Highly Modularized Deep Reinforcement Learning Library
Jiayi Weng
Huayu Chen
Dong Yan
Kaichao You
Alexis Duburcq
Minghao Zhang
Yi Su
Hang Su
Jun Zhu
NoLa
OffRL
41
195
0
29 Jul 2021
Levels of explainable artificial intelligence for human-aligned conversational explanations
Richard Dazeley
Peter Vamplew
Cameron Foale
Charlotte Young
Sunil Aryal
F. Cruz
32
90
0
07 Jul 2021
SA-MATD3: Self-attention-based multi-agent continuous control method in cooperative environments
Kai Liu
Yuyang Zhao
Gang Wang
Bei Peng
33
18
0
01 Jul 2021
Active Learning in Robotics: A Review of Control Principles
Annalisa T. Taylor
Thomas A. Berrueta
Todd D. Murphey
32
71
0
25 Jun 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
24
16
0
07 Jun 2021
Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning
Zhenning Li
Hao Yu
Guohui Zhang
Shangjia Dong
Cheng-Zhong Xu
21
109
0
20 Apr 2021
Distributed TD(0) with Almost No Communication
R. Liu
Alexander Olshevsky
FedML
30
15
0
16 Apr 2021
Hybrid analysis and modeling, eclecticism, and multifidelity computing toward digital twin revolution
Omer San
Adil Rasheed
T. Kvamsdal
55
50
0
26 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
21
2
0
26 Mar 2021
Parametrized quantum policies for reinforcement learning
Sofiene Jerbi
Casper Gyurik
Simon Marshall
Hans J. Briegel
Vedran Dunjko
42
105
0
09 Mar 2021
Decaying Clipping Range in Proximal Policy Optimization
Mónika Farsang
Luca Szegletes
OffRL
18
4
0
20 Feb 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto
P. Becker
Ngo Anh Vien
Hanna Ziesche
Gerhard Neumann
OffRL
41
19
0
22 Jan 2021
Benchmarking Simulation-Based Inference
Jan-Matthis Lueckmann
Jan Boelts
David S. Greenberg
P. J. Gonçalves
Jakob H. Macke
104
186
0
12 Jan 2021
Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach
James Queeney
I. Paschalidis
Christos G. Cassandras
31
9
0
19 Dec 2020
Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer
Zohreh Raziei
Mohsen Moghaddam
26
25
0
27 Nov 2020
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem
Eric Liang
Zhanghao Wu
Michael Luo
Sven Mika
Joseph E. Gonzalez
Ion Stoica
AI4CE
23
9
0
25 Nov 2020
MRAC-RL: A Framework for On-Line Policy Adaptation Under Parametric Model Uncertainty
A. Guha
Anuradha M. Annaswamy
31
12
0
20 Nov 2020
Joint Space Control via Deep Reinforcement Learning
Visak C. V. Kumar
David Hoeller
Balakumar Sundaralingam
Jonathan Tremblay
Stan Birchfield
DRL
25
15
0
12 Nov 2020
On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces
Zhuoran Yang
Chi Jin
Zhaoran Wang
Mengdi Wang
Michael I. Jordan
44
18
0
09 Nov 2020
Control with adaptive Q-learning
J. Araújo
Mário A. T. Figueiredo
M. Botto
33
2
0
03 Nov 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
29
11
0
25 Oct 2020
Learning Partially Observed Linear Dynamical Systems from Logarithmic Number of Samples
S. Fattahi
26
14
0
08 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
25
7
0
04 Oct 2020
GRAC: Self-Guided and Self-Regularized Actor-Critic
Lin Shao
Yifan You
Mengyuan Yan
Qingyun Sun
Jeannette Bohg
29
23
0
18 Sep 2020