Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1604.06778
Cited By
Benchmarking Deep Reinforcement Learning for Continuous Control
22 April 2016
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Benchmarking Deep Reinforcement Learning for Continuous Control"
48 / 348 papers shown
Title
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
40
10
0
30 Nov 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
45
10
0
30 Nov 2017
A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management
I. Casanueva
Paweł Budzianowski
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
Stefan Ultes
L. Rojas-Barahona
S. Young
Milica Gasic
OffRL
35
54
0
29 Nov 2017
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
22
260
0
24 Nov 2017
Data-driven Planning via Imitation Learning
Sanjiban Choudhury
M. Bhardwaj
S. Arora
Ashish Kapoor
G. Ranade
Sebastian Scherer
Debadeepta Dey
46
80
0
17 Nov 2017
Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning
Richard Liaw
S. Krishnan
Animesh Garg
D. Crankshaw
Joseph E. Gonzalez
Ken Goldberg
BDL
32
23
0
04 Nov 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
31
14
0
24 Oct 2017
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
67
1,345
0
18 Oct 2017
Burn-In Demonstrations for Multi-Modal Imitation Learning
Alex Kuefler
Mykel J. Kochenderfer
29
24
0
13 Oct 2017
On the Sample Complexity of the Linear Quadratic Regulator
Sarah Dean
Horia Mania
Nikolai Matni
Benjamin Recht
Stephen Tu
40
570
0
04 Oct 2017
Predictive-State Decoders: Encoding the Future into Recurrent Networks
Arun Venkatraman
Nicholas Rhinehart
Wen Sun
Lerrel Pinto
M. Hebert
Byron Boots
Kris Kitani
J. Andrew Bagnell
AI4CE
26
42
0
25 Sep 2017
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes
Kunal Menda
Yi-Chun Chen
J. Grana
J. Bono
Brendan D. Tracey
Mykel J. Kochenderfer
David Wolpert
27
48
0
19 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
32
28
0
18 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow
Danijar Hafner
James Davidson
Vincent Vanhoucke
OffRL
17
49
0
08 Sep 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
CASSL: Curriculum Accelerated Self-Supervised Learning
Adithyavairavan Murali
Lerrel Pinto
Dhiraj Gandhi
Abhinav Gupta
SSL
27
35
0
04 Aug 2017
Mutual Alignment Transfer Learning
Markus Wulfmeier
Ingmar Posner
Pieter Abbeel
16
60
0
25 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
84
18,385
0
20 Jul 2017
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
36
436
0
17 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning
Paweł Budzianowski
Stefan Ultes
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
I. Casanueva
L. Rojas-Barahona
Milica Gasic
27
49
0
19 Jun 2017
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
61
1,302
0
30 May 2017
Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning
H. Mirzaei
T. Givargis
22
8
0
30 May 2017
Automatic Goal Generation for Reinforcement Learning Agents
Carlos Florensa
David Held
Xinyang Geng
Pieter Abbeel
78
499
0
17 May 2017
Probabilistically Safe Policy Transfer
David Held
Zoe McCarthy
Michael Zhang
Fred Shentu
Pieter Abbeel
35
19
0
15 May 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
47
360
0
10 Apr 2017
Real-Time Machine Learning: The Missing Pieces
Robert Nishihara
Philipp Moritz
Stephanie Wang
Alexey Tumanov
William Paul
Johann Schleier-Smith
Richard Liaw
Mehrdad Niknami
Michael I. Jordan
Ion Stoica
OffRL
35
64
0
11 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
45
1,516
0
10 Mar 2017
Towards Generalization and Simplicity in Continuous Control
Aravind Rajeswaran
Kendall Lowrey
E. Todorov
Sham Kakade
OffRL
55
276
0
08 Mar 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
40
235
0
06 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
35
232
0
03 Mar 2017
Deep Predictive Policy Training using Reinforcement Learning
Ali Ghadirzadeh
A. Maki
Danica Kragic
Mårten Björkman
36
128
0
02 Mar 2017
Reinforcement Learning for Pivoting Task
Rika Antonova
S. Cruciani
Christian Smith
Danica Kragic
16
68
0
01 Mar 2017
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
23
10
0
26 Feb 2017
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning
Francois Belletti
Daniel Haziza
G. Gomes
Alexandre M. Bayen
19
139
0
30 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
106
1,505
0
25 Jan 2017
Imitating Driver Behavior with Generative Adversarial Networks
Alex Kuefler
Jeremy Morton
T. Wheeler
Mykel Kochenderfer
GAN
35
405
0
24 Jan 2017
Agent-Agnostic Human-in-the-Loop Reinforcement Learning
David Abel
J. Salvatier
Andreas Stuhlmuller
Owain Evans
19
60
0
15 Jan 2017
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
60
760
0
15 Nov 2016
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
35
1,008
0
09 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
47
343
0
07 Nov 2016
Deep Learning Approximation for Stochastic Control Problems
Jiequn Han
E. Weinan
BDL
29
191
0
02 Nov 2016
Towards Cognitive Exploration through Deep Reinforcement Learning for Mobile Robots
L. Tai
Ming Liu
21
103
0
06 Oct 2016
Actor-critic versus direct policy search: a comparison based on sample complexity
Arnaud de Froissard de Broissia
Olivier Sigaud
34
12
0
29 Jun 2016
Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines
C. Duong
Khoa Luu
Kha Gia Quach
Tien D. Bui
CVBM
43
44
0
07 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
72
5,029
0
05 Jun 2016
Deep Reinforcement Learning Radio Control and Signal Detection with KeRLym, a Gym RL Agent
Tim O'Shea
T. Clancy
32
19
0
30 May 2016
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
41
649
0
09 Feb 2016
Previous
1
2
3
4
5
6
7