ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.06778
  4. Cited By
Benchmarking Deep Reinforcement Learning for Continuous Control

Benchmarking Deep Reinforcement Learning for Continuous Control

22 April 2016
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
    OffRL
ArXivPDFHTML

Papers citing "Benchmarking Deep Reinforcement Learning for Continuous Control"

48 / 348 papers shown
Title
Variational Deep Q Network
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
40
10
0
30 Nov 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in
  Continuous Control
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
45
10
0
30 Nov 2017
A Benchmarking Environment for Reinforcement Learning Based Task
  Oriented Dialogue Management
A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management
I. Casanueva
Paweł Budzianowski
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
Stefan Ultes
L. Rojas-Barahona
S. Young
Milica Gasic
OffRL
35
54
0
29 Nov 2017
Action Branching Architectures for Deep Reinforcement Learning
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
22
260
0
24 Nov 2017
Data-driven Planning via Imitation Learning
Data-driven Planning via Imitation Learning
Sanjiban Choudhury
M. Bhardwaj
S. Arora
Ashish Kapoor
G. Ranade
Sebastian Scherer
Debadeepta Dey
46
80
0
17 Nov 2017
Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep
  Reinforcement Learning
Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning
Richard Liaw
S. Krishnan
Animesh Garg
D. Crankshaw
Joseph E. Gonzalez
Ken Goldberg
BDL
32
23
0
04 Nov 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy
  Search
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
31
14
0
24 Oct 2017
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
67
1,345
0
18 Oct 2017
Burn-In Demonstrations for Multi-Modal Imitation Learning
Burn-In Demonstrations for Multi-Modal Imitation Learning
Alex Kuefler
Mykel J. Kochenderfer
29
24
0
13 Oct 2017
On the Sample Complexity of the Linear Quadratic Regulator
On the Sample Complexity of the Linear Quadratic Regulator
Sarah Dean
Horia Mania
Nikolai Matni
Benjamin Recht
Stephen Tu
40
570
0
04 Oct 2017
Predictive-State Decoders: Encoding the Future into Recurrent Networks
Predictive-State Decoders: Encoding the Future into Recurrent Networks
Arun Venkatraman
Nicholas Rhinehart
Wen Sun
Lerrel Pinto
M. Hebert
Byron Boots
Kris Kitani
J. Andrew Bagnell
AI4CE
26
42
0
25 Sep 2017
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision
  Processes
Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes
Kunal Menda
Yi-Chun Chen
J. Grana
J. Bono
Brendan D. Tracey
Mykel J. Kochenderfer
David Wolpert
27
48
0
19 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
32
28
0
18 Sep 2017
TensorFlow Agents: Efficient Batched Reinforcement Learning in
  TensorFlow
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow
Danijar Hafner
James Davidson
Vincent Vanhoucke
OffRL
17
49
0
08 Sep 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
CASSL: Curriculum Accelerated Self-Supervised Learning
CASSL: Curriculum Accelerated Self-Supervised Learning
Adithyavairavan Murali
Lerrel Pinto
Dhiraj Gandhi
Abhinav Gupta
SSL
27
35
0
04 Aug 2017
Mutual Alignment Transfer Learning
Mutual Alignment Transfer Learning
Markus Wulfmeier
Ingmar Posner
Pieter Abbeel
16
60
0
25 Jul 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
84
18,385
0
20 Jul 2017
Reverse Curriculum Generation for Reinforcement Learning
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
36
436
0
17 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
Sub-domain Modelling for Dialogue Management with Hierarchical
  Reinforcement Learning
Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning
Paweł Budzianowski
Stefan Ultes
Pei-hao Su
N. Mrksic
Tsung-Hsien Wen
I. Casanueva
L. Rojas-Barahona
Milica Gasic
27
49
0
19 Jun 2017
Constrained Policy Optimization
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
61
1,302
0
30 May 2017
Fine-grained acceleration control for autonomous intersection management
  using deep reinforcement learning
Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning
H. Mirzaei
T. Givargis
22
8
0
30 May 2017
Automatic Goal Generation for Reinforcement Learning Agents
Automatic Goal Generation for Reinforcement Learning Agents
Carlos Florensa
David Held
Xinyang Geng
Pieter Abbeel
78
499
0
17 May 2017
Probabilistically Safe Policy Transfer
Probabilistically Safe Policy Transfer
David Held
Zoe McCarthy
Michael Zhang
Fred Shentu
Pieter Abbeel
35
19
0
15 May 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
47
360
0
10 Apr 2017
Real-Time Machine Learning: The Missing Pieces
Real-Time Machine Learning: The Missing Pieces
Robert Nishihara
Philipp Moritz
Stephanie Wang
Alexey Tumanov
William Paul
Johann Schleier-Smith
Richard Liaw
Mehrdad Niknami
Michael I. Jordan
Ion Stoica
OffRL
35
64
0
11 Mar 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
45
1,516
0
10 Mar 2017
Towards Generalization and Simplicity in Continuous Control
Towards Generalization and Simplicity in Continuous Control
Aravind Rajeswaran
Kendall Lowrey
E. Todorov
Sham Kakade
OffRL
55
276
0
08 Mar 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
40
235
0
06 Mar 2017
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential
  Prediction
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
Wen Sun
Arun Venkatraman
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
35
232
0
03 Mar 2017
Deep Predictive Policy Training using Reinforcement Learning
Deep Predictive Policy Training using Reinforcement Learning
Ali Ghadirzadeh
A. Maki
Danica Kragic
Mårten Björkman
36
128
0
02 Mar 2017
Reinforcement Learning for Pivoting Task
Reinforcement Learning for Pivoting Task
Rika Antonova
S. Cruciani
Christian Smith
Danica Kragic
16
68
0
01 Mar 2017
Learning Control for Air Hockey Striking using Deep Reinforcement
  Learning
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
23
10
0
26 Feb 2017
Expert Level control of Ramp Metering based on Multi-task Deep
  Reinforcement Learning
Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning
Francois Belletti
Daniel Haziza
G. Gomes
Alexandre M. Bayen
19
139
0
30 Jan 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
106
1,505
0
25 Jan 2017
Imitating Driver Behavior with Generative Adversarial Networks
Imitating Driver Behavior with Generative Adversarial Networks
Alex Kuefler
Jeremy Morton
T. Wheeler
Mykel Kochenderfer
GAN
35
405
0
24 Jan 2017
Agent-Agnostic Human-in-the-Loop Reinforcement Learning
Agent-Agnostic Human-in-the-Loop Reinforcement Learning
David Abel
J. Salvatier
Andreas Stuhlmuller
Owain Evans
19
60
0
15 Jan 2017
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement
  Learning
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
60
760
0
15 Nov 2016
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
RL2^22: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
35
1,008
0
09 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
47
343
0
07 Nov 2016
Deep Learning Approximation for Stochastic Control Problems
Deep Learning Approximation for Stochastic Control Problems
Jiequn Han
E. Weinan
BDL
29
191
0
02 Nov 2016
Towards Cognitive Exploration through Deep Reinforcement Learning for
  Mobile Robots
Towards Cognitive Exploration through Deep Reinforcement Learning for Mobile Robots
L. Tai
Ming Liu
21
103
0
06 Oct 2016
Actor-critic versus direct policy search: a comparison based on sample
  complexity
Actor-critic versus direct policy search: a comparison based on sample complexity
Arnaud de Froissard de Broissia
Olivier Sigaud
34
12
0
29 Jun 2016
Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann
  Machines
Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines
C. Duong
Khoa Luu
Kha Gia Quach
Tien D. Bui
CVBM
43
44
0
07 Jun 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
72
5,029
0
05 Jun 2016
Deep Reinforcement Learning Radio Control and Signal Detection with
  KeRLym, a Gym RL Agent
Deep Reinforcement Learning Radio Control and Signal Detection with KeRLym, a Gym RL Agent
Tim O'Shea
T. Clancy
32
19
0
30 May 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
41
649
0
09 Feb 2016
Previous
1234567