ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.06778
  4. Cited By
Benchmarking Deep Reinforcement Learning for Continuous Control

Benchmarking Deep Reinforcement Learning for Continuous Control

22 April 2016
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
    OffRL
ArXivPDFHTML

Papers citing "Benchmarking Deep Reinforcement Learning for Continuous Control"

50 / 348 papers shown
Title
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Chunyuan Li
Ke Bai
Jianqiao Li
Guoyin Wang
Changyou Chen
Lawrence Carin
19
10
0
03 Jan 2019
Deep Reinforcement Learning for Multi-Agent Systems: A Review of
  Challenges, Solutions and Applications
Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
Thanh Thi Nguyen
Ngoc Duy Nguyen
S. Nahavandi
27
775
0
31 Dec 2018
Dopamine: A Research Framework for Deep Reinforcement Learning
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
28
276
0
14 Dec 2018
Relative Entropy Regularized Policy Iteration
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
Meta-Learning for Multi-objective Reinforcement Learning
Meta-Learning for Multi-objective Reinforcement Learning
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
Pablo G. Cámara
OffRL
23
54
0
08 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Assessing Generalization in Deep Reinforcement Learning
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
D. Song
OffRL
18
233
0
29 Oct 2018
Sample-Efficient Learning of Nonprehensile Manipulation Policies via
  Physics-Based Informed State Distributions
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions
Lerrel Pinto
Aditya Mandalika
Brian Hou
S. Srinivasa
33
13
0
24 Oct 2018
Hierarchical Approaches for Reinforcement Learning in Parameterized
  Action Space
Hierarchical Approaches for Reinforcement Learning in Parameterized Action Space
E. Wei
Drew Wicke
S. Luke
BDL
30
35
0
23 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent
  Environments
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
13
148
0
21 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement
  Learning
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
27
182
0
12 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
CEM-RL: Combining evolutionary and gradient-based methods for policy
  search
CEM-RL: Combining evolutionary and gradient-based methods for policy search
Aloïs Pourchot
Olivier Sigaud
32
160
0
02 Oct 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
39
29
0
27 Sep 2018
Scaling simulation-to-real transfer by learning composable robot skills
Scaling simulation-to-real transfer by learning composable robot skills
Ryan Julian
Eric Heiden
Zhanpeng He
Hejia Zhang
S. Schaal
Joseph J. Lim
Gaurav Sukhatme
Karol Hausman
17
15
0
26 Sep 2018
Switching Isotropic and Directional Exploration with Parameter Space
  Noise in Deep Reinforcement Learning
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
19
3
0
18 Sep 2018
Model-Based Reinforcement Learning via Meta-Policy Optimization
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
30
225
0
14 Sep 2018
Policy Optimization as Wasserstein Gradient Flows
Policy Optimization as Wasserstein Gradient Flows
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Lawrence Carin
21
66
0
09 Aug 2018
Learning to Listen, Read, and Follow: Score Following as a Reinforcement
  Learning Game
Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game
Matthias Dorfer
Florian Henkel
Gerhard Widmer
24
34
0
17 Jul 2018
End-to-End Race Driving with Deep Reinforcement Learning
End-to-End Race Driving with Deep Reinforcement Learning
M. Jaritz
Raoul de Charette
Marin Toromanoff
E. Perot
F. Nashashibi
22
167
0
06 Jul 2018
Variance Reduction for Reinforcement Learning in Input-Driven
  Environments
Variance Reduction for Reinforcement Learning in Input-Driven Environments
Hongzi Mao
S. Venkatakrishnan
Malte Schwarzkopf
Mohammad Alizadeh
OffRL
41
95
0
06 Jul 2018
Stochastic Variance-Reduced Policy Gradient
Stochastic Variance-Reduced Policy Gradient
Matteo Papini
Damiano Binaghi
Giuseppe Canonaco
Matteo Pirotta
Marcello Restelli
24
174
0
14 Jun 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
471
0
14 Jun 2018
The streaming rollout of deep networks - towards fully model-parallel
  execution
The streaming rollout of deep networks - towards fully model-parallel execution
Volker Fischer
Jan M. Köhler
Thomas Pfeil
27
16
0
13 Jun 2018
Unsupervised Meta-Learning for Reinforcement Learning
Unsupervised Meta-Learning for Reinforcement Learning
Abhishek Gupta
Benjamin Eysenbach
Chelsea Finn
Sergey Levine
SSL
OffRL
54
106
0
12 Jun 2018
Supervised Policy Update for Deep Reinforcement Learning
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
Keith Ross
19
20
0
29 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
32
18
0
27 May 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
29
65
0
25 May 2018
Deep Reinforcement Learning of Marked Temporal Point Processes
Deep Reinforcement Learning of Marked Temporal Point Processes
U. Upadhyay
A. De
Manuel Gomez Rodriguez
BDL
OffRL
31
111
0
23 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
68
797
0
21 May 2018
Task-Agnostic Meta-Learning for Few-shot Learning
Task-Agnostic Meta-Learning for Few-shot Learning
Muhammad Abdullah Jamal
Guo-Jun Qi
M. Shah
52
456
0
20 May 2018
Policy Optimization with Second-Order Advantage Information
Policy Optimization with Second-Order Advantage Information
Jiajin Li
Baoxiang Wang
22
6
0
09 May 2018
Exploration by Distributional Reinforcement Learning
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
An Adaptive Clipping Approach for Proximal Policy Optimization
An Adaptive Clipping Approach for Proximal Policy Optimization
Gang Chen
Yiming Peng
Mengjie Zhang
22
22
0
17 Apr 2018
Finite-Data Performance Guarantees for the Output-Feedback Control of an
  Unknown System
Finite-Data Performance Guarantees for the Output-Feedback Control of an Unknown System
Ross Boczar
Nikolai Matni
Benjamin Recht
26
45
0
25 Mar 2018
A Survey of Deep Learning Techniques for Mobile Robot Applications
A Survey of Deep Learning Techniques for Mobile Robot Applications
Jahanzaib Shabbir
T. Anwer
34
36
0
20 Mar 2018
Setting up a Reinforcement Learning Task with a Real-World Robot
Setting up a Reinforcement Learning Task with a Real-World Robot
A. R. Mahmood
D. Korenkevych
Brent Komer
James Bergstra
26
75
0
19 Mar 2018
Policy Search in Continuous Action Domains: an Overview
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
16
72
0
13 Mar 2018
A Multi-Objective Deep Reinforcement Learning Framework
A Multi-Objective Deep Reinforcement Learning Framework
Thanh Thi Nguyen
Ngoc Duy Nguyen
Peter Vamplew
Saeid Nahavandi
Richard Dazeley
Chee Peng Lim
OffRL
15
108
0
08 Mar 2018
Accelerated Methods for Deep Reinforcement Learning
Accelerated Methods for Deep Reinforcement Learning
Adam Stooke
Pieter Abbeel
OffRL
OnRL
25
133
0
07 Mar 2018
Multi-Agent Imitation Learning for Driving Simulation
Multi-Agent Imitation Learning for Driving Simulation
Raunak P. Bhattacharyya
Derek J. Phillips
Blake Wulfe
Jeremy Morton
Alex Kuefler
Mykel J. Kochenderfer
22
118
0
02 Mar 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
30
126
0
27 Feb 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
66
238
0
25 Feb 2018
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing
  Atari
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
P. Chrabaszcz
I. Loshchilov
Frank Hutter
32
99
0
24 Feb 2018
Clipped Action Policy Gradient
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
34
37
0
21 Feb 2018
Accelerated Primal-Dual Policy Optimization for Safe Reinforcement
  Learning
Accelerated Primal-Dual Policy Optimization for Safe Reinforcement Learning
Qingkai Liang
Fanyu Que
E. Modiano
29
101
0
19 Feb 2018
Evolved Policy Gradients
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
49
227
0
13 Feb 2018
Global Convergence of Policy Gradient Methods for the Linear Quadratic
  Regulator
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
35
597
0
15 Jan 2018
DeepMind Control Suite
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
74
1,105
0
02 Jan 2018
Time Limits in Reinforcement Learning
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
44
158
0
01 Dec 2017
Previous
1234567
Next