Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1509.02971
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Continuous control with deep reinforcement learning
9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Continuous control with deep reinforcement learning"
50 / 4,796 papers shown
Hybrid Policy Learning for Energy-Latency Tradeoff in MEC-Assisted VR Video Service
IEEE Transactions on Vehicular Technology (IEEE Trans. Veh. Technol.), 2021
Chong Zheng
Shengheng Liu
Yongming Huang
Luxi Yang
EgoV
123
37
0
02 Apr 2021
Replay in Deep Learning: Current Approaches and Missing Biological Elements
Neural Computation (Neural Comput.), 2021
Tyler L. Hayes
G. Krishnan
M. Bazhenov
H. Siegelmann
T. Sejnowski
Christopher Kanan
CLL
239
139
0
01 Apr 2021
Storchastic: A Framework for General Stochastic Automatic Differentiation
Neural Information Processing Systems (NeurIPS), 2021
Emile van Krieken
Jakub M. Tomczak
A. T. Teije
ODL
OffRL
359
18
0
01 Apr 2021
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Hiroki Furuta
Tadashi Kozuno
T. Matsushima
Y. Matsuo
S. Gu
325
14
0
31 Mar 2021
Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning
Edward W. Hill
M. Bardoscia
A. Turrell
125
31
0
31 Mar 2021
Learning and Exploring Motor Skills with Spacetime Bounds
Li-Ke Ma
Zeshi Yang
Xin Tong
B. Guo
KangKang Yin
144
26
0
31 Mar 2021
Benchmarks for Deep Off-Policy Evaluation
International Conference on Learning Representations (ICLR), 2021
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
210
109
0
30 Mar 2021
Saddle Point Optimization with Approximate Minimization Oracle
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2021
Youhei Akimoto
201
8
0
29 Mar 2021
Deep Hedging of Derivatives Using Reinforcement Learning
The Journal of Financial Data Science (JFDS), 2019
Jay Cao
Jacky Chen
J. Hull
Zissis Poulos
258
89
0
29 Mar 2021
Joint Resource Management for MC-NOMA: A Deep Reinforcement Learning Approach
IEEE Transactions on Wireless Communications (IEEE TWC), 2021
Shaoyang Wang
Tiejun Lv
Wei Ni
N. Beaulieu
Y. Guo
91
54
0
29 Mar 2021
Robust Feedback Motion Policy Design Using Reinforcement Learning on a 3D Digit Bipedal Robot
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Guillermo A. Castillo
Bowen Weng
Wei Zhang
Ayonga Hereid
111
69
0
29 Mar 2021
Self-adaptive Torque Vectoring Controller Using Reinforcement Learning
International Conference on Intelligent Transportation Systems (ITSC), 2021
Shayan Taherian
Sampo Kuutti
Marco Visca
Saber Fallah
43
7
0
27 Mar 2021
KnowRU: Knowledge Reusing via Knowledge Distillation in Multi-agent Reinforcement Learning
Entropy (Entropy), 2021
Zijian Gao
Kele Xu
Bo Ding
Huaimin Wang
Yiying Li
Hongda Jia
115
18
0
27 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
272
2
0
26 Mar 2021
Value Function Estimators for Feynman-Kac Forward-Backward SDEs in Stochastic Optimal Control
Kelsey P. Hawkins
A. Pakniyat
Panagiotis Tsiotras
76
7
0
26 Mar 2021
Preliminary Experimental Results of Context-Aware Teams of Multiple Autonomous Agents Operating under Constrained Communications
J. Martinez-Lorenzo
Jeffrey Hudack
Yutao Jing
Michael Shaham
Zixuan Liang
...
Juan Heredia Juesas
Yuntao Ma
Tristan Sweeney
Nicolas Ares
Ari Fox
176
0
0
25 Mar 2021
A Meta-Reinforcement Learning Approach to Process Control
IFAC-PapersOnLine (IFAC-PapersOnLine), 2021
Daniel G. McClement
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
Johan U. Backstrom
R. Bhushan Gopaluni
109
13
0
25 Mar 2021
Adversarial Imitation Learning with Trajectorial Augmentation and Correction
IEEE International Conference on Robotics and Automation (ICRA), 2021
Dafni Antotsiou
C. Ciliberto
Tae-Kyun Kim
151
12
0
25 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2021
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
166
49
0
25 Mar 2021
Self-Imitation Learning by Planning
IEEE International Conference on Robotics and Automation (ICRA), 2021
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
226
85
0
25 Mar 2021
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
Allan Zhou
Archit Sharma
Chelsea Finn
OffRL
151
3
0
24 Mar 2021
Deep Reinforcement Learning for Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Medium Transition
IEEE International Conference on Robotics and Automation (ICRA), 2021
Ricardo B. Grando
J. C. Jesus
V. A. Kich
A. H. Kolling
Nicolas P. Bortoluzzi
P. Pinheiro
A. A. Neto
Paulo L. J. Drews-Jr
173
46
0
23 Mar 2021
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
International Conference on Machine Learning (ICML), 2021
Hiroki Furuta
T. Matsushima
Tadashi Kozuno
Y. Matsuo
Sergey Levine
Ofir Nachum
S. Gu
OffRL
150
16
0
23 Mar 2021
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Neural Information Processing Systems (NeurIPS), 2021
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
345
53
0
23 Mar 2021
Neural Network Controller for Autonomous Pile Loading Revised
IEEE International Conference on Robotics and Automation (ICRA), 2021
Wenyan Yang
N. Strokina
N. Serbenyuk
Joni Pajarinen
R. Ghabcheloo
J. Vihonen
M. M. Aref
Joni-Kristian Kämäräinen
127
7
0
23 Mar 2021
Regularized Softmax Deep Multi-Agent
Q
Q
Q
-Learning
Neural Information Processing Systems (NeurIPS), 2021
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
236
42
0
22 Mar 2021
IPAPRec: A promising tool for learning high-performance mapless navigation skills with deep reinforcement learning
IEEE/ASME transactions on mechatronics (IEEE/ASME Trans. Mechatronics), 2021
Wei Zhang
Yunfeng Zhang
Ning Liu
Kai Ren
Pengfei Wang
155
20
0
22 Mar 2021
Bayesian Distributional Policy Gradients
AAAI Conference on Artificial Intelligence (AAAI), 2021
Luchen Li
A. Faisal
BDL
OffRL
189
11
0
20 Mar 2021
Self-Supervised Steering Angle Prediction for Vehicle Control Using Visual Odometry
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Qadeer Ahmad Khan
Patrick Wenzel
Zorah Lähner
SSL
289
3
0
20 Mar 2021
A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning for Mobile Robots
International Journal of Robust and Nonlinear Control (IJRNC), 2021
Xinyi Yu
Yu Fan
Siyu Xu
L. Ou
109
44
0
19 Mar 2021
Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence
IEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021
Yang Guan
Yangang Ren
Qi Sun
Shengbo Eben Li
Haitong Ma
Jingliang Duan
Yifan Dai
B. Cheng
186
85
0
18 Mar 2021
Maximum Entropy Reinforcement Learning with Mixture Policies
Nir Baram
Guy Tennenholtz
Shie Mannor
111
5
0
18 Mar 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
International Conference on Machine Learning (ICML), 2021
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
248
28
0
17 Mar 2021
Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages
Italian National Conference on Sensors (INS), 2021
Sampo Kuutti
Richard Bowden
Saber Fallah
165
14
0
17 Mar 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
176
42
0
17 Mar 2021
A Practical Guide to Multi-Objective Reinforcement Learning and Planning
Autonomous Agents and Multi-Agent Systems (AAMAS), 2021
Conor F. Hayes
Roxana Ruadulescu
Eugenio Bargiacchi
Johan Källström
Matthew Macfarlane
...
Ann Nowé
Gabriel de Oliveira Ramos
Marcello Restelli
Peter Vamplew
D. Roijers
OffRL
290
435
0
17 Mar 2021
Self-Organizing mmWave MIMO Cell-Free Networks With Hybrid Beamforming: A Hierarchical DRL-Based Design
IEEE Transactions on Communications (IEEE Trans. Commun.), 2021
Yasser F. Al-Eryani
Ekram Hossain
85
32
0
17 Mar 2021
Learning in Nonzero-Sum Stochastic Games with Potentials
International Conference on Machine Learning (ICML), 2021
D. Mguni
Yutong Wu
Yali Du
Yaodong Yang
Ziyi Wang
Minne Li
Ying Wen
Joel Jennings
Jun Wang
348
50
0
16 Mar 2021
Goal-constrained Sparse Reinforcement Learning for End-to-End Driving
Pranav Agarwal
Pierre de Beaucorps
Raoul de Charette
232
3
0
16 Mar 2021
Inclined Quadrotor Landing using Deep Reinforcement Learning
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Jacob E. Kooi
Robert Babuška
142
36
0
16 Mar 2021
Hierarchical Reinforcement Learning Framework for Stochastic Spaceflight Campaign Design
Journal of Spacecraft and Rockets (JSR), 2021
Yuji Takubo
Hao Chen
K. Ho
79
16
0
16 Mar 2021
Few Shot System Identification for Reinforcement Learning
Asia-Pacific Conference on Intelligent Robot Systems (AIRS), 2021
Karim Farid
Nourhan Sakr
160
2
0
16 Mar 2021
Growing 3D Artefacts and Functional Machines with Neural Cellular Automata
IEEE Symposium on Artificial Life (AL), 2021
Shyam Sudhakaran
Djordje Grbic
Siyan Li
Adam Katona
Elias Najarro
Claire Glanois
S. Risi
188
54
0
15 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
140
19
0
15 Mar 2021
Mutual Information State Intrinsic Control
International Conference on Learning Representations (ICLR), 2021
Rui Zhao
Yang Gao
Pieter Abbeel
Volker Tresp
Wenyuan Xu
SSL
142
25
0
15 Mar 2021
Learning robust driving policies without online exploration
IEEE International Conference on Robotics and Automation (ICRA), 2021
D. Graves
Nhat M. Nguyen
Kimia Hassanzadeh
Jun Jin
Jun Luo
OffRL
173
3
0
15 Mar 2021
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
International Conference on Machine Learning (ICML), 2021
Ilya Kostrikov
Jonathan Tompson
Rob Fergus
Ofir Nachum
OffRL
315
337
0
14 Mar 2021
Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction
Dong-hwan Lee
Niao He
Seungjae Lee
P. Karava
Jianghai Hu
AI4CE
47
0
0
14 Mar 2021
Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information
Neural Networks (NN), 2021
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
279
35
0
12 Mar 2021
A Quadratic Actor Network for Model-Free Reinforcement Learning
Matthias Weissenbacher
Yoshinobu Kawahara
78
1
0
11 Mar 2021
Previous
1
2
3
...
56
57
58
...
94
95
96
Next