Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1509.02971
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Continuous control with deep reinforcement learning
9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Continuous control with deep reinforcement learning"
50 / 4,796 papers shown
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Amal Feriani
Ekram Hossain
374
315
0
06 Nov 2020
Adversarial Skill Learning for Robust Manipulation
Pingcheng Jian
Chao Yang
Di Guo
Huaping Liu
F. Sun
AAML
148
9
0
06 Nov 2020
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Jonas Tebbe
Lukas Krauch
Yapeng Gao
A. Zell
302
44
0
06 Nov 2020
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments
M. Praeger
Yunhui Xie
J. Grant-Jacob
R. Eason
B. Mills
285
12
0
05 Nov 2020
Learning a Decentralized Multi-arm Motion Planner
Huy Ha
Jingxi Xu
Shuran Song
258
63
0
05 Nov 2020
Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning
Le Chen
Yu Ao
Florian Tschopp
Andrei Cramariuc
Michel Breyer
Jen Jen Chung
Roland Siegwart
Cesar Cadena
111
4
0
04 Nov 2020
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Xiaocong Chen
Lina Yao
Aixin Sun
Xianzhi Wang
Xiwei Xu
Liming Zhu
OffRL
125
30
0
04 Nov 2020
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken
Colin Daniels
Rosie Zhao
Anna M. Brandenberger
Prakash Panangaden
Doina Precup
143
0
0
03 Nov 2020
Representation Matters: Improving Perception and Exploration for Robotics
Markus Wulfmeier
Arunkumar Byravan
Tim Hertweck
I. Higgins
Ankush Gupta
...
Malcolm Reynolds
Denis Teplyashin
Agrim Gupta
Thomas Lampe
Martin Riedmiller
307
19
0
03 Nov 2020
Amortized Variational Deep Q Network
Haotian Zhang
Yuhao Wang
Jianyong Sun
Zongben Xu
BDL
140
0
0
03 Nov 2020
Episodic Linear Quadratic Regulators with Low-rank Transitions
Tianyu Wang
Lin F. Yang
180
3
0
03 Nov 2020
Reinforcement Learning with Efficient Active Feature Acquisition
Haiyan Yin
Yingzhen Li
Sinno Jialin Pan
Cheng Zhang
Sebastian Tschiatschek
OffRL
166
18
0
02 Nov 2020
Observation Space Matters: Benchmark and Optimization Algorithm
IEEE International Conference on Robotics and Automation (ICRA), 2020
J. Kim
Sehoon Ha
OOD
OffRL
216
12
0
02 Nov 2020
Optimizing Mixed Autonomy Traffic Flow With Decentralized Autonomous Vehicles and Multi-Agent RL
Eugene Vinitsky
Nathan Lichtlé
Kanaad Parvate
Alexandre M. Bayen
125
12
0
30 Oct 2020
Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
IEEE Robotics and Automation Letters (RA-L), 2020
Brijen Thananjeyan
Ashwin Balakrishna
Suraj Nair
Michael Luo
K. Srinivasan
M. Hwang
Joseph E. Gonzalez
Julian Ibarz
Chelsea Finn
Ken Goldberg
OffRL
299
268
0
29 Oct 2020
DeepQ Stepper: A framework for reactive dynamic walking on uneven terrain
IEEE International Conference on Robotics and Automation (ICRA), 2020
Avadesh Meduri
Majid Khadiv
Ludovic Righetti
134
13
0
28 Oct 2020
Learning to Represent Action Values as a Hypergraph on the Action Vertices
International Conference on Learning Representations (ICLR), 2020
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
158
24
0
28 Oct 2020
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Conference on Robot Learning (CoRL), 2020
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
264
15
0
27 Oct 2020
Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications
IEEE Transactions on Transportation Electrification (TE), 2020
Bin Xu
Jun Hou
Junzhe Shi
Huayi Li
Dhruvang Rathod
Zhe Wang
Z. Filipi
94
28
0
27 Oct 2020
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Samuele Tosatto
João Carvalho
Jan Peters
OffRL
234
8
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Journal of machine learning research (JMLR), 2020
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
235
46
0
27 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Journal of machine learning research (JMLR), 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
162
41
0
27 Oct 2020
Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills
IEEE International Conference on Robotics and Automation (ICRA), 2020
Samuele Tosatto
Georgia Chalvatzaki
Jan Peters
212
13
0
26 Oct 2020
Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network
International Conference on Control, Automation, Robotics and Vision (ICARCV), 2020
Niranjan Deshpande
Dominique Vaufreydaz
A. Spalanzani
120
18
0
26 Oct 2020
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Younggyo Seo
Kimin Lee
I. Clavera
Thanard Kurutach
Jinwoo Shin
Pieter Abbeel
234
45
0
26 Oct 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
188
11
0
25 Oct 2020
Dynamic Adversarial Patch for Evading Object Detection Models
Shahar Hoory
T. Shapira
A. Shabtai
Yuval Elovici
AAML
171
51
0
25 Oct 2020
Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search
Jakob J. Hollenstein
Erwan Renaudo
Matteo Saveriano
J. Piater
85
2
0
24 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
150
1
0
24 Oct 2020
Stabilizing Transformer-Based Action Sequence Generation For Q-Learning
Gideon Stein
Andrey Filchenkov
Arip Asadulaev
OffRL
200
2
0
23 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
304
21
0
23 Oct 2020
Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments
Jun Yamada
Youngwoon Lee
G. Salhotra
Karl Pertsch
Max Pflueger
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
158
48
0
22 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
209
25
0
21 Oct 2020
Improving Generalization in Reinforcement Learning with Mixture Regularization
Neural Information Processing Systems (NeurIPS), 2020
Kaixin Wang
Bingyi Kang
Jie Shao
Jiashi Feng
364
129
0
21 Oct 2020
Iterative Amortized Policy Optimization
Neural Information Processing Systems (NeurIPS), 2020
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
256
23
0
20 Oct 2020
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
316
20
0
20 Oct 2020
Quality of service based radar resource management using deep reinforcement learning
International Radar Conference (RADAR), 2020
S. Durst
S. Brüggenwirth
70
15
0
20 Oct 2020
Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing
Sayyed Jaffar Ali Raza
Apan Dastider
Mingjie Lin
63
1
0
20 Oct 2020
Proximal Policy Gradient: PPO with Policy Gradient
Ju-Seung Byun
Byungmoon Kim
Huamin Wang
OffRL
114
10
0
20 Oct 2020
Dream and Search to Control: Latent Space Planning for Continuous Control
Anurag Koul
Varun V. Kumar
Alan Fern
Somdeb Majumdar
165
6
0
19 Oct 2020
Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control
Conference on Robot Learning (CoRL), 2020
Guangzhi Tang
Neelesh Kumar
Raymond Yoo
Konstantinos Michmizos
205
101
0
19 Oct 2020
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
260
7
0
19 Oct 2020
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning
IEEE Control Systems Letters (L-CSS), 2020
Alessandro Giuseppi
A. Pietrabissa
89
9
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Neural Information Processing Systems (NeurIPS), 2020
Ling Pan
Qingpeng Cai
Longbo Huang
233
116
0
19 Oct 2020
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Conference on Robot Learning (CoRL), 2020
Hai V. Nguyen
Brett Daley
Xinchao Song
Chris Amato
Robert Platt
305
15
0
19 Oct 2020
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS
Mohit Sewak
S. K. Sahay
Hemant Rathore
114
19
0
16 Oct 2020
On the Guaranteed Almost Equivalence between Imitation Learning from Observation and Demonstration
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Zhihao Cheng
Liu Liu
Aishan Liu
Hao Sun
Meng Fang
Dacheng Tao
136
11
0
16 Oct 2020
Decentralized Multi-Agent Pursuit using Deep Reinforcement Learning
IEEE Robotics and Automation Letters (RA-L), 2020
C. de Souza
Rhys Newbury
Akansel Cosgun
P. Castillo
B. Vidolov
Dana Kulić
246
120
0
16 Oct 2020
A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly
Jieliang Luo
Hui Li
249
20
0
15 Oct 2020
Deep Learning of Koopman Representation for Control
Yiqiang Han
Wenjian Hao
Umesh Vaidya
AI4CE
101
131
0
15 Oct 2020
Previous
1
2
3
...
62
63
64
...
94
95
96
Next