Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1509.02971
Cited By
v1
v2
v3
v4
v5
v6 (latest)
Continuous control with deep reinforcement learning
9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Continuous control with deep reinforcement learning"
50 / 4,796 papers shown
Decentralized Multi-Agent Reinforcement Learning for Task Offloading Under Uncertainty
Yuanchao Xu
Amal Feriani
Ekram Hossain
OffRL
198
4
0
16 Jul 2021
An Information-state based Approach to the Optimal Output Feedback Control of Nonlinear Systems
R. Goyal
Ran A. Wang
M. Mohamed
Aayushman Sharma
S. Chakravorty
86
2
0
16 Jul 2021
Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2021
Jiandong Mu
Mengdi Wang
Feiwen Zhu
Jun Yang
Weiran Lin
Wei Zhang
OnRL
106
2
0
16 Jul 2021
Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning
Asian Conference on Machine Learning (ACML), 2021
Toshinori Kitamura
Lingwei Zhu
Takamitsu Matsubara
168
2
0
16 Jul 2021
NeuSaver: Neural Adaptive Power Consumption Optimization for Mobile Video Streaming
IEEE Transactions on Mobile Computing (IEEE TMC), 2021
Kyoungjun Park
Myungchul Kim
Laihyuk Park
48
4
0
15 Jul 2021
Plan-Based Relaxed Reward Shaping for Goal-Directed Tasks
International Conference on Learning Representations (ICLR), 2021
Ingmar Schubert
Ozgur S. Oguz
Marc Toussaint
OffRL
151
8
0
14 Jul 2021
Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Baihe Huang
Kaixuan Huang
Sham Kakade
Jason D. Lee
Qi Lei
Runzhe Wang
Jiaqi Yang
193
10
0
14 Jul 2021
A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization
Zhenning Li
Chengzhong Xu
Guohui Zhang
134
8
0
13 Jul 2021
Reinforcement Learning based Proactive Control for Transmission Grid Resilience to Wildfire
S. U. Kadir
Subir Majumder
A. Chhokra
A. Dubey
H. Neema
Aron Laszka
A. Srivastava
53
2
0
12 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
253
98
0
12 Jul 2021
Cautious Actor-Critic
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
AAML
173
2
0
12 Jul 2021
Behavior Self-Organization Supports Task Inference for Continual Robot Learning
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Muhammad Burhan Hafez
S. Wermter
CLL
SSL
115
13
0
09 Jul 2021
Aligning an optical interferometer with beam divergence control and continuous action space
Conference on Robot Learning (CoRL), 2021
Stepan Makarenko
Dmitry Sorokin
Alexander Ulanov
A. Lvovsky
AI4CE
121
4
0
09 Jul 2021
RMA: Rapid Motor Adaptation for Legged Robots
Ashish Kumar
Zipeng Fu
Deepak Pathak
Jitendra Malik
890
745
0
08 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
International Conference on Machine Learning (ICML), 2021
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
374
76
0
08 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoV
LM&Ro
494
0
0
07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
267
6
0
07 Jul 2021
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning
Erdun Gao
Fan Feng
Chaochao Lu
Sara Magliacane
Kun Zhang
321
73
0
06 Jul 2021
Gradient Importance Learning for Incomplete Observations
Qitong Gao
Dong Wang
Joshua D. Amason
Siyang Yuan
Chenyang Tao
Ricardo Henao
M. Hadziahmetovic
Lawrence Carin
Miroslav Pajic
133
10
0
05 Jul 2021
Reward-Adaptive Reinforcement Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion
Changxin Huang
Guangrun Wang
Zhibo Zhou
Ronghui Zhang
Liang Lin
133
35
0
05 Jul 2021
Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics
N. Botteghi
M. Poel
B. Sirmaçek
C. Brune
174
3
0
04 Jul 2021
Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces
N. Botteghi
K. Alaa
M. Poel
B. Sirmaçek
C. Brune
A. Mersha
Stefano Stramigioli
SSL
132
11
0
04 Jul 2021
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents
Grgur Kovač
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
ALM
257
6
0
02 Jul 2021
Embodiment and Computational Creativity
Christian Guckelsberger
Anna Kantosalo
S. Negrete-Yankelevich
T. Takala
111
18
0
02 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
364
161
0
01 Jul 2021
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
261
23
0
01 Jul 2021
Deep Multiagent Reinforcement Learning: Challenges and Directions
Artificial Intelligence Review (AIR), 2021
Annie Wong
Thomas Bäck
Anna V. Kononova
Aske Plaat
AI4CE
266
153
0
29 Jun 2021
Survivable Robotic Control through Guided Bayesian Policy Search with Deep Reinforcement Learning
Sayyed Jaffar Ali Raza
Apan Dastider
Mingjie Lin
75
1
0
29 Jun 2021
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
238
14
0
29 Jun 2021
Policy Regularization via Noisy Advantage Values for Cooperative Multi-agent Actor-Critic methods
Jian Hu
Siyue Hu
Shih-Wei Liao
659
21
0
27 Jun 2021
Continuous Control with Deep Reinforcement Learning for Autonomous Vessels
Nader Zare
Bruno Brandoli
Mahtab Sarvmaili
Amilcar Soares
Stan Matwin
167
9
0
27 Jun 2021
POLAR: A Polynomial Arithmetic Framework for Verifying Neural-Network Controlled Systems
Automated Technology for Verification and Analysis (ATVA), 2021
Chao Huang
Jiameng Fan
Zhilu Wang
Yixuan Wang
Weichao Zhou
Jiajun Li
Xin Chen
Wenchao Li
Qi Zhu
315
55
0
25 Jun 2021
Active Learning in Robotics: A Review of Control Principles
Mechatronics (Oxford) (MO), 2021
Annalisa T. Taylor
Thomas A. Berrueta
Todd Murphey
259
88
0
25 Jun 2021
panda-gym: Open-source goal-conditioned environments for robotic learning
Quentin Gallouedec
Nicolas Cazin
Emmanuel Dellandrea
Liming Chen
OffRL
148
99
0
25 Jun 2021
Density Constrained Reinforcement Learning
Zengyi Qin
Yuxiao Chen
Chuchu Fan
190
38
0
24 Jun 2021
Evolving Hierarchical Memory-Prediction Machines in Multi-Task Reinforcement Learning
Stephen Kelly
Tatiana Voegerl
W. Banzhaf
C. Gondro
178
17
0
23 Jun 2021
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
Stephen James
Kentaro Wada
Tristan Laidlow
Andrew J. Davison
329
173
0
23 Jun 2021
Imitation Learning: Progress, Taxonomies and Challenges
Boyuan Zheng
Sunny Verma
Jianlong Zhou
Ivor Tsang
Fang Chen
297
131
0
23 Jun 2021
Local policy search with Bayesian optimization
Neural Information Processing Systems (NeurIPS), 2021
Sarah Müller
Alexander von Rohr
Sebastian Trimpe
BDL
215
53
0
22 Jun 2021
Policy Smoothing for Provably Robust Reinforcement Learning
International Conference on Learning Representations (ICLR), 2021
Aounon Kumar
Alexander Levine
Soheil Feizi
AAML
283
62
0
21 Jun 2021
Multi-Agent Curricula and Emergent Implicit Signaling
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Niko A. Grupen
Daniel D. Lee
B. Selman
271
11
0
21 Jun 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
International Conference on Machine Learning (ICML), 2021
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
351
118
0
21 Jun 2021
Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information
Franck Djeumou
Ufuk Topcu
145
5
0
19 Jun 2021
Boosting Offline Reinforcement Learning with Residual Generative Modeling
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Hua Wei
Deheng Ye
Zhao Liu
Hao Wu
Bo Yuan
Qiang Fu
Wei Yang
Ruoyao Xiao
OffRL
259
10
0
19 Jun 2021
Multi-Task Learning for User Engagement and Adoption in Live Video Streaming Events
Stefanos Antaris
Dimitrios Rafailidis
Romina Arriaza
OffRL
104
0
0
18 Jun 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
Neural Information Processing Systems (NeurIPS), 2021
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
226
48
0
18 Jun 2021
Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result
Maytus Piriyajitakonkij
Sirawaj Itthipuripat
Theerawit Wilaiprasitporn
Nat Dilokthanakul
FAtt
97
2
0
18 Jun 2021
No-frills Dynamic Planning using Static Planners
IEEE International Conference on Robotics and Automation (ICRA), 2021
Mara Levy
Vasista Ayyagari
Abhinav Shrivastava
87
0
0
17 Jun 2021
A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents
Yifei Bi
Xinyi Chen
Caihui Xiao
52
2
0
17 Jun 2021
Learning from Demonstration without Demonstrations
Tom Blau
Gilad Francis
Philippe Morere
OffRL
134
1
0
17 Jun 2021
Previous
1
2
3
...
52
53
54
...
94
95
96
Next
Page 53 of 96
Page
of 96
Go