Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
arXiv: 1509.02971

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Amal Feriani
Ekram Hossain
374
315
0
06 Nov 2020
Adversarial Skill Learning for Robust Manipulation
Pingcheng Jian
Chao Yang
Di Guo
Huaping Liu
F. Sun
AAML
148
9
0
06 Nov 2020
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Jonas Tebbe
Lukas Krauch
Yapeng Gao
A. Zell
302
44
0
06 Nov 2020
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments
M. Praeger
Yunhui Xie
J. Grant-Jacob
R. Eason
B. Mills
285
12
0
05 Nov 2020
Learning a Decentralized Multi-arm Motion Planner
Huy Ha
Jingxi Xu
Shuran Song
258
63
0
05 Nov 2020
Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning
Le Chen
Yu Ao
Florian Tschopp
Andrei Cramariuc
Michel Breyer
Jen Jen Chung
Roland Siegwart
Cesar Cadena
111
4
0
04 Nov 2020
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Xiaocong Chen
Lina Yao
Aixin Sun
Xianzhi Wang
Xiwei Xu
Liming Zhu
OffRL
125
30
0
04 Nov 2020
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken
Colin Daniels
Rosie Zhao
Anna M. Brandenberger
Prakash Panangaden
Doina Precup
143
0
0
03 Nov 2020
Representation Matters: Improving Perception and Exploration for Robotics
Markus Wulfmeier
Arunkumar Byravan
Tim Hertweck
I. Higgins
Ankush Gupta
...
Malcolm Reynolds
Denis Teplyashin
Agrim Gupta
Thomas Lampe
Martin Riedmiller
307
19
0
03 Nov 2020
Amortized Variational Deep Q Network
Haotian Zhang
Yuhao Wang
Jianyong Sun
Zongben Xu
BDL
140
0
0
03 Nov 2020
Episodic Linear Quadratic Regulators with Low-rank Transitions
Tianyu Wang
Lin F. Yang
180
3
0
03 Nov 2020
Reinforcement Learning with Efficient Active Feature Acquisition
Haiyan Yin
Yingzhen Li
Sinno Jialin Pan
Cheng Zhang
Sebastian Tschiatschek
OffRL
166
18
0
02 Nov 2020
Observation Space Matters: Benchmark and Optimization Algorithm
IEEE International Conference on Robotics and Automation (ICRA), 2020
J. Kim
Sehoon Ha
OOD
OffRL
216
12
0
02 Nov 2020
Optimizing Mixed Autonomy Traffic Flow With Decentralized Autonomous Vehicles and Multi-Agent RL
Eugene Vinitsky
Nathan Lichtlé
Kanaad Parvate
Alexandre M. Bayen
125
12
0
30 Oct 2020
Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
IEEE Robotics and Automation Letters (RA-L), 2020
Brijen Thananjeyan
Ashwin Balakrishna
Suraj Nair
Michael Luo
K. Srinivasan
M. Hwang
Joseph E. Gonzalez
Julian Ibarz
Chelsea Finn
Ken Goldberg
OffRL
299
268
0
29 Oct 2020
DeepQ Stepper: A framework for reactive dynamic walking on uneven terrain
IEEE International Conference on Robotics and Automation (ICRA), 2020
Avadesh Meduri
Majid Khadiv
Ludovic Righetti
134
13
0
28 Oct 2020
Learning to Represent Action Values as a Hypergraph on the Action Vertices
International Conference on Learning Representations (ICLR), 2020
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
158
24
0
28 Oct 2020
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Conference on Robot Learning (CoRL), 2020
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
264
15
0
27 Oct 2020
Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications
IEEE Transactions on Transportation Electrification (TE), 2020
Bin Xu
Jun Hou
Junzhe Shi
Huayi Li
Dhruvang Rathod
Zhe Wang
Z. Filipi
94
28
0
27 Oct 2020
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Samuele Tosatto
João Carvalho
Jan Peters
OffRL
234
8
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Journal of Machine Learning Research (JMLR), 2020
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
235
46
0
27 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Journal of Machine Learning Research (JMLR), 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
162
41
0
27 Oct 2020
Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills
IEEE International Conference on Robotics and Automation (ICRA), 2020
Samuele Tosatto
Georgia Chalvatzaki
Jan Peters
212
13
0
26 Oct 2020
Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network
International Conference on Control, Automation, Robotics and Vision (ICARCV), 2020
Niranjan Deshpande
Dominique Vaufreydaz
A. Spalanzani
120
18
0
26 Oct 2020
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Younggyo Seo
Kimin Lee
I. Clavera
Thanard Kurutach
Jinwoo Shin
Pieter Abbeel
234
45
0
26 Oct 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
188
11
0
25 Oct 2020
Dynamic Adversarial Patch for Evading Object Detection Models
Shahar Hoory
T. Shapira
A. Shabtai
Yuval Elovici
AAML
171
51
0
25 Oct 2020
Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search
Jakob J. Hollenstein
Erwan Renaudo
Matteo Saveriano
J. Piater
85
2
0
24 Oct 2020
Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning
Xiyao Wang
Junge Zhang
Wenzhen Huang
Qiyue Yin
150
1
0
24 Oct 2020
Stabilizing Transformer-Based Action Sequence Generation For Q-Learning
Gideon Stein
Andrey Filchenkov
Arip Asadulaev
OffRL
200
2
0
23 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
304
21
0
23 Oct 2020
Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments
Jun Yamada
Youngwoon Lee
G. Salhotra
Karl Pertsch
Max Pflueger
Gaurav Sukhatme
Joseph J. Lim
Péter Englert
158
48
0
22 Oct 2020
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking
Hongda Shen
Eren Kurshan
209
25
0
21 Oct 2020
Improving Generalization in Reinforcement Learning with Mixture Regularization
Neural Information Processing Systems (NeurIPS), 2020
Kaixin Wang
Bingyi Kang
Jie Shao
Jiashi Feng
364
129
0
21 Oct 2020
Iterative Amortized Policy Optimization
Neural Information Processing Systems (NeurIPS), 2020
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
256
23
0
20 Oct 2020
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
316
20
0
20 Oct 2020
Quality of service based radar resource management using deep reinforcement learning
International Radar Conference (RADAR), 2020
S. Durst
S. Brüggenwirth
70
15
0
20 Oct 2020
Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing
Sayyed Jaffar Ali Raza
Apan Dastider
Mingjie Lin
63
1
0
20 Oct 2020
Proximal Policy Gradient: PPO with Policy Gradient
Ju-Seung Byun
Byungmoon Kim
Huamin Wang
OffRL
114
10
0
20 Oct 2020
Dream and Search to Control: Latent Space Planning for Continuous Control
Anurag Koul
Varun V. Kumar
Alan Fern
Somdeb Majumdar
165
6
0
19 Oct 2020
Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control
Conference on Robot Learning (CoRL), 2020
Guangzhi Tang
Neelesh Kumar
Raymond Yoo
Konstantinos Michmizos
205
101
0
19 Oct 2020
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
260
7
0
19 Oct 2020
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning
IEEE Control Systems Letters (L-CSS), 2020
Alessandro Giuseppi
A. Pietrabissa
89
9
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Neural Information Processing Systems (NeurIPS), 2020
Ling Pan
Qingpeng Cai
Longbo Huang
233
116
0
19 Oct 2020
Belief-Grounded Networks for Accelerated Robot Learning under Partial Observability
Conference on Robot Learning (CoRL), 2020
Hai V. Nguyen
Brett Daley
Xinchao Song
Chris Amato
Robert Platt
305
15
0
19 Oct 2020
DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS
Mohit Sewak
S. K. Sahay
Hemant Rathore
114
19
0
16 Oct 2020
On the Guaranteed Almost Equivalence between Imitation Learning from Observation and Demonstration
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Zhihao Cheng
Liu Liu
Aishan Liu
Hao Sun
Meng Fang
Dacheng Tao
136
11
0
16 Oct 2020
Decentralized Multi-Agent Pursuit using Deep Reinforcement Learning
IEEE Robotics and Automation Letters (RA-L), 2020
C. de Souza
Rhys Newbury
Akansel Cosgun
P. Castillo
B. Vidolov
Dana Kulić
246
120
0
16 Oct 2020
A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly
Jieliang Luo
Hui Li
249
20
0
15 Oct 2020
Deep Learning of Koopman Representation for Control
Yiqiang Han
Wenjian Hao
Umesh Vaidya
AI4CE
101
131
0
15 Oct 2020