Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,795 papers shown
Analyzing the Hidden Activations of Deep Policy Networks: Why Representation Matters
Trevor A. McInroe
Michael Spurrier
J. Sieber
Stephen Conneely
97
0
0
11 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
International Conference on Learning Representations (ICLR), 2021
Benjamin Eysenbach
Sergey Levine
OOD
270
220
0
10 Mar 2021
Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2021
Tianhao Zhang
Yueheng Li
Shuai Li
Qiwei Ye
Chen Wang
Guangming Xie
OffRL
123
20
0
09 Mar 2021
Learning to Play Soccer From Scratch: Sample-Efficient Emergent Coordination through Curriculum-Learning and Competition
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Pavan Samtani
Francisco Leiva
Javier Ruiz-del-Solar
88
2
0
09 Mar 2021
Model-free Policy Learning with Reward Gradients
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
282
7
0
09 Mar 2021
Domain-Robust Visual Imitation Learning with Mutual Information Constraints
International Conference on Learning Representations (ICLR), 2021
Edoardo Cetin
Oya Celiktutan
OOD
DRL
196
22
0
08 Mar 2021
Instabilities of Offline RL with Pre-Trained Neural Representation
International Conference on Machine Learning (ICML), 2021
Ruosong Wang
Yifan Wu
Ruslan Salakhutdinov
Sham Kakade
OffRL
270
43
0
08 Mar 2021
A Crash Course on Reinforcement Learning
F. Yaghmaie
L. Ljung
143
3
0
08 Mar 2021
Learning a State Representation and Navigation in Cluttered and Dynamic Environments
IEEE Robotics and Automation Letters (RA-L), 2021
David Hoeller
Lorenz Wellhausen
Farbod Farshidian
Marco Hutter
SSL
220
89
0
07 Mar 2021
Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning
IEEE International Joint Conference on Neural Networks (IJCNN), 2021
Hidenori Itaya
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
K. Sugiura
165
23
0
06 Mar 2021
Can You Fix My Neural Network? Real-Time Adaptive Waveform Synthesis for Resilient Wireless Signal Classification
Salvatore D’oro
Francesco Restuccia
Tommaso Melodia
130
12
0
05 Mar 2021
Deep reinforcement learning in medical imaging: A literature review
S. Kevin Zhou
Hoang Ngan Le
Khoa Luu
Hien V Nguyen
N. Ayache
LM&MA
OffRL
MedIm
157
168
0
05 Mar 2021
Neuromechanics-based Deep Reinforcement Learning of Neurostimulation Control in FES cycling
International IEEE/EMBS Conference on Neural Engineering (NER), 2021
Nat Wannawas
Mahendran Subramanian
A. Faisal
171
19
0
04 Mar 2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Neural Information Processing Systems (NeurIPS), 2021
Lili Chen
Kimin Lee
A. Srinivas
Pieter Abbeel
OffRL
212
13
0
04 Mar 2021
An RL-Based Adaptive Detection Strategy to Secure Cyber-Physical Systems
Ipsita Koley
Sunandan Adhikary
Soumyajit Dey
212
1
0
04 Mar 2021
Reinforcement Learning for Orientation Estimation Using Inertial Sensors with Performance Guarantee
IEEE International Conference on Robotics and Automation (ICRA), 2021
Liang Hu
Yujie Tang
Zhipeng Zhou
Wei Pan
211
10
0
03 Mar 2021
Addressing Action Oscillations through Learning Policy Inertia
AAAI Conference on Artificial Intelligence (AAAI), 2021
Chong Chen
Hongyao Tang
Jianye Hao
Wulong Liu
Zhaopeng Meng
124
22
0
03 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
AAAI Conference on Artificial Intelligence (AAAI), 2021
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Jun Liu
Wulong Liu
Zhaopeng Meng
OffRL
195
5
0
03 Mar 2021
Design of an Affordable Prosthetic Arm Equipped with Deep Learning Vision-Based Manipulation
A. Imran
William Escobar
F. Barez
141
9
0
03 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
International Conference on Machine Learning (ICML), 2021
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
Matthieu Geist
OffRL
196
43
0
02 Mar 2021
Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space Search
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021
Kartik Hegde
Po-An Tsai
Sitao Huang
Vikas Chandra
A. Parashar
Christopher W. Fletcher
197
108
0
02 Mar 2021
Safe Learning of Uncertain Environments
F. Farokhi
Alex S. C. Leong
Iman Shames
Mohammad Zamani
149
0
0
02 Mar 2021
Sample Complexity and Overparameterization Bounds for Temporal Difference Learning with Neural Network Approximation
IEEE Transactions on Automatic Control (IEEE TAC), 2021
Semih Cayci
Siddhartha Satpathi
Niao He
F. I. R. Srikant
187
11
0
02 Mar 2021
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
Computer Vision and Pattern Recognition (CVPR), 2021
Xin Ye
Yezhou Yang
275
26
0
01 Mar 2021
Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning Approach
IEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021
Trevor Bonjour
Marina Haliem
A. Alsalem
Shilpa Thomas
Hongyu Li
Vaneet Aggarwal
Mayank Kejriwal
Bharat K. Bhargava
407
18
0
01 Mar 2021
Sim-to-Real Transfer for Robotic Manipulation with Tactile Sensory
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Zihan Ding
Ya-Yen Tsai
Wang Wei Lee
Bidan Huang
197
39
0
28 Feb 2021
Revisiting Peng's Q($\lambda$) for Modern Reinforcement Learning
International Conference on Machine Learning (ICML), 2021
Tadashi Kozuno
Yunhao Tang
Mark Rowland
Rémi Munos
Steven Kapturowski
Will Dabney
Michal Valko
David Abel
OffRL
146
20
0
27 Feb 2021
Reducing Conservativeness Oriented Offline Reinforcement Learning
Hongchang Zhang
Jianzhun Shao
Yuhang Jiang
Shuncheng He
Xiangyang Ji
OffRL
217
6
0
27 Feb 2021
Multi-Agent Path Planning based on MPC and DDPG
Junxiao Xue
Xiangya Kong
Bowei Dong
Mingliang Xu
148
10
0
26 Feb 2021
Off-Policy Imitation Learning from Observations
Neural Information Processing Systems (NeurIPS), 2021
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
210
94
0
25 Feb 2021
Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning
Rui Yang
Jiafei Lyu
Yu Yang
Jiangpeng Yan
Feng Luo
Dijun Luo
Lanqing Li
Xiu Li
178
7
0
25 Feb 2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
International Conference on Machine Learning (ICML), 2021
N. Lazić
Dong Yin
Yasin Abbasi-Yadkori
Csaba Szepesvári
OffRL
124
19
0
25 Feb 2021
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret
International Conference on Machine Learning (ICML), 2021
Asaf B. Cassel
Tomer Koren
OffRL
195
19
0
25 Feb 2021
Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers
Keidai Iiyama
Kento Tomita
Bhavi Jagatia
Tatsuwaki Nakagawa
K. Ho
73
14
0
24 Feb 2021
Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control
IEEE Transactions on Automation Science and Engineering (T-ASE), 2021
Ruidong Yan
Rui Jiang
Bin Jia
Jin Huang
Diange Yang
197
53
0
24 Feb 2021
Memory-based Deep Reinforcement Learning for POMDPs
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Lingheng Meng
R. Gorbet
Dana Kulic
367
122
0
24 Feb 2021
Combining Off and On-Policy Training in Model-Based Reinforcement Learning
Alexandre Borges
Arlindo L. Oliveira
172
5
0
24 Feb 2021
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism
Design Automation Conference (DAC), 2021
Jenny Yang
Seongmin Hong
Joo-Young Kim
112
22
0
24 Feb 2021
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
IEEE Robotics and Automation Letters (RA-L), 2021
Mingyu Cai
Mohammadhosein Hasanbeig
Shaoping Xiao
Alessandro Abate
Z. Kan
736
95
0
24 Feb 2021
Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
OffRL
255
15
0
23 Feb 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
International Conference on Machine Learning (ICML), 2021
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
271
29
0
23 Feb 2021
Differentiable Logic Machines
Matthieu Zimmer
Xuening Feng
Claire Glanois
Zhaohui Jiang
Jianyi Zhang
Paul Weng
Li Dong
Hao Jianye
Liu Wulong
AI4CE
304
28
0
23 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
160
12
0
23 Feb 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2021
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
343
88
0
23 Feb 2021
Exploring Supervised and Unsupervised Rewards in Machine Translation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Julia Ive
Zixu Wang
M. Fomicheva
Lucia Specia
123
2
0
22 Feb 2021
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2021
Brett Daley
Cameron Hickert
Chris Amato
OffRL
104
6
0
22 Feb 2021
Reinforcement Learning with Prototypical Representations
International Conference on Machine Learning (ICML), 2021
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
SSL
322
246
0
22 Feb 2021
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
252
18
0
22 Feb 2021
Improved Learning of Robot Manipulation Tasks via Tactile Intrinsic Motivation
IEEE Robotics and Automation Letters (RA-L), 2021
Nikola Vulin
Sammy Christen
Stefan Stevšić
Otmar Hilliges
123
29
0
22 Feb 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
International Conference on Learning Representations (ICLR), 2021
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
364
14
0
21 Feb 2021