Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
Hierarchical Reinforcement Learning for Air-to-Air Combat
International Conference on Unmanned Aircraft Systems (ICUAS), 2021
Adrian P. Pope
J. Ide
Daria Mićović
Henry Diaz
D. Rosenbluth
Lee Ritholtz
Jason C. Twedt
Thayne T. Walker
K. Alcedo
D. Javorsek
115
90
0
03 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
165
12
0
03 May 2021
Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks
AAAI Conference on Artificial Intelligence (AAAI), 2021
Qianliang Wu
Jin Xie
Zhiqiang Wang
OffRL
127
18
0
03 May 2021
Learning to drive from a world on rails
IEEE International Conference on Computer Vision (ICCV), 2021
Di Chen
V. Koltun
Philipp Krahenbuhl
389
151
0
03 May 2021
Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning
Saeed Tafazzol
Erfan Fathi
Mahdi Rezaei
Ehsan Asali
94
2
0
02 May 2021
Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Changyin Sun
Lijun Sun
127
23
0
02 May 2021
Self-supervised Augmentation Consistency for Adapting Semantic Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
Nikita Araslanov
Stefan Roth
313
267
0
30 Apr 2021
A Physics-Constrained Deep Learning Model for Simulating Multiphase Flow in 3D Heterogeneous Porous Media
B. Yan
D. Harp
Bailian Chen
R. Pawar
AI4CE
167
88
0
30 Apr 2021
On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning
IEEE Robotics and Automation Letters (RA-L), 2021
Diego Ferigo
Raffaello Camoriano
Paolo Maria Viceconte
Daniele Calandriello
Silvio Traversaro
Lorenzo Rosasco
Daniele Pucci
151
21
0
29 Apr 2021
Capability Iteration Network for Robot Path Planning
Buqing Nie
Yue Gao
Yi Mei
Feng Gao
3DV
127
7
0
29 Apr 2021
End-to-End Intersection Handling using Multi-Agent Deep Reinforcement Learning
Alessandro Paolo Capasso
Paolo Maramotti
Anthony Dell'Eva
A. Broggi
329
20
0
28 Apr 2021
Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients
Bozhidar Vasilev
Tarun Gupta
Bei Peng
Shimon Whiteson
152
2
0
27 Apr 2021
SocialAI 0.1: Towards a Benchmark to Stimulate Research on Socio-Cognitive Abilities in Deep Reinforcement Learning Agents
Grgur Kovač
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
ALM
LM&Ro
223
2
0
27 Apr 2021
Computational Performance of Deep Reinforcement Learning to find Nash Equilibria
Computational Economics (Comput. Econ.), 2021
C. Graf
Viktor Zobernig
Johannes Schmidt
Claude Klöckl
126
10
0
26 Apr 2021
ANT: Learning Accurate Network Throughput for Better Adaptive Video Streaming
Jiaoyang Yin
Yiling Xu
Hao Chen
Yunfei Zhang
S. Appleby
Zhan Ma
81
15
0
26 Apr 2021
Efficient Hyperparameter Optimization for Physics-based Character Animation
Proceedings of the ACM on Computer Graphics and Interactive Techniques (PACMCGIT), 2021
Zeshi Yang
Zhiqi Yin
AI4CE
184
9
0
26 Apr 2021
Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data
Adrian Remonda
Sarah Krebs
Eduardo E. Veas
Granit Luzhnica
Roman Kern
OffRL
203
28
0
22 Apr 2021
Tackling Variabilities in Autonomous Driving
Yuqiong Qi
Yang Hu
Haibin Wu
Shen Li
Haiyu Mao
Xiaochun Ye
Ninghui Sun
85
1
0
21 Apr 2021
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching
Computer Vision and Pattern Recognition (CVPR), 2021
Shiyang Yan
Li Yu
Yuan Xie
260
34
0
21 Apr 2021
Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning
Zikang Xiong
Suresh Jagannathan
194
6
0
20 Apr 2021
Outcome-Driven Reinforcement Learning via Variational Inference
Neural Information Processing Systems (NeurIPS), 2021
Tim G. J. Rudner
Vitchyr H. Pong
R. McAllister
Y. Gal
Sergey Levine
291
22
0
20 Apr 2021
Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting
Social Science Research Network (SSRN), 2021
Eric Benhamou
David Saltiel
S. Tabachnik
Sui Kai Wong
François Chareyron
OOD
219
4
0
19 Apr 2021
Deep Reinforcement Learning in a Monetary Model
Mingli Chen
Andreas Joseph
Michael Kumhof
Xinlei Pan
Xuan Zhou
39
31
0
19 Apr 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDL
MoE
121
33
0
19 Apr 2021
Planning with Expectation Models for Control
Katya Kudashkina
Yi Wan
Abhishek Naik
R. Sutton
OffRL
143
0
0
17 Apr 2021
In Defense of the Paper
Owen Lockwood
74
0
0
16 Apr 2021
Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems
Procedia CIRP (PC), 2021
Alexandru Rinciog
Anne Meyer
OffRL
111
13
0
16 Apr 2021
Reinforced Neighborhood Selection Guided Multi-Relational Graph Neural Networks
Hao Peng
Ruitong Zhang
Yingtong Dou
Renyu Yang
Jingyi Zhang
Philip S. Yu
491
137
0
16 Apr 2021
Curiosity-Driven Exploration via Latent Bayesian Surprise
AAAI Conference on Artificial Intelligence (AAAI), 2021
Pietro Mazzaglia
Ozan Çatal
Tim Verbelen
Bart Dhoedt
235
42
0
15 Apr 2021
Discover the Hidden Attack Path in Multi-domain Cyberspace Based on Reinforcement Learning
Lei Zhang
Wei Bai
Wei Li
Shiming Xia
Qibin Zheng
47
2
0
15 Apr 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Neural Information Processing Systems (NeurIPS), 2021
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
225
26
0
13 Apr 2021
Data-Driven Reinforcement Learning for Virtual Character Animation Control
V. Gamage
Cathy Ennis
Robert J Ross
AI4CE
71
0
0
13 Apr 2021
Learning and Planning in Complex Action Spaces
International Conference on Machine Learning (ICML), 2021
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
M. Barekatain
Simon Schmitt
David Silver
224
89
0
13 Apr 2021
Podracer architectures for scalable Reinforcement Learning
Matteo Hessel
M. Kroiss
Aidan Clark
Iurii Kemaev
John Quan
Thomas Keck
Fabio Viola
H. V. Hasselt
175
46
0
13 Apr 2021
Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement Learning
IEEE Access (IEEE Access), 2021
Takato Okudo
Seiji Yamada
OffRL
127
23
0
13 Apr 2021
Reward Shaping with Dynamic Trajectory Aggregation
IEEE International Joint Conference on Neural Network (IJCNN), 2021
Takato Okudo
Seiji Yamada
82
2
0
13 Apr 2021
Deep Deterministic Path Following
Georg Hess
William Ljungbergh
BDL
66
0
0
13 Apr 2021
Two-stage training algorithm for AI robot soccer
PeerJ Computer Science (PeerJ Comput. Sci.), 2021
Taeyoung Kim
L. Vecchietti
Kyujin Choi
Sanem Sariel
Dongsoo Har
182
8
0
13 Apr 2021
Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack
Xinyi Zhang
Chengfang Fang
Jie Shi
MIACV
MLAU
SILM
185
20
0
13 Apr 2021
Survey on reinforcement learning for language processing
Artificial Intelligence Review (AIR), 2021
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
327
127
0
12 Apr 2021
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning
Ammar Fayad
M. Ibrahim
BDL
134
3
0
09 Apr 2021
Learning Sampling Policy for Faster Derivative Free Optimization
Zhou Zhai
Bin Gu
Heng-Chiao Huang
150
1
0
09 Apr 2021
Reinforced Attention for Few-Shot Learning and Beyond
Computer Vision and Pattern Recognition (CVPR), 2021
Jie Hong
Pengfei Fang
Weihao Li
Tong Zhang
Christian Simon
Mehrtash Harandi
L. Petersson
166
54
0
09 Apr 2021
ACERAC: Efficient reinforcement learning in fine time discretization
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Jakub Łyskawa
Paweł Wawrzyński
234
3
0
08 Apr 2021
Efficient time stepping for numerical integration using reinforcement learning
SIAM Journal on Scientific Computing (SISC), 2021
M. Dellnitz
Eyke Hüllermeier
M. Lucke
Sina Ober-Blobaum
Christian Offen
Sebastian Peitz
Karlson Pfannschmidt
72
7
0
08 Apr 2021
Progressive extension of reinforcement learning action dimension for asymmetric assembly tasks
Yuhang Gai
Jiuming Guo
Dan Wu
Ken Chen
77
0
0
06 Apr 2021
Fast Design Space Exploration of Nonlinear Systems: Part II
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2021
Prerit Terway
Kenza Hamidouche
N. Jha
108
4
0
05 Apr 2021
NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning
Quanlin Chen
OffRL
192
0
0
05 Apr 2021
A Dual-Critic Reinforcement Learning Framework for Frame-level Bit Allocation in HEVC/H.265
Data Compression Conference (DCC), 2021
Yung-Han Ho
Guo-Lun Jin
Yun Liang
Wen-Hsiao Peng
Xiaobo Li
96
17
0
05 Apr 2021
A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn
Cybersecurity and Cyberforensics Conference (CC), 2021
Hao Xiong
Huanhui Cao
Lin Zhang
Wenjie Lu
57
5
0
03 Apr 2021