Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,795 papers shown
Optimizing Electric Vehicle Charging Station Placement Using Reinforcement Learning and Agent-Based Simulations
Minh-Duc Nguyen
Dung D. Le
Phi Long Nguyen
56
0
0
03 Nov 2025
Logic-informed reinforcement learning for cross-domain optimization of large-scale cyber-physical systems
Guangxi Wan
Peng Zeng
Xiaoting Dong
Chunhe Song
Shijie Cui
Dong Li
Qingwei Dong
Y. Liu
Hongfei Bai
AI4CE
96
0
0
02 Nov 2025
Clustering-Based Weight Orthogonalization for Stabilizing Deep Reinforcement Learning
IEEE International Joint Conference on Neural Network (IJCNN), 2025
Guoqing Ma
Y. Zhang
Yuming Dai
Guangfu Hao
Yang Chen
S. Yu
OffRL
130
0
0
02 Nov 2025
Deep reinforcement learning for optimal trading with partial information
Andrea Macri
Sebastian Jaimungal
Fabrizio Lillo
73
1
0
31 Oct 2025
Reinforcement Learning for Long-Horizon Unordered Tasks: From Boolean to Coupled Reward Machines
Kristina Levina
Nikolaos Pappas
Athanasios Karapantelakis
Aneta Vulgarakis Feljan
Jendrik Seipp
151
0
0
31 Oct 2025
Morphology-Aware Graph Reinforcement Learning for Tensegrity Robot Locomotion
Chi Zhang
Mingrui Li
W. Tong
X. Y. Huang
AI4CE
103
0
0
30 Oct 2025
Reinforcement Learning for Robotic Safe Control with Force Sensing
Nan Lin
Linrui Zhang
Yuxuan Chen
Z. Chen
Yujun Zhu
Ruoxi Chen
Peichen Wu
Xiaoping Chen
60
9
0
30 Oct 2025
Real-DRL: Teach and Learn in Reality
Y. Mao
Yihao Cai
L. Sha
OffRL
133
0
0
30 Oct 2025
Adaptive Inverse Kinematics Framework for Learning Variable-Length Tool Manipulation in Robotics
Prathamesh Kothavale
Sravani Boddepalli
88
0
0
30 Oct 2025
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
Wenli Xiao
Haotian Lin
Andy Peng
Haoru Xue
Tairan He
...
Jimmy Wu
Zhengyi Luo
Linxi Fan
Guanya Shi
Yuke Zhu
VLM
569
5
0
30 Oct 2025
Off-policy Reinforcement Learning with Model-based Exploration Augmentation
Likun Wang
Xiangteng Zhang
Yinuo Wang
Guojian Zhan
Wenxuan Wang
Haoyu Gao
Jingliang Duan
Shengbo Eben Li
OffRL
171
0
0
29 Oct 2025
Energy-Efficient Autonomous Driving with Adaptive Perception and Robust Decision
Yuyang Xia
Zibo Liang
Liwei Deng
Yan Zhao
Han Su
Kai Zheng
116
0
0
29 Oct 2025
Trajectory Design for UAV-Based Low-Altitude Wireless Networks in Unknown Environments: A Digital Twin-Assisted TD3 Approach
Jihao Luo
Zesong Fei
Xinyi Wang
Le Zhao
Yuanhao Cui
Guangxu Zhu
Dusit Niyato
48
0
0
28 Oct 2025
Survey and Tutorial of Reinforcement Learning Methods in Process Systems Engineering
Maximilian Bloor
M. Mowbray
Ehecatl Antonio del Rio Chanona
Calvin Tsay
OffRL
132
0
0
28 Oct 2025
ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory Scoring
Zhenxin Li
Wenhao Yao
Zi Wang
Xinglong Sun
Jingde Chen
...
Maying Shen
Jingyu Song
Zuxuan Wu
Shiyi Lan
Jose M. Alvarez
163
2
0
28 Oct 2025
Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner
Kechen Meng
Sinuo Zhang
Rongpeng Li
Xiangming Meng
Chan Wang
Zhifeng Zhao
DiffM
160
0
0
27 Oct 2025
Transitive RL: Value Learning via Divide and Conquer
S. Park
Aditya Oberai
P. Atreya
Sergey Levine
OffRL
120
0
0
26 Oct 2025
Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
Guojian Zhan
Likun Wang
Pengcheng Wang
Feihong Zhang
Jingliang Duan
Masayoshi Tomizuka
Shengbo Eben Li
78
0
0
25 Oct 2025
Cloud-Fog-Edge Collaborative Computing for Sequential MIoT Workflow: A Two-Tier DDPG-Based Scheduling Framework
Yuhao Fu
Yinghao Zhang
Yalin Liu
Bishenghui Tao
Junhong Ruan
40
0
0
24 Oct 2025
Do You Trust the Process?: Modeling Institutional Trust for Community Adoption of Reinforcement Learning Policies
Naina Balepur
Xingrui Pei
Hari Sundaram
OffRL
76
0
0
24 Oct 2025
Continual Knowledge Adaptation for Reinforcement Learning
Jinwu Hu
Zihao Lian
Z. Wen
Chenghao Li
Guohao Chen
Xutao Wen
Bin Xiao
Mingkui Tan
CLL
KELM
181
1
0
22 Oct 2025
Autobidding Arena: unified evaluation of the classical and RL-based autobidding algorithms
Andrey Pudovikov
Alexandra Khirianova
Ekaterina Solodneva
Aleksandr Katrutsa
Egor Samosvat
Yuriy Dorn
101
0
0
22 Oct 2025
Efficient Model-Based Reinforcement Learning for Robot Control via Online Learning
Fang Nan
Hao Ma
Qinghua Guan
Josie Hughes
Michael Muehlebach
Marco Hutter
OffRL
121
1
0
21 Oct 2025
Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
104
0
0
21 Oct 2025
Ensemble based Closed-Loop Optimal Control using Physics-Informed Neural Networks
Jostein Barry-Straume
Adwait D. Verulkar
A. Sarshar
Andrey A. Popov
Adrian Sandu
84
0
0
21 Oct 2025
Actor-Free Continuous Control via Structurally Maximizable Q-Functions
Yigit Korkmaz
Urvi Bhuwania
Ayush Jain
Erdem Bıyık
OffRL
109
0
0
21 Oct 2025
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
Chengxiu Hua
Jiawen Gu
Yushun Tang
257
0
0
20 Oct 2025
ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing
Guanjie Cheng
Siyang Liu
Junqin Huang
Xinkui Zhao
Yin Wang
Mengying Zhu
Linghe Kong
Shuiguang Deng
116
0
0
20 Oct 2025
RL-Driven Security-Aware Resource Allocation Framework for UAV-Assisted O-RAN
International Conference on Wireless Communications and Mobile Computing (IWCMC), 2025
Zaineh Abughazzah
Emna Baccour
Loay Ismail
Amr M. Mohamed
Mounir Hamdi
63
0
0
20 Oct 2025
Learning to Design Soft Hands using Reward Models
Xueqian Bai
Nicklas Hansen
Adabhav Singh
Michael T Tolley
Yan Duan
Pieter Abbeel
Xiaolong Wang
Sha Yi
142
2
0
20 Oct 2025
D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks
Jundong Zhang
Yuhui Situ
Fanji Zhang
Rongji Deng
Tianqi Wei
OffRL
98
0
0
20 Oct 2025
OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
Woo-Jin Ahn
Sang-Ryul Baek
Yong-Jun Lee
H. Choi
M. Lim
OffRL
104
0
0
17 Oct 2025
RLAF: Reinforcement Learning from Automaton Feedback
Mahyar Alinejad
Alvaro Velasquez
Yue Wang
George Atia
OffRL
111
0
0
17 Oct 2025
RM-RL: Role-Model Reinforcement Learning for Precise Robot Manipulation
Xiangyu Chen
Chuhao Zhou
Yuxi Liu
Jianfei Yang
OffRL
151
0
0
16 Oct 2025
Procedural Game Level Design with Deep Reinforcement Learning
Miraç Buğra Özkan
111
0
0
16 Oct 2025
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
J. Obando-Ceron
Walter Mayor
Samuel Lavoie
Scott Fujimoto
Aaron Courville
Pablo Samuel Castro
147
1
0
15 Oct 2025
Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video Communication
Sami Khairy
Gabriel Mittag
Vishak Gopal
Ross Cutler
93
0
0
14 Oct 2025
Expert or not? assessing data quality in offline reinforcement learning
Arip Asadulaev
Fakhri Karray
Martin Takáč
OffRL
97
0
0
14 Oct 2025
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
Rui Hu
Yu Chen
Longbo Huang
142
0
0
14 Oct 2025
Diffusion Models for Reinforcement Learning: Foundations, Taxonomy, and Development
Changfu Xu
Jianxiong Guo
Yuzhu Liang
Haiyang Huang
Haodong Zou
Xi Zheng
Shui Yu
Xiaowen Chu
Jiannong Cao
Tian-sheng Wang
OffRL
AI4CE
208
0
0
14 Oct 2025
Pruning Cannot Hurt Robustness: Certified Trade-offs in Reinforcement Learning
James Pedley
Benjamin Etheridge
Stephen J. Roberts
Francesco Quinzan
OffRL
AAML
116
0
0
14 Oct 2025
Bayesian Optimization for Dynamic Pricing and Learning
Anush Anand
Pranav Agrawal
Tejas Bodas
133
0
0
14 Oct 2025
Rethinking the Role of Dynamic Sparse Training for Scalable Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Haoyu Wang
Shengchao Hu
Leszek Rutkowski
D. Tao
AI4CE
167
0
0
14 Oct 2025
Heterogeneous RBCs via deep multi-agent reinforcement learning
Federico Gabriele
Aldo Glielmo
Marco Taboga
72
1
0
14 Oct 2025
A Flexible Multi-Agent Deep Reinforcement Learning Framework for Dynamic Routing and Scheduling of Latency-Critical Services
Vincenzo Norman Vitale
A. Tulino
Andreas F. Molisch
Jaime Llorca
81
0
0
13 Oct 2025
A Primer on SO(3) Action Representations in Deep Reinforcement Learning
Martin Schuck
Sherif Samy
Angela P. Schoellig
101
0
0
13 Oct 2025
Population-Coded Spiking Neural Networks for High-Dimensional Robotic Control
Kanishkha Jaisankar
Xiaoyang Jiang
Feifan Liao
Jeethu Sreenivas Amuthan
104
0
0
12 Oct 2025
Reinforcement Fine-Tuning of Flow-Matching Policies for Vision-Language-Action Models
Mingyang Lyu
Yinqian Sun
Erliang Lin
Huangrui Li
Ruolin Chen
Feifei Zhao
Yi Zeng
113
0
0
11 Oct 2025
Experience-Efficient Model-Free Deep Reinforcement Learning Using Pre-Training
Ruoxing Yang
OffRL
61
0
0
11 Oct 2025
Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework
Kohio Deflesselle
Mélodie Daniel
Aly Magassouba
Miguel Aranda
Olivier Ly
102
0
0
11 Oct 2025