Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
Distributed Learning over Arbitrary Topology: Linear Speed-Up with Polynomial Transient Time
Runze You
Shi Pu
213
2
0
20 Mar 2025
Design of Reward Function on Reinforcement Learning for Automated Driving
IFAC-PapersOnLine, 2025
Takeru Goto
Yuki Kizumi
Shun Iwasaki
173
8
0
20 Mar 2025
Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement
Jong-Hyun Jeong
Hongki Jo
Qiang Zhou
Tahsin Afroz Hoque Nishat
Lang Wu
82
3
0
20 Mar 2025
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs
International Conference on Learning Representations (ICLR), 2025
Wei-Ting Hung
Shao-Hua Sun
Ping-Chun Hsieh
257
3
0
17 Mar 2025
Dense Policy: Bidirectional Autoregressive Learning of Actions
Yue Su
Xinyu Zhan
Hongjie Fang
Han Xue
Hao-Shu Fang
Yongqian Li
Cewu Lu
Lixin Yang
VGen
305
10
0
17 Mar 2025
Towards Better Sample Efficiency in Multi-Agent Reinforcement Learning via Exploration
Amir Baghi
Jens Sjölund
Joakim Bergdahl
Linus Gisslén
Alessandro Sestini
365
1
0
17 Mar 2025
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
262
0
0
15 Mar 2025
Generative Modeling for Adversarial Lane-Change Scenarios
Chuancheng Zhang
Zhenhao Wang
Jiangcheng Wang
Kun Su
Qiang Lv
Bin Jiang
Kunkun Hao
Wenyu Wang
194
0
0
15 Mar 2025
Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning
Peter Böhm
Archie C. Chapman
Pauline Pounds
310
0
0
14 Mar 2025
Deep Learning for Time Series Forecasting: A Survey
International Journal of Machine Learning and Cybernetics (IJMLC), 2025
X. Kong
Zhenghao Chen
Weiyao Liu
Kaili Ning
Lechao Zhang
Syauqie Muhammad Marier
Yichen Liu
Yuhao Chen
Xiwei Xu
AI4TS AI4CE
345
38
0
13 Mar 2025
Safe exploration in reproducing kernel Hilbert spaces
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Abdullah Tokmak
Kiran G. Krishnan
Thomas B. Schon
Dominik Baumann
209
4
0
13 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
233
0
0
10 Mar 2025
Precise Insulin Delivery for Artificial Pancreas: A Reinforcement Learning Optimized Adaptive Fuzzy Control Approach
Omar Mameche
Abdelhadi Abedou
Taqwa Mezaache
Mohamed Tadjine
214
1
0
09 Mar 2025
Vairiational Stochastic Games
Zhiyu Zhao
Haifeng Zhang
240
0
0
08 Mar 2025
THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks
Chaoran Xiong
Litao Wei
Kehui Ma
Zhen Sun
Yan Xiang
Zihan Nan
Trieu-Kien Truong
Ling Pei
256
24
0
07 Mar 2025
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
International Conference on Learning Representations (ICLR), 2025
Xiangxin Zhou
Yi Xiao
Haowei Lin
Xinheng He
Jiaqi Guan
Yang Wang
Qiang Liu
F. I. S. Kevin Zhou
Liang Wang
Jianzhu Ma
AI4CE
221
1
0
06 Mar 2025
Closing the Intent-to-Behavior Gap via Fulfillment Priority Logic
B. Mabsout
Abdelrahman AbdelGawad
R. Mancuso
492
1
0
04 Mar 2025
Reinforcement Learning-based Threat Assessment
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
223
0
0
04 Mar 2025
Is Bellman Equation Enough for Learning Control?
Haoxiang You
Lekan Molu
Ian Abraham
326
0
0
04 Mar 2025
Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations
Chonlam Ho
Jianshu Hu
Jian Shu
Qi Dou
Yutong Ban
MedIm
320
1
0
03 Mar 2025
Runtime Learning of Quadruped Robots in Wild Environments
Yihao Cai
Y. Mao
L. Sha
H. Cao
Marco Caccamo
273
3
0
02 Mar 2025
Scalable Reinforcement Learning for Virtual Machine Scheduling
Junjie Sheng
JieHao Wu
Haochuan Cui
Yiqiu Hu
Wenli Zhou
Lei Zhu
Qian Peng
Wenhao Li
Xiangfeng Wang
OffRL
147
0
0
01 Mar 2025
Actor-Critic Cooperative Compensation to Model Predictive Control for Off-Road Autonomous Vehicles Under Unknown Dynamics
IEEE International Conference on Robotics and Automation (ICRA), 2025
Prakhar Gupta
J. Smereka
Yunyi Jia
206
0
0
01 Mar 2025
BodyGen: Advancing Towards Efficient Embodiment Co-Design
International Conference on Learning Representations (ICLR), 2025
Haofei Lu
Zhe Wu
Junliang Xing
Jianshu Li
Ruoyu Li
Zhe Li
Yuanchun Shi
241
9
0
01 Mar 2025
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
455
0
0
28 Feb 2025
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Maria Krinner
Elie Aljalbout
Angel Romero
Davide Scaramuzza
OffRL
272
8
0
27 Feb 2025
On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+(λ,λ))-GA
Annual Conference on Genetic and Evolutionary Computation (GECCO), 2025
Tai Nguyen
Phong Le
André Biedenkapp
Carola Doerr
Nguyen Dang
224
3
0
27 Feb 2025
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Thomas Hickling
Maxwell Hogan
Abdulla Tammam
Nabil Aouf
251
5
0
27 Feb 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
AAAI Conference on Artificial Intelligence (AAAI), 2025
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
408
0
0
27 Feb 2025
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting
Yifan Hu
Yuante Li
Peiyuan Liu
Yuxia Zhu
Naiqi Li
Tao Dai
Shu-Tao Xia
Dawei Cheng
Changjun Jiang
AI4TS
441
12
0
26 Feb 2025
XSS Adversarial Attacks Based on Deep Reinforcement Learning: A Replication and Extension Study
Samuele Pasini
Gianluca Maragliano
Jinhan Kim
Paolo Tonella
AAML
159
0
0
26 Feb 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
422
2
0
26 Feb 2025
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Meng Feng
Viraj Parimi
B. Williams
421
4
0
25 Feb 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
IEEE Systems Conference (SysCon), 2025
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
292
1
0
24 Feb 2025
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Adaptive Agents and Multi-Agent Systems (AAMAS), 2025
Fangqi Liu
Rishav Sen
J. P. Talusan
Ava Pettet
Aaron Kandel
Yoshinori Suzue
Ayan Mukhopadhyay
A. Dubey
OffRL
217
3
0
24 Feb 2025
A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Hamidreza Raei
Elena De Momi
Arash Ajoudani
356
0
0
24 Feb 2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
CML
218
0
0
24 Feb 2025
Exploring Sentiment Manipulation by LLM-Enabled Intelligent Trading Agents
David Byrd
LLMAG LM&Ro AIFin
150
1
0
22 Feb 2025
Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2025
Chuanguang Yang
Xinqiang Yu
Han Yang
Zhulin An
Chengqing Yu
Libo Huang
Yongjun Xu
302
11
0
22 Feb 2025
Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems
IEEE Conference on Decision and Control (CDC), 2024
Ehsan Sabouni
Hijaz Ahmad
Vittorio Giammarino
Christos G. Cassandras
I. Paschalidis
Wenchao Li
311
9
0
21 Feb 2025
Estimating Control Barriers from Offline Data
IEEE International Conference on Robotics and Automation (ICRA), 2025
Hongzhan Yu
Seth Farrell
Ryo Yoshimitsu
Zhizhen Qin
Henrik I. Christensen
Sicun Gao
OffRL
242
6
0
21 Feb 2025
PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization
Xinpeng Shou
215
0
0
21 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
476
9
0
21 Feb 2025
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
433
0
0
21 Feb 2025
Enhancing PPO with Trajectory-Aware Hybrid Policies
Qisai Liu
Zhanhong Jiang
Hsin-Jung Yang
Mahsa Khosravi
Joshua R. Waite
Soumik Sarkar
285
0
0
21 Feb 2025
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch
IEEE International Conference on Robotics and Automation (ICRA), 2023
Zhengrong Xue
H. Zhang
Jin Cheng
Zhengmao He
Yuanchen Ju
Chan-Yu Lin
Gu Zhang
Huazhe Xu
OffRL
376
16
0
20 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
263
1
0
18 Feb 2025
Communication Strategy on Macro-and-Micro Traffic State in Cooperative Deep Reinforcement Learning for Regional Traffic Signal Control
Hankang Gu
Shangbo Wang
Dongyao Jia
Yuli Zhang
Yanrong Luo
Guoqiang Mao
Jianping Wang
Eng Gee Lim
195
2
0
18 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
International Conference on Machine Learning (ICML), 2023
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
461
59
0
17 Feb 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Huiwen Zheng
Yunxin Liu
Feng Zhao
AI4CE
424
3
0
17 Feb 2025