Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
arXiv:1509.02971

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,795 papers shown
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Hyeongyu Kang
Jaewoo Lee
Woocheol Shin
Kiyoung Om
Jinkyoo Park
100
0
0
04 Dec 2025
World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations
Raul Steinmetz
Fabio Demo Rosa
V. A. Kich
J. A. Bottega
Ricardo B. Grando
D. T. Gamarra
3DV
377
0
0
03 Dec 2025
Deep Reinforcement Learning for Dynamic Algorithm Configuration: A Case Study on Optimizing OneMax with the (1+($λ$,$λ$))-GA
Tai Nguyen
Phong Le
André Biedenkapp
Carola Doerr
Nguyen Dang
53
0
0
03 Dec 2025
Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
Franki Nguimatsia Tiofack
Théotime Le Hellard
Fabian Schramm
Nicolas Perrin-Gilbert
Justin Carpentier
240
0
0
03 Dec 2025
GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies
Chubin Zhang
Zhenglin Wan
Feng Chen
Xingrui Yu
Ivor W. Tsang
Bo An
76
0
0
02 Dec 2025
On the Tension Between Optimality and Adversarial Robustness in Policy Optimization
Haoran Li
Jiayu Lv
Congying Han
Zicheng Zhang
Anqi Li
Y. Liu
Tiande Guo
Nan Jiang
AAML
138
0
0
01 Dec 2025
How Market Volatility Shapes Algorithmic Collusion: A Comparative Analysis of Learning-Based Pricing Algorithms
Aheer Sravon
Md. Ibrahim
Devdyuti Mazumder
Ridwan Al Aziz
24
0
0
01 Dec 2025
How do trout regulate patterns of muscle contraction to optimize propulsive efficiency during steady swimming
Tao Li
Chunze Zhang
Weiwei Yao
Junzhao He
Ji Hou
Qin Zhou
Lu Zhang
45
0
0
01 Dec 2025
Forecasting in Offline Reinforcement Learning for Non-stationary Environments
S. E. Ada
Georg Martius
Emre Ugur
Erhan Öztop
OffRL
169
0
0
01 Dec 2025
Algorithmic Guarantees for Distilling Supervised and Offline RL Datasets
Aaryan Gupta
Rishi Saket
A. Raghuveer
OffRL, DD
183
0
0
29 Nov 2025
Hardware-Software Collaborative Computing of Photonic Spiking Reinforcement Learning for Robotic Continuous Control
Mengting Yu
Shuiying Xiang
Changjian Xie
Yonghang Chen
Haowen Zhao
Xingxing Guo
Yahui Zhang
Yanan Han
Yue Hao
81
0
0
29 Nov 2025
Fault-Tolerant MARL for CAVs under Observation Perturbations for Highway On-Ramp Merging
Yuchen Shi
Huaxin Pei
Y. Zhang
Danya Yao
AAML
222
0
0
28 Nov 2025
Safe and Sustainable Electric Bus Charging Scheduling with Constrained Hierarchical DRL
IEEE Transactions on Vehicular Technology (IEEE Trans. Veh. Technol.), 2025
Jiaju Qi
Lei Lei
Thorsteinn Jonsson
Dusit Niyato
31
0
0
25 Nov 2025
Reinforcing Action Policies by Prophesying
Jiahui Zhang
Ze Huang
Chun Gu
Zipei Ma
Li Zhang
233
1
0
25 Nov 2025
Learning Massively Multitask World Models for Continuous Control
Nicklas Hansen
Hao Su
Xiaolong Wang
OffRL, CLL, LM&Ro
528
0
0
24 Nov 2025
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
Xin Yuan
S. Li
Jiateng Wei
Chengrui Zhu
Yanming Wu
Qingpeng Li
Jiajun Lv
Xiaoke Lan
Jun Chen
Yong-Jin Liu
OffRL
373
0
0
24 Nov 2025
Multi-Agent Cross-Entropy Method with Monotonic Nonlinear Critic Decomposition
Yan Wang
Ke Deng
Yongli Ren
159
0
0
24 Nov 2025
First-order Sobolev Reinforcement Learning
Fabian Schramm
Nicolas Perrin-Gilbert
Justin Carpentier
57
0
0
24 Nov 2025
General Agentic Memory Via Deep Research
B.Y. Yan
Chaofan Li
Hongjin Qian
Shuqi Lu
Zheng Liu
45
0
0
23 Nov 2025
A Reinforcement Learning Framework for Resource Allocation in Uplink Carrier Aggregation in the Presence of Self Interference
IEEE Transactions on Machine Learning in Communications and Networking (IEEE TMLCN), 2025
Jaswanth Bodempudi
Batta Siva Sairam
Madepalli Haritha
Sandesh Rao Mattu
Ananthanarayanan Chockalingam
59
0
0
22 Nov 2025
MOMA-AC: A preference-driven actor-critic framework for continuous multi-objective multi-agent reinforcement learning
Neurocomputing, 2025
Adam Callaghan
Karl Mason
Patrick Mannion
AI4CE
98
0
0
22 Nov 2025
Limitations of Scalarisation in MORL: A Comparative Study in Discrete Environments
Muhammad Saóod Shah
Asad Jeewa
138
0
0
20 Nov 2025
A Hybrid Proactive And Predictive Framework For Edge Cloud Resource Management
Hrikshesh Kumar
Anika Garg
Anshul Gupta
Yashika Agarwal
180
0
0
20 Nov 2025
Mitigating Estimation Bias with Representation Learning in TD Error-Driven Regularization
Haohui Chen
Zhiyong Chen
Aoxiang Liu
Wentuo Fang
127
0
0
20 Nov 2025
Stabilizing Policy Gradient Methods via Reward Profiling
Shihab Ahmed
El Houcine Bergou
A. Dutta
Yue Wang
196
0
0
20 Nov 2025
Revisiting Fairness-aware Interactive Recommendation: Item Lifecycle as a Control Knob
Yun Lu
Xiaoyu Shi
Hong Xie
Chongjun Xia
Zhenhui Gong
Mingsheng Shang
73
0
0
20 Nov 2025
Socially aware navigation for mobile robots: a survey on deep reinforcement learning approaches
Ibrahim Khalil Kabir
Muhammad Faizan Mysorewala
81
0
0
18 Nov 2025
DeepSport: A Multimodal Large Language Model for Comprehensive Sports Video Reasoning via Agentic Reinforcement Learning
Junbo Zou
Haotian Xia
Zhen Ye
Shengjie Zhang
Christopher Lai
Vicente Ordonez
Weining Shen
Hanjie Chen
AI4TS, ReLM, LRM
168
0
0
17 Nov 2025
NFQ2.0: The CartPole Benchmark Revisited
Sascha Lange
Roland Hafner
Martin Riedmiller
74
0
0
16 Nov 2025
Goal-Oriented Multi-Agent Reinforcement Learning for Decentralized Agent Teams
Hung Du
Hy Nguyen
Srikanth Thudumu
Rajesh Vasa
K. Mouzakis
56
0
0
15 Nov 2025
Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy
International Conference on AI in Finance (ICAIF), 2020
Hongyang Yang
Xiao-Yang Liu
Shan Zhong
A. Walid
AIFin
288
270
0
15 Nov 2025
Reinforcement Learning for Charging Optimization of Inhomogeneous Dicke Quantum Batteries
Xiaobin Song
Siyuan Bai
Da-Wei Wang
Hanxiao Tao
Xizhe Wang
Rebing Wu
Benben Jiang
32
0
0
15 Nov 2025
DemoTuner: Automatic Performance Tuning for Database Management Systems Based on Demonstration Reinforcement Learning
Hui Dou
Lei Jin
Yuxuan Zhou
Jiang He
Yiwen Zhang
Zibin Zheng
OffRL
225
0
0
13 Nov 2025
PrefPoE: Advantage-Guided Preference Fusion for Learning Where to Explore
Zhihao Lin
Lin Wu
Zhen Tian
Jianglin Lan
114
0
0
11 Nov 2025
Dynamic Sparsity: Challenging Common Sparsity Assumptions for Learning World Models in Robotic Reinforcement Learning Benchmarks
Muthukumar Pandaram
Jakob J. Hollenstein
David Drexel
Samuele Tosatto
A. Rodríguez-Sánchez
J. Piater
CML
199
0
0
11 Nov 2025
Statistically Assuring Safety of Control Systems using Ensembles of Safety Filters and Conformal Prediction
Ihab Tabbara
Yuxuan Yang
Hussein Sibai
128
0
0
11 Nov 2025
Multistep Quasimetric Learning for Scalable Goal-conditioned Reinforcement Learning
Bill Chunyuan Zheng
Vivek Myers
Benjamin Eysenbach
Sergey Levine
OffRL
194
0
0
11 Nov 2025
On Geometric Structures for Policy Parameterization in Continuous Control
Zhihao Lin
245
0
0
11 Nov 2025
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
Sayambhu Sen
Shalabh Bhatnagar
101
0
0
10 Nov 2025
Shocks Under Control: Taming Transonic Compressible Flow over an RAE2822 Airfoil with Deep Reinforcement Learning
Trishit Mondal
Ricardo Vinuesa
Ameya D. Jagtap
AI4CE
100
0
0
10 Nov 2025
Cross-Platform Learnable Fuzzy Gain-Scheduled Proportional-Integral-Derivative Controller Tuning via Physics-Constrained Meta-Learning and Reinforcement Learning Adaptation
JiaHao Wu
ShengWen Yu
AI4CE
312
0
0
09 Nov 2025
Towards Personalized Quantum Federated Learning for Anomaly Detection
IEEE Transactions on Network Science and Engineering (IEEE TNS&E), 2025
Ratun Rahman
Sina Shaham
Dinh C. Nguyen
164
1
0
08 Nov 2025
Distributionally Robust Self Paced Curriculum Reinforcement Learning
Anirudh Satheesh
Keenan Powell
Vaneet Aggarwal
OOD, OffRL
493
0
0
07 Nov 2025
Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances
Iason Chrysomallis
Georgios Chalkiadakis
OOD
240
0
0
05 Nov 2025
Tensor-Efficient High-Dimensional Q-learning
Junyi Wu
Dan Li
OffRL
94
0
0
05 Nov 2025
Going Beyond Expert Performance via Deep Implicit Imitation Reinforcement Learning
Iason Chrysomallis
Georgios Chalkiadakis
OffRL
126
0
0
05 Nov 2025
Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs
Georgios Tzannetos
Parameswaran Kamalaruban
Adish Singla
148
1
0
04 Nov 2025
Optimizing Electric Vehicle Charging Station Placement Using Reinforcement Learning and Agent-Based Simulations
Minh-Duc Nguyen
Dung D. Le
Phi Long Nguyen
56
0
0
03 Nov 2025
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Ziqi Wang
Jiashun Liu
L. Pan
230
0
0
03 Nov 2025
ABIDES-MARL: A Multi-Agent Reinforcement Learning Environment for Endogenous Price Formation and Execution in a Limit Order Book
Patrick Cheridito
Jean-Loup Dupret
Zhexin Wu
236
1
0
03 Nov 2025