ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning
v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXiv (abs)PDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,795 papers shown
Application of Deep Reinforcement Learning to At-the-Money S&P 500 Options Hedging
Application of Deep Reinforcement Learning to At-the-Money S&P 500 Options HedgingWorking papers (WP), 2025
Zofia Bracha
Paweł Sakowski
Jakub Michañków
AIFin
190
0
0
10 Oct 2025
Hierarchical Semantic RL: Tackling the Problem of Dynamic Action Space for RL-based Recommendations
Hierarchical Semantic RL: Tackling the Problem of Dynamic Action Space for RL-based Recommendations
Minmao Wang
Xingchen Liu
Shijie Yi
Likang Wu
Hongke Zhao
Fei Pan
Qingpeng Cai
Peng Jiang
133
0
0
10 Oct 2025
Control Synthesis of Cyber-Physical Systems for Real-Time Specifications through Causation-Guided Reinforcement Learning
Control Synthesis of Cyber-Physical Systems for Real-Time Specifications through Causation-Guided Reinforcement Learning
Xiaochen Tang
Zhenya Zhang
Miaomiao Zhang
Jie An
85
0
0
09 Oct 2025
Energy-Guided Diffusion Sampling for Long-Term User Behavior Prediction in Reinforcement Learning-based Recommendation
Energy-Guided Diffusion Sampling for Long-Term User Behavior Prediction in Reinforcement Learning-based Recommendation
Xiaocong Chen
Siyu Wang
Lina Yao
OffRL
104
0
0
09 Oct 2025
GRADE: Personalized Multi-Task Fusion via Group-relative Reinforcement Learning with Adaptive Dirichlet Exploration
GRADE: Personalized Multi-Task Fusion via Group-relative Reinforcement Learning with Adaptive Dirichlet Exploration
Tingfeng Hong
Pingye Ren
Xinlong Xiao
C. Wang
Chenyi Lei
Wenwu Ou
Han Li
167
0
0
09 Oct 2025
Maximum In-Support Return Modeling for Dynamic Recommendation with Language Model Prior
Maximum In-Support Return Modeling for Dynamic Recommendation with Language Model Prior
Xiaocong Chen
Siyu Wang
Lina Yao
OffRLAI4TS
88
0
0
09 Oct 2025
Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-Functions
Local Reinforcement Learning with Action-Conditioned Root Mean Squared Q-Functions
Frank Wu
Mengye Ren
155
0
0
08 Oct 2025
What You Don't Know Can Hurt You: How Well do Latent Safety Filters Understand Partially Observable Safety Constraints?
What You Don't Know Can Hurt You: How Well do Latent Safety Filters Understand Partially Observable Safety Constraints?
Matthew Kim
Kensuke Nakamura
Andrea V. Bajcsy
103
0
0
07 Oct 2025
Controllable Audio-Visual Viewpoint Generation from 360° Spatial Information
Controllable Audio-Visual Viewpoint Generation from 360° Spatial Information
Christian Marinoni
R. F. Gramaccioni
Eleonora Grassucci
Danilo Comminiello
VGen
148
0
0
07 Oct 2025
DREAMer-VXS: A Latent World Model for Sample-Efficient AGV Exploration in Stochastic, Unobserved Environments
DREAMer-VXS: A Latent World Model for Sample-Efficient AGV Exploration in Stochastic, Unobserved Environments
Agniprabha Chakraborty
55
0
0
06 Oct 2025
HOFLON: Hybrid Offline Learning and Online Optimization for Process Start-Up and Grade-Transition Control
HOFLON: Hybrid Offline Learning and Online Optimization for Process Start-Up and Grade-Transition Control
Alex Durkin
Jasper Stolte
Mehmet Mercangöz
OffRLOnRL
262
0
0
04 Oct 2025
Unsupervised Transformer Pre-Training for Images: Self-Distillation, Mean Teachers, and Random Crops
Unsupervised Transformer Pre-Training for Images: Self-Distillation, Mean Teachers, and Random Crops
Mattia Scardecchia
ViT
166
0
0
04 Oct 2025
Physics-informed Neural-operator Predictive Control for Drag Reduction in Turbulent Flows
Physics-informed Neural-operator Predictive Control for Drag Reduction in Turbulent Flows
Zelin Zhao
Zongyi Li
Kimia Hassibi
Kamyar Azizzadenesheli
Junchi Yan
H. J. Bae
Di Zhou
Anima Anandkumar
AI4CE
114
0
0
03 Oct 2025
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
Lunjun Zhang
Shuo Han
Hanrui Lyu
Bradly C. Stadie
OffRL
263
1
0
03 Oct 2025
ExGRPO: Learning to Reason from Experience
ExGRPO: Learning to Reason from Experience
Runzhe Zhan
Yafu Li
Zhi Wang
Xiaoye Qu
Dongrui Liu
Jing Shao
Derek F. Wong
Yu Cheng
OffRLLRM
145
1
1
02 Oct 2025
From Pixels to Factors: Learning Independently Controllable State Variables for Reinforcement Learning
From Pixels to Factors: Learning Independently Controllable State Variables for Reinforcement Learning
Rafael Rodríguez-Sánchez
Cameron Allen
George Konidaris
OffRL
184
2
0
02 Oct 2025
Improved Robustness of Deep Reinforcement Learning for Control of Time-Varying Systems by Bounded Extremum Seeking
Improved Robustness of Deep Reinforcement Learning for Control of Time-Varying Systems by Bounded Extremum Seeking
Shaifalee Saxena
Alan Williams
Rafael Fierro
A. Scheinker
OOD
85
0
0
02 Oct 2025
Conflict-Based Search as a Protocol: A Multi-Agent Motion Planning Protocol for Heterogeneous Agents, Solvers, and Independent Tasks
Conflict-Based Search as a Protocol: A Multi-Agent Motion Planning Protocol for Heterogeneous Agents, Solvers, and Independent Tasks
Rishi Veerapaneni
Alvin Tang
Haodong He
Sophia Zhao
Viraj Shah
...
Gabriel Olin
Jon Arrizabalaga
Yorai Shaoul
Jiaoyang Li
Maxim Likhachev
92
1
0
01 Oct 2025
Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method
Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method
Andy Wu
Chun-Cheng Lin
Rung-Tzuo Liaw
Yuehua Huang
Chihjung Kuo
Chia Tong Weng
92
0
0
01 Oct 2025
Constant in an Ever-Changing World
Constant in an Ever-Changing World
Andy Wu
Chun-Cheng Lin
Yuehua Huang
Rung-Tzuo Liaw
CLL
60
0
0
01 Oct 2025
Deep Reinforcement Learning-Based Precoding for Multi-RIS-Aided Multiuser Downlink Systems with Practical Phase Shift
Deep Reinforcement Learning-Based Precoding for Multi-RIS-Aided Multiuser Downlink Systems with Practical Phase ShiftIEEE Wireless Communications Letters (WCL), 2025
Po-Heng Chou
Bo-Ren Zheng
Wan-Jen Huang
Walid Saad
Yu Tsao
Ronald Y. Chang
76
8
0
30 Sep 2025
Accelerating Transformers in Online RL
Accelerating Transformers in Online RL
Daniil Zelezetsky
A. Kovalev
Aleksandr I. Panov
OffRL
143
0
0
30 Sep 2025
Diversity-Incentivized Exploration for Versatile Reasoning
Diversity-Incentivized Exploration for Versatile Reasoning
Zican Hu
Shilin Zhang
Yafu Li
Jianhao Yan
Xuyang Hu
Leyang Cui
Xiaoye Qu
C. L. Philip Chen
Yu Cheng
Zhi Wang
LRM
146
2
0
30 Sep 2025
DyMoDreamer: World Modeling with Dynamic Modulation
DyMoDreamer: World Modeling with Dynamic Modulation
Boxuan Zhang
Runqing Wang
Wei Xiao
Weipu Zhang
Jian Sun
Gao Huang
Jie Chen
Gang Wang
144
0
0
29 Sep 2025
Polychromic Objectives for Reinforcement Learning
Polychromic Objectives for Reinforcement Learning
Jubayer Ibn Hamid
Ifdita Hasan Orney
Ellen Xu
Chelsea Finn
Dorsa Sadigh
OffRL
104
1
0
29 Sep 2025
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Longxiang He
Deheng Ye
Junbo Tan
Xueqian Wang
Li Shen
OnRL
310
0
0
29 Sep 2025
Safe In-Context Reinforcement Learning
Safe In-Context Reinforcement Learning
Amir Moeini
Minjae Kwon
Alper Kamil Bozkurt
Yuichi Motai
Rohan Chandra
Lu Feng
Shangtong Zhang
OffRL
135
1
0
29 Sep 2025
An Investigation of Batch Normalization in Off-Policy Actor-Critic Algorithms
An Investigation of Batch Normalization in Off-Policy Actor-Critic Algorithms
Li Wang
Sudun
X. Zhang
Wenjun Wu
Lei Huang
OffRL
153
0
0
28 Sep 2025
Bridging Discrete and Continuous RL: Stable Deterministic Policy Gradient with Martingale Characterization
Bridging Discrete and Continuous RL: Stable Deterministic Policy Gradient with Martingale Characterization
Ziheng Cheng
Xin Guo
Yufei Zhang
OffRL
97
0
0
28 Sep 2025
Continuous-Time Reinforcement Learning for Asset-Liability Management
Continuous-Time Reinforcement Learning for Asset-Liability Management
Yilie Huang
76
0
0
27 Sep 2025
From Parameters to Behavior: Unsupervised Compression of the Policy Space
From Parameters to Behavior: Unsupervised Compression of the Policy Space
Davide Tenedini
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
136
1
0
26 Sep 2025
Functional Critics Are Essential in Off-Policy Actor-Critic: Provable Convergence and Efficient Exploration
Functional Critics Are Essential in Off-Policy Actor-Critic: Provable Convergence and Efficient Exploration
Qinxun Bai
Yuxuan Han
Wei Xu
Zhengyuan Zhou
OffRL
158
0
0
26 Sep 2025
The Use of the Simplex Architecture to Enhance Safety in Deep-Learning-Powered Autonomous Systems
The Use of the Simplex Architecture to Enhance Safety in Deep-Learning-Powered Autonomous Systems
F. Nesti
Niko Salamini
Mauro Marinoni
Giorgio Maria Cicero
Gabriele Serra
Alessandro Biondi
Giorgio Buttazzo
173
0
0
25 Sep 2025
Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning
Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning
Gawon Lee
Daesol Cho
H. J. Kim
200
0
0
25 Sep 2025
Analysis of approximate linear programming solution to Markov decision problem with log barrier function
Analysis of approximate linear programming solution to Markov decision problem with log barrier function
Donghwan Lee
Hyukjun Yang
Bum Geun Park
142
0
0
24 Sep 2025
Frictional Q-Learning
Frictional Q-Learning
Hyunwoo Kim
Hyo Kyung Lee
OffRL
153
0
0
24 Sep 2025
Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains
Memory-Augmented Potential Field Theory: A Framework for Adaptive Control in Non-Convex Domains
Dongzhe Zheng
Wenjie Mei
109
0
0
24 Sep 2025
AnySafe: Adapting Latent Safety Filters at Runtime via Safety Constraint Parameterization in the Latent Space
AnySafe: Adapting Latent Safety Filters at Runtime via Safety Constraint Parameterization in the Latent Space
Sankalp Agrawal
Junwon Seo
Kensuke Nakamura
Ran Tian
Andrea V. Bajcsy
116
1
0
23 Sep 2025
Residual Off-Policy RL for Finetuning Behavior Cloning Policies
Residual Off-Policy RL for Finetuning Behavior Cloning Policies
Lars Ankile
Zhenyu Jiang
Rocky Duan
Guanya Shi
Pieter Abbeel
Anusha Nagabandi
OffRL
221
4
0
23 Sep 2025
SOE: Sample-Efficient Robot Policy Self-Improvement via On-Manifold Exploration
SOE: Sample-Efficient Robot Policy Self-Improvement via On-Manifold Exploration
Yang Jin
Jun Lv
Han Xue
Wendi Chen
Chuan Wen
Cewu Lu
177
0
0
23 Sep 2025
EigenSafe: A Spectral Framework for Learning-Based Stochastic Safety Filtering
EigenSafe: A Spectral Framework for Learning-Based Stochastic Safety Filtering
Inkyu Jang
Jonghae Park
Chams E. Mballo
Sihyun Cho
Claire J. Tomlin
H. J. Kim
111
0
0
22 Sep 2025
Fast Trajectory Planner with a Reinforcement Learning-based Controller for Robotic Manipulators
Fast Trajectory Planner with a Reinforcement Learning-based Controller for Robotic ManipulatorsEngineering applications of artificial intelligence (EAAI), 2025
Yongliang Wang
Hamidreza Kasaei
112
0
0
22 Sep 2025
MCP: A Control-Theoretic Orchestration Framework for Synergistic Efficiency and Interpretability in Multimodal Large Language Models
MCP: A Control-Theoretic Orchestration Framework for Synergistic Efficiency and Interpretability in Multimodal Large Language Models
Luyan Zhang
84
0
0
20 Sep 2025
HypeMARL: Multi-Agent Reinforcement Learning For High-Dimensional, Parametric, and Distributed Systems
HypeMARL: Multi-Agent Reinforcement Learning For High-Dimensional, Parametric, and Distributed Systems
N. Botteghi
Matteo Tomasetto
Urban Fasel
Francesco Braghin
Andrea Manzoni
136
0
0
20 Sep 2025
GP3: A 3D Geometry-Aware Policy with Multi-View Images for Robotic Manipulation
GP3: A 3D Geometry-Aware Policy with Multi-View Images for Robotic Manipulation
Quanhao Qian
Guoyang Zhao
Gongjie Zhang
Jiuniu Wang
Ran Xu
Junlong Gao
Deli Zhao
132
3
0
19 Sep 2025
Accelerating Atomic Fine Structure Determination with Graph Reinforcement Learning
Accelerating Atomic Fine Structure Determination with Graph Reinforcement Learning
M. Ding
V.-A. Darvariu
A. N. Ryabtsev
N. Hawes
J. C. Pickering
98
0
0
19 Sep 2025
Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future Prospects
Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future ProspectsProceedings of the IEEE (Proc. IEEE), 2025
Le Zhang
Ao Li
Qibin Hou
Ce Zhu
Yonina C. Eldar
SupR
285
1
0
19 Sep 2025
Designing Latent Safety Filters using Pre-Trained Vision Models
Designing Latent Safety Filters using Pre-Trained Vision Models
Ihab Tabbara
Yuxuan Yang
Ahmad Hamzeh
Maxwell Astafyev
Hussein Sibai
81
1
0
18 Sep 2025
Safe Reinforcement Learning using Action Projection: Safeguard the Policy or the Environment?
Safe Reinforcement Learning using Action Projection: Safeguard the Policy or the Environment?
Hannah Markgraf
Shamburaj Sawant
Hanna Krasowski
Lukas Schäfer
S. Gros
Matthias Althoff
OffRL
156
0
0
16 Sep 2025
CORB-Planner: Corridor as Observations for RL Planning in High-Speed Flight
CORB-Planner: Corridor as Observations for RL Planning in High-Speed Flight
Y. Zhang
Bin Gao
G. Wang
Jian Sun
Zhuo Li
90
0
0
14 Sep 2025
Previous
123456...949596
Next