ResearchTrend.AI
Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
arXiv: 1509.02971

Papers citing "Continuous control with deep reinforcement learning"

50 / 3,188 papers shown
Embedded Mean Field Reinforcement Learning for Perimeter-defense Game
Li Wang
Xin Yu
Xuxin Lv
Gangzheng Ai
Wenjun Wu
AAML
7
0
0
20 May 2025
Multi-parameter Control for the (1+(λ,λ))-GA on OneMax via Deep Reinforcement Learning
Tai Nguyen
Phong Le
Carola Doerr
Nguyen Dang
24
0
0
19 May 2025
A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut
Gabriel Malikal
Ismail R. Alkhouri
Alvaro Velasquez
Adam M Alessio
S. Ravishankar
2
0
0
19 May 2025
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
Gang Li
Ming Lin
Tomer Galanti
Zhengzhong Tu
Tianbao Yang
14
0
0
18 May 2025
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Georgiy Malaniya
Anton Bolychev
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
12
0
0
18 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
12
0
0
17 May 2025
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks
Feiran You
Hongyang Du
OffRL
LRM
22
0
0
16 May 2025
GLOVA: Global and Local Variation-Aware Analog Circuit Design with Risk-Sensitive Reinforcement Learning
Dongjun Kim
Junwoo Park
Chaehyeon Shin
Jaeheon Jung
Kyungho Shin
...
Sanghyuk Heo
Woongrae Kim
Inchul Jeong
Joohwan Cho
Jongsun Park
17
0
0
16 May 2025
Bi-Level Policy Optimization with Nyström Hypergradients
Arjun Prakash
Naicheng He
Denizalp Goktas
Amy Greenwald
4
0
0
16 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
12
0
0
16 May 2025
Zero-Shot Visual Generalization in Robot Manipulation
Sumeet Batra
Gaurav Sukhatme
14
0
0
16 May 2025
Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Wenrui Cai
Chengyu Wang
Junbing Yan
Jun Huang
Xiangzhong Fang
LRM
19
0
0
16 May 2025
Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation
Xinrui Wang
Yan Jin
29
0
0
15 May 2025
Modular Robot Control with Motor Primitives
Moses C. Nah
Johannes Lachner
Neville Hogan
21
0
0
15 May 2025
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation
Haoxiang You
Yilang Liu
Ian Abraham
12
0
0
15 May 2025
Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections
Pankaj Kumar
Aditya Mishra
Pranamesh Chakraborty
Subrahmanya Swamy Peruru
24
0
0
13 May 2025
Adaptive Security Policy Management in Cloud Environments Using Reinforcement Learning
Muhammad Saqib
Dipkumar Mehta
Fnu Yashu
Shubham Malhotra
24
0
0
13 May 2025
MA-ROESL: Motion-aware Rapid Reward Optimization for Efficient Robot Skill Learning from Single Videos
Xueliang Wang
Xinming Zhang
Yanjun Chen
Xiaoyu Shen
Wei Zhang
29
0
0
13 May 2025
Deep Reinforcement Learning for Power Grid Multi-Stage Cascading Failure Mitigation
Bo Meng
Chenghao Xu
Yongli Zhu
AI4CE
14
0
0
13 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian Sun
Gang Wang
AI4CE
60
1
0
08 May 2025
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Zhifeng Hu
Chong Han
46
0
0
08 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Baida Zhang
Yakai Chen
Huichun Li
Zhenghu Zu
34
0
0
07 May 2025
Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning
Hao Peng
Xiang Huang
Shuo Sun
Ruitong Zhang
Philip S. Yu
48
0
0
07 May 2025
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Abdulaziz Almuzairee
Rohan Patil
Dwait Bhatt
Henrik I. Christensen
34
0
0
07 May 2025
Joint Resource Management for Energy-efficient UAV-assisted SWIPT-MEC: A Deep Reinforcement Learning Approach
Yue Chen
Hui Kang
Jiahui Li
Geng Sun
Boxiong Wang
Jiacheng Wang
Cong Liang
Shuang Liang
Dusit Niyato
49
0
0
06 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
47
0
0
06 May 2025
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
Jake Grigsby
Yuke Zhu
Michael S Ryoo
Juan Carlos Niebles
OffRL
VLM
41
0
0
06 May 2025
Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning
Changxin Huang
Junyang Liang
Yanbin Chang
Jingzhao Xu
Jianqiang Li
34
0
0
05 May 2025
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Hele Zhu
Xinyi Huang
Haojia Gao
Mengfei Jiang
Haohua Que
Lei Mu
32
0
0
05 May 2025
Universal Approximation Theorem of Deep Q-Networks
Qian Qi
45
1
0
04 May 2025
Resolving Conflicting Constraints in Multi-Agent Reinforcement Learning with Layered Safety
Jason J. Choi
Jasmine Jerry Aloor
Jingqi Li
Maria G. Mendoza
H. Balakrishnan
Claire J. Tomlin
31
0
0
04 May 2025
Wasserstein Policy Optimization
David Pfau
Ian Davies
Diana Borsa
Joao G. M. Araujo
Brendan D. Tracey
H. V. Hasselt
29
0
0
01 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
204
0
0
01 May 2025
A General Approach of Automated Environment Design for Learning the Optimal Power Flow
Thomas Wolgast
Astrid Nieße
AI4CE
26
0
0
01 May 2025
SimPRIVE: a Simulation framework for Physical Robot Interaction with Virtual Environments
F. Nesti
G. D’Amico
Mauro Marinoni
Giorgio Buttazzo
35
0
0
30 Apr 2025
Investigating Adaptive Tuning of Assistive Exoskeletons Using Offline Reinforcement Learning: Challenges and Insights
Yasin Findik
Christopher Coco
Reza Azadeh
OffRL
26
0
0
30 Apr 2025
Model-based controller assisted domain randomization in deep reinforcement learning: application to nonlinear powertrain control
Heisei Yonezawa
Ansei Yonezawa
Itsuro Kajiwara
49
0
0
28 Apr 2025
Data-Assimilated Model-Based Reinforcement Learning for Partially Observed Chaotic Flows
D. E. Ozan
Andrea Nóvoa
Luca Magri
AI4CE
19
0
0
23 Apr 2025
Policy-Based Radiative Transfer: Solving the 2-Level Atom Non-LTE Problem using Soft Actor-Critic Reinforcement Learning
Brandon Panos
Ivan Milic
OffRL
23
0
0
22 Apr 2025
Grasping Deformable Objects via Reinforcement Learning with Cross-Modal Attention to Visuo-Tactile Inputs
Yonghyun Lee
Sungeun Hong
Min-gu Kim
Gyeonghwan Kim
Changjoo Nam
26
0
0
22 Apr 2025
CaRoSaC: A Reinforcement Learning-Based Kinematic Control of Cable-Driven Parallel Robots by Addressing Cable Sag through Simulation
Rohit Dhakate
Thomas Jantos
Eren Allak
Stephan Weiss
J. Steinbrener
44
0
0
22 Apr 2025
Autonomous Control of Redundant Hydraulic Manipulator Using Reinforcement Learning with Action Feedback
Rohit Dhakate
Christian Brommer
C. Böhm
Stephan Weiss
J. Steinbrener
36
5
0
22 Apr 2025
Learning to Reason under Off-Policy Guidance
Jianhao Yan
Yafu Li
Zican Hu
Zhi Wang
Ganqu Cui
Xiaoye Qu
Yu Cheng
Yue Zhang
OffRL
LRM
44
1
0
21 Apr 2025
Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments?
Xinglei Dou
Lei Liu
Limin Xiao
VLM
43
0
0
21 Apr 2025
Accelerating Visual Reinforcement Learning with Separate Primitive Policy for Peg-in-Hole Tasks
Zichun Xu
Zhaomin Wang
Yuntao Li
Lei Zhuang
Zhiyuan Zhao
Guocai Yang
Jingdong Zhao
31
0
0
21 Apr 2025
Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
Ahsan Bilal
Muhammad Ahmed Mohsin
Muhammad Umer
Muhammad Awais Khan Bangash
Muhammad Ali Jamshed
LLMAG
LRM
AI4CE
58
0
0
20 Apr 2025
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
204
0
0
19 Apr 2025
QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?
Zhouyang Jiang
Bin Zhang
Airong Wei
Zhiwei Xu
OffRL
37
0
0
17 Apr 2025
High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion
Libo Zhang
Yongsheng Yu
Jiali Yao
Heng Fan
48
0
0
17 Apr 2025
Modelling Mean-Field Games with Neural Ordinary Differential Equations
Anna C. M. Thöni
Yoram Bachrach
Tal Kachman
38
0
0
17 Apr 2025