Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
arXiv:1509.02971

Papers citing "Continuous control with deep reinforcement learning"

50 / 3,214 papers shown
Title
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
35
0
0
28 Feb 2025
On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+(λ,λ))-GA
Tai Nguyen
Phong Le
André Biedenkapp
Carola Doerr
Nguyen Dang
42
0
0
27 Feb 2025
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application
Thomas Hickling
Maxwell Hogan
Abdulla Tammam
Nabil Aouf
76
0
0
27 Feb 2025
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Maria Krinner
Elie Aljalbout
Angel Romero
Davide Scaramuzza
OffRL
76
1
0
27 Feb 2025
Highly Parallelized Reinforcement Learning Training with Relaxed Assignment Dependencies
Zhouyu He
Peng Qiao
Rongchun Li
Yong Dou
Yusong Tan
OffRL
59
0
0
27 Feb 2025
XSS Adversarial Attacks Based on Deep Reinforcement Learning: A Replication and Extension Study
Samuele Pasini
Gianluca Maragliano
Jinhan Kim
Paolo Tonella
AAML
40
0
0
26 Feb 2025
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting
Yifan Hu
Yuante Li
Peiyuan Liu
Yuxia Zhu
Naiqi Li
Tao Dai
Shu-Tao Xia
Dawei Cheng
Changjun Jiang
AI4TS
82
1
0
26 Feb 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
47
0
0
26 Feb 2025
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
Meng Feng
Viraj Parimi
B. Williams
77
1
0
25 Feb 2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
CML
46
0
0
24 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
70
1
0
24 Feb 2025
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
48
0
0
24 Feb 2025
A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding
Hamidreza Raei
Elena De Momi
Arash Ajoudani
41
0
0
24 Feb 2025
A Simulation Pipeline to Facilitate Real-World Robotic Reinforcement Learning Applications
Jefferson Silveira
Joshua A. Marshall
Sidney N. Givigi Jr
64
0
0
24 Feb 2025
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Fangqi Liu
Rishav Sen
J. P. Talusan
Ava Pettet
Aaron Kandel
Yoshinori Suzue
Ayan Mukhopadhyay
A. Dubey
OffRL
44
0
0
24 Feb 2025
Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
Chuanguang Yang
Xinqiang Yu
Han Yang
Zhulin An
Chengqing Yu
Libo Huang
Yongjun Xu
36
1
0
22 Feb 2025
Exploring Sentiment Manipulation by LLM-Enabled Intelligent Trading Agents
David Byrd
LLMAG
LM&Ro
AIFin
56
0
0
22 Feb 2025
Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems
Ehsan Sabouni
Hijaz Ahmad
Vittorio Giammarino
Christos G. Cassandras
I. Paschalidis
Wenchao Li
119
2
0
21 Feb 2025
Enhancing PPO with Trajectory-Aware Hybrid Policies
Qisai Liu
Zhanhong Jiang
Hsin-Jung Yang
Mahsa Khosravi
Joshua R. Waite
S. Sarkar
49
0
0
21 Feb 2025
Estimating Control Barriers from Offline Data
Hongzhan Yu
Seth Farrell
Ryo Yoshimitsu
Zhizhen Qin
Henrik I. Christensen
Sicun Gao
OffRL
58
3
0
21 Feb 2025
PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization
Xinpeng Shou
81
0
0
21 Feb 2025
ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch
Zhengrong Xue
H. Zhang
Jin Cheng
Zhengmao He
Yuanchen Ju
Chan-Yu Lin
Gu Zhang
Huazhe Xu
OffRL
101
9
0
20 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
82
0
0
18 Feb 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Huiwen Zheng
Yunxin Liu
Feng Zhao
AI4CE
62
0
0
17 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi Ma
DiffM
93
24
0
17 Feb 2025
Deep Reinforcement Learning-Based Bidding Strategies for Prosumers Trading in Double Auction-Based Transactive Energy Market
Jun Jiang
Yuanliang Li
Luyang Hou
Mohsen Ghafouri
Peng Zhang
Jun Yan
Yuhong Liu
46
0
0
16 Feb 2025
Task Offloading in Vehicular Edge Computing using Deep Reinforcement Learning: A Survey
Ashab Uddin
Ahmed Hamdi Sakr
Ning Zhang
OffRL
62
0
0
10 Feb 2025
Infinite-Horizon Value Function Approximation for Model Predictive Control
Armand Jordana
Sébastien Kleff
Arthur Haffemayer
Joaquim Ortiz de Haro
Justin Carpentier
Nicolas Mansard
Ludovic Righetti
41
0
0
10 Feb 2025
Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
Aurélien Renault
A. Bondu
Antoine Cornuéjols
Vincent Lemaire
49
0
0
10 Feb 2025
Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning
J. Brahmanage
Jiajing Ling
Akshat Kumar
42
0
0
08 Feb 2025
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation
Siyu Wang
Xiaocong Chen
Lina Yao
CML
OffRL
93
0
0
04 Feb 2025
Circular Microalgae-Based Carbon Control for Net Zero
Federico Zocco
Joan García
W. Haddad
124
0
0
04 Feb 2025
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
Zelai Xu
Chao Yu
Huining Yuan
Xiangmin Yi
...
Wenhao Tang
Yu Wang
Wenbo Ding
Xiusi Chen
148
0
0
04 Feb 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Rudolf Reiter
Jasper Hoffmann
D. Reinhardt
Florian Messerer
Katrin Baumgärtner
Shamburaj Sawant
Joschka Boedecker
Moritz Diehl
S. Gros
89
5
0
04 Feb 2025
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Federico Malato
Ville Hautamaki
42
0
0
03 Feb 2025
HuViDPO: Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
71
1
0
02 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu Wang
OffRL
76
0
0
01 Feb 2025
On-Line Learning for Planning and Control of Underactuated Robots with Uncertain Dynamics
Giulio Turrisi
Marco Capotondi
C. Gaz
Valerio Modugno
Giuseppe Oriolo
Alessandro De Luca
35
8
0
30 Jan 2025
Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information
Jinghai He
Cheng Hua
Chunyang Zhou
Zeyu Zheng
AIFin
48
1
0
29 Jan 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
60
2
0
29 Jan 2025
Contextual Knowledge Sharing in Multi-Agent Reinforcement Learning with Decentralized Communication and Coordination
Hung Du
Srikanth Thudumu
Hy Nguyen
Rajesh Vasa
K. Mouzakis
37
0
0
28 Jan 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. D'Oro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
46
3
0
28 Jan 2025
FuzzyLight: A Robust Two-Stage Fuzzy Approach for Traffic Signal Control Works in Real Cities
Mingyuan Li
Jiahao Wang
Bo Du
Jun Shen
Qiang Wu
59
1
0
28 Jan 2025
UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning
Oubo Ma
L. Du
Yang Dai
Chunyi Zhou
Qingming Li
Yuwen Pu
Shouling Ji
48
0
0
28 Jan 2025
Data Duplication: A Novel Multi-Purpose Attack Paradigm in Machine Unlearning
Dayong Ye
Tianqing Zhu
Junlong Li
Kun Gao
B. Liu
Lefei Zhang
Wanlei Zhou
Yujian Zhang
AAML
MU
80
0
0
28 Jan 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
87
1
0
28 Jan 2025
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning
Bowen Zheng
Ran Cheng
Kay Chen Tan
47
0
0
25 Jan 2025
ABPT: Amended Backpropagation through Time with Partially Differentiable Rewards
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
39
0
0
24 Jan 2025
TrueReason: An Exemplar Personalised Learning System Integrating Reasoning with Foundational Models
Sahan Bulathwela
Daniel Van Niekerk
Jarrod Shipton
Maria Perez-Ortiz
Benjamin Rosman
John Shawe-Taylor
LRM
36
0
0
23 Jan 2025
Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments
Wessel Ledder
Yuzhen Qin
Kiki van der Heijden
108
0
0
20 Jan 2025