ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXivPDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 3,243 papers shown
Title
Human-Readable Programs as Actors of Reinforcement Learning Agents Using
  Critic-Moderated Evolution
Human-Readable Programs as Actors of Reinforcement Learning Agents Using Critic-Moderated Evolution
Senne Deproost
Denis Steckelmacher
Ann Nowé
41
0
0
29 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
44
0
0
27 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
41
1
0
27 Oct 2024
MILES: Making Imitation Learning Easy with Self-Supervision
MILES: Making Imitation Learning Easy with Self-Supervision
Georgios Papagiannis
Edward Johns
VLM
SSL
55
5
0
25 Oct 2024
Reinforcement Learning Controllers for Soft Robots using Learned
  Environments
Reinforcement Learning Controllers for Soft Robots using Learned Environments
Uljad Berdica
Matthew Jackson
Niccolò Enrico Veronese
Jakob Foerster
Perla Maiolino
DRL
16
1
0
24 Oct 2024
Learning Transparent Reward Models via Unsupervised Feature Selection
Learning Transparent Reward Models via Unsupervised Feature Selection
Daulet Baimukashev
G. Alcan
K. Luck
Ville Kyrki
SSL
OffRL
41
0
0
24 Oct 2024
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator
Stefanos Pasios
Nikos Nikolaidis
49
1
0
23 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
85
6
0
23 Oct 2024
Episodic Future Thinking Mechanism for Multi-agent Reinforcement
  Learning
Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning
Dongsu Lee
Minhae Kwon
31
1
0
22 Oct 2024
Safe Load Balancing in Software-Defined-Networking
Safe Load Balancing in Software-Defined-Networking
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
34
0
0
22 Oct 2024
Augmented Lagrangian-Based Safe Reinforcement Learning Approach for
  Distribution System Volt/VAR Control
Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control
Guibin Chen
OffRL
21
0
0
19 Oct 2024
GUIDE: Real-Time Human-Shaped Agents
GUIDE: Real-Time Human-Shaped Agents
Lingyu Zhang
Zhengran Ji
Nicholas R Waytowich
Boyuan Chen
39
2
0
19 Oct 2024
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh
  Smoothing
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
Zhichao Wang
Xinhai Chen
Chunye Gong
Bo Yang
Liang Deng
Yufei Sun
Yufei Pang
Jie Liu
AI4CE
35
0
0
19 Oct 2024
Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing
  Stock Selection and Execution
Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution
Zijie Zhao
Roy E. Welsch
AIFin
15
1
0
19 Oct 2024
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Zixuan Yang
Jiaqi Zheng
Guihai Chen
OffRL
36
0
0
19 Oct 2024
Online Reinforcement Learning with Passive Memory
Online Reinforcement Learning with Passive Memory
Anay Pattanaik
Lav R. Varshney
CLL
OffRL
28
0
0
18 Oct 2024
Streaming Deep Reinforcement Learning Finally Works
Streaming Deep Reinforcement Learning Finally Works
Mohamed Elsayed
Gautham Vasan
A. R. Mahmood
OffRL
54
4
0
18 Oct 2024
Deep Reinforcement Learning for Online Optimal Execution Strategies
Deep Reinforcement Learning for Online Optimal Execution Strategies
Alessandro Micheli
Mélodie Monod
OffRL
21
0
0
17 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
35
0
0
17 Oct 2024
RecoveryChaining: Learning Local Recovery Policies for Robust Manipulation
RecoveryChaining: Learning Local Recovery Policies for Robust Manipulation
Shivam Vats
Devesh K. Jha
Maxim Likhachev
Oliver Kroemer
Diego Romeres
OffRL
38
0
0
17 Oct 2024
Reinforcement Learning with Euclidean Data Augmentation for State-Based
  Continuous Control
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
Jinzhu Luo
Dingyang Chen
Qi Zhang
OffRL
26
0
0
16 Oct 2024
Dual Action Policy for Robust Sim-to-Real Reinforcement Learning
Dual Action Policy for Robust Sim-to-Real Reinforcement Learning
Ng Wen Zheng Terence
Chen Jianda
26
0
0
16 Oct 2024
The State of Robot Motion Generation
The State of Robot Motion Generation
Kostas E. Bekris
Joe H. Doerr
Patrick Meng
Sumanth Tangirala
3DV
41
2
0
16 Oct 2024
Counterfactual Generative Modeling with Variational Causal Inference
Counterfactual Generative Modeling with Variational Causal Inference
Yulun Wu
Louie McConnell
Claudia Iriondo
CML
BDL
27
0
0
16 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex
  Q-functions
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
21
0
0
15 Oct 2024
Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement
  Learning Approach
Solving The Dynamic Volatility Fitting Problem: A Deep Reinforcement Learning Approach
Emmanuel Gnabeyeu
Omar Karkar
Imad Idboufous
31
0
0
15 Oct 2024
Exploiting Risk-Aversion and Size-dependent fees in FX Trading with
  Fitted Natural Actor-Critic
Exploiting Risk-Aversion and Size-dependent fees in FX Trading with Fitted Natural Actor-Critic
Vito Alessandro Monaco
Antonio Riva
Luca Sabbioni
L. Bisi
Edoardo Vittori
Marco Pinciroli
Michele Trapletti
Marcello Restelli
14
0
0
15 Oct 2024
Learning Agents With Prioritization and Parameter Noise in Continuous
  State and Action Space
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space
Rajesh Mangannavar
Gopalakrishnan Srinivasaraghavan
25
2
0
15 Oct 2024
Action Gaps and Advantages in Continuous-Time Distributional
  Reinforcement Learning
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
Harley Wiltzer
Marc G. Bellemare
David Meger
Patrick Shafto
Yash Jhaveri
34
1
0
14 Oct 2024
Large Language Model Evaluation via Matrix Nuclear-Norm
Large Language Model Evaluation via Matrix Nuclear-Norm
Heng Chang
Tingyu Xia
Yi-Ju Chang
Yuan Wu
37
1
0
14 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement
  Learning
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
78
7
0
13 Oct 2024
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
37
0
0
12 Oct 2024
Can we hop in general? A discussion of benchmark selection and design
  using the Hopper environment
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
31
3
0
11 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
37
0
0
11 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
46
2
0
11 Oct 2024
POSEIDON : Efficient Function Placement at the Edge using Deep
  Reinforcement Learning
POSEIDON : Efficient Function Placement at the Edge using Deep Reinforcement Learning
Prakhar Jain
Prakhar Singhal
Divyansh Pandey
Giovanni Quatrocchi
Karthik Vaidhyanathan
26
0
0
10 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
44
3
0
10 Oct 2024
Deep End-to-End Survival Analysis with Temporal Consistency
Deep End-to-End Survival Analysis with Temporal Consistency
Mariana Vargas Vieyra
Pascal Frossard
21
0
0
09 Oct 2024
Learning in complex action spaces without policy gradients
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
34
0
0
08 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
45
0
0
07 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
43
1
0
07 Oct 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Zhaolin Gao
Wenhao Zhan
Jonathan D. Chang
Gokul Swamy
Kianté Brantley
Jason D. Lee
Wen Sun
OffRL
78
3
0
06 Oct 2024
Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy
  Gradients
Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients
Gabriel Chenevert
Jingqi Li
Achyuta kannan
S. Bae
Donggun Lee
30
2
0
03 Oct 2024
Learning Emergence of Interaction Patterns across Independent RL Agents
  in Multi-Agent Environments
Learning Emergence of Interaction Patterns across Independent RL Agents in Multi-Agent Environments
Vasanth Reddy Baddam
Suat Gumussoy
Almuatazbellah Boker
Hoda Eldardiry
OffRL
AI4CE
26
0
0
03 Oct 2024
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with
  LLM-Guided Knowledge
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge
Xiefeng Wu
OffRL
34
1
0
02 Oct 2024
MARLens: Understanding Multi-agent Reinforcement Learning for Traffic
  Signal Control via Visual Analytics
MARLens: Understanding Multi-agent Reinforcement Learning for Traffic Signal Control via Visual Analytics
Yutian Zhang
Guohong Zheng
Zhiyuan Liu
Quan Li
Haipeng Zeng
45
2
0
02 Oct 2024
Sampling from Energy-based Policies using Diffusion
Sampling from Energy-based Policies using Diffusion
V. Jain
Tara Akhound-Sadegh
Siamak Ravanbakhsh
DiffM
53
1
0
02 Oct 2024
Dual Approximation Policy Optimization
Dual Approximation Policy Optimization
Zhihan Xiong
Maryam Fazel
Lin Xiao
38
1
0
02 Oct 2024
Enabling Multi-Robot Collaboration from Single-Human Guidance
Enabling Multi-Robot Collaboration from Single-Human Guidance
Zhengran Ji
Lingyu Zhang
Paul Sajda
Boyuan Chen
42
1
0
30 Sep 2024
Generalizability of Graph Neural Networks for Decentralized Unlabeled
  Motion Planning
Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning
Shreyas Muthusamy
Damian Owerko
Charilaos I. Kanatsoulis
Saurav Agarwal
Alejandro Ribeiro
31
1
0
29 Sep 2024
Previous
123456...636465
Next