Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.02971
Cited By
Continuous control with deep reinforcement learning
9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Continuous control with deep reinforcement learning"
50 / 3,200 papers shown
Title
Stability Enhancement in Reinforcement Learning via Adaptive Control Lyapunov Function
Donghe Chen
Han Wang
Lin Cheng
Shengping Gong
209
0
0
18 Jan 2025
The surprising efficiency of temporal difference learning for rare event prediction
Xiaoou Cheng
Jonathan Weare
OffRL
46
0
0
17 Jan 2025
Dynamic Portfolio Optimization via Augmented DDPG with Quantum Price Levels-Based Trading Strategy
Runsheng Lin
Zihan Xing
Mingze Ma
Raymond S.T. Lee
49
2
0
15 Jan 2025
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
Mingkang Wu
Devin White
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
OffRL
39
0
0
13 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRL
CLL
55
0
0
12 Jan 2025
CuRLA: Curriculum Learning Based Deep Reinforcement Learning for Autonomous Driving
Bhargava Uppuluri
Anjel Patel
Neil Mehta
Sridhar Kamath
Pratyush Chakraborty
57
0
0
10 Jan 2025
On the role of Artificial Intelligence methods in modern force-controlled manufacturing robotic tasks
Vincenzo Petrone
Enrico Ferrentino
Pasquale Chiacchio
39
0
0
10 Jan 2025
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes
Zijian Wang
Bin Wang
Mingwen Shao
Hongbo Dou
Boxiang Tao
38
0
0
06 Jan 2025
Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning
Gavin B. Rens
53
0
0
03 Jan 2025
CREW: Facilitating Human-AI Teaming Research
Lingyu Zhang
Zhengran Ji
Boyuan Chen
54
3
0
03 Jan 2025
Distributed Multi-Agent Reinforcement Learning with One-hop Neighbors and Compute Straggler Mitigation
Baoqian Wang
Junfei Xie
Nikolay Atanasov
42
10
0
03 Jan 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
Han Zhang
Xiaoman Pan
Hongwei Wang
Kaixin Ma
Wenhao Yu
Dong Yu
LLMAG
69
3
0
03 Jan 2025
Image Classification with Deep Reinforcement Active Learning
Mingyuan Jiu
Xuguang Song
H. Sahbi
Shupan Li
Yan Chen
Wei Guo
Lihua Guo
Mingliang Xu
VLM
29
0
0
31 Dec 2024
Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics
Neil De La Fuente
Miquel Noguer i Alonso
Guim Casadellà
38
0
0
31 Dec 2024
Predictive Monitoring of Black-Box Dynamical Systems
T. Henzinger
Fabian Kresse
Kaushik Mallik
Emily Yu
Đorđe Žikelić
75
0
0
21 Dec 2024
When Should We Prefer State-to-Visual DAgger Over Visual Reinforcement Learning?
Tongzhou Mu
Zhaoyang Li
Stanisław Wiktor Strzelecki
Xiu Yuan
Yunchao Yao
Litian Liang
H. Su
OffRL
88
2
0
18 Dec 2024
Harvesting energy from turbulent winds with Reinforcement Learning
Lorenzo Basile
Maria Grazia Berni
Antonio Celani
74
0
0
18 Dec 2024
Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method
Jiate Li
Meng Pang
Binghui Wang
AAML
79
1
0
17 Dec 2024
An Advantage-based Optimization Method for Reinforcement Learning in Large Action Space
Hai Lin
Cheng Huang
Zhihong Chen
OffRL
74
0
0
17 Dec 2024
Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning
H. Cao
Y. Mao
L. Sha
Marco Caccamo
OffRL
100
0
0
17 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
89
0
0
16 Dec 2024
Adaptive Reward Design for Reinforcement Learning
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
75
2
0
14 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
94
0
0
14 Dec 2024
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
86
0
0
13 Dec 2024
Distributional Reinforcement Learning based Integrated Decision Making and Control for Autonomous Surface Vehicles
Xi Lin
Paul Szenher
Yewei Huang
Brendan Englot
81
1
0
12 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
106
4
0
09 Dec 2024
A Scalable Decentralized Reinforcement Learning Framework for UAV Target Localization Using Recurrent PPO
Leon Fernando
Billy Pik Lik Lau
Chau Yuen
U-Xuan Tan
67
0
0
09 Dec 2024
Conformal Symplectic Optimization for Stable Reinforcement Learning
Yao Lyu
Xiangteng Zhang
Shengbo Eben Li
Jingliang Duan
Letian Tao
Qing Xu
Lei He
Keqiang Li
73
0
0
03 Dec 2024
A Memory-Based Reinforcement Learning Approach to Integrated Sensing and Communication
Homa Nikbakht
Michèle Wigger
S. Shamai
H. Vincent Poor
67
4
0
02 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Yi Zhang
D. Yao
75
0
0
30 Nov 2024
A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation
Yang Lv
Jinlong Lei
Peng Yi
57
1
0
29 Nov 2024
Application of Soft Actor-Critic Algorithms in Optimizing Wastewater Treatment with Time Delays Integration
Esmaeel Mohammadi
D. O. Arroyo
A. A. Hansen
Mikkel Stokholm-Bjerregaard
S. Gros
Akhil S. Anand
Petar Durdevic
68
0
0
27 Nov 2024
Monocular Obstacle Avoidance Based on Inverse PPO for Fixed-wing UAVs
Haochen Chai
Meimei Su
Yang Lyu
Zhunga Liu
Chunhui Zhao
Quan Pan
81
0
0
27 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
76
0
0
24 Nov 2024
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning
Zhi Luo
Xiaoyu Yang
Pan Zhou
D. Wang
AAML
76
0
0
20 Nov 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
28
3
0
17 Nov 2024
Wireless Resource Allocation with Collaborative Distributed and Centralized DRL under Control Channel Attacks
Ke Wang
Wen Liu
Teng Joon Lim
29
0
0
16 Nov 2024
OCMDP: Observation-Constrained Markov Decision Process
Taiyi Wang
Jianheng Liu
Bryan Lee
Zhihao Wu
Yu Wu
36
1
0
11 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
29
2
0
08 Nov 2024
A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
Panwen Hu
Nan Xiao
Feifei Li
Yongquan Chen
Rui Huang
VGen
OffRL
60
3
0
07 Nov 2024
Robust Real-Time Mortality Prediction in the Intensive Care Unit using Temporal Difference Learning
Thomas Frost
Kezhi Li
Steve Harris
OOD
29
1
0
06 Nov 2024
Two-Timescale Model Caching and Resource Allocation for Edge-Enabled AI-Generated Content Services
Zhang Liu
Hongyang Du
Xiangwang Hou
Lianfen Huang
Seyyedali Hosseinalipour
Dusit Niyato
K. B. Letaief
DiffM
49
1
0
03 Nov 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
Alex Schwing
Yu-xiong Wang
OffRL
OnRL
41
0
0
31 Oct 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
37
0
0
31 Oct 2024
Deterministic Exploration via Stationary Bellman Error Maximization
Sebastian Griesbach
Carlo DÉramo
33
0
0
31 Oct 2024
Multi-Robot Pursuit in Parameterized Formation via Imitation Learning
Jinyong Chen
Rui Zhou
Zhaozong Wang
Yunjie Zhang
Guibin Sun
33
0
0
31 Oct 2024
ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs
Yuchen Yang
Shubham Ugare
Yifan Zhao
Gagandeep Singh
Sasa Misailovic
MQ
35
0
0
31 Oct 2024
Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning
Qiliang Chen
Babak Heydari
DRL
41
0
0
30 Oct 2024
Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards
Irmak Güzey
Yinlong Dai
Georgy Savva
Raunaq M. Bhirangi
Lerrel Pinto
46
7
0
30 Oct 2024
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction
Qidong Yang
Weicheng Zhu
Joseph Keslin
L. Zanna
Tim G. J. Rudner
Carlos Fernandez-Granda
BDL
UQCV
AI4TS
48
0
0
30 Oct 2024
Previous
1
2
3
4
5
...
62
63
64
Next