ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning
v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXiv (abs)PDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
Reinforcement Learning with Random Time Horizons
Reinforcement Learning with Random Time Horizons
Enric Ribera Borrell
Lorenz Richter
Christof Schütte
AI4TS
181
1
0
01 Jun 2025
Optimistic critics can empower small actors
Optimistic critics can empower small actors
Olya Mastikhina
Dhruv Sreenivas
Pablo Samuel Castro
517
3
0
01 Jun 2025
Optimized Local Updates in Federated Learning via Reinforcement Learning
Optimized Local Updates in Federated Learning via Reinforcement Learning
Ali Murad
Bo Hui
Wei-Shinn Ku
FedML
269
0
0
31 May 2025
Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control
Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control
Zijie Xu
Tong Bu
Zecheng Hao
Jianhao Ding
Zhaofei Yu
238
0
0
30 May 2025
Learning Recommender Mechanisms for Bayesian Stochastic Games
Learning Recommender Mechanisms for Bayesian Stochastic Games
Bengisu Guresti
Chongjie Zhang
Yevgeniy Vorobeychik
OffRL
238
0
0
29 May 2025
Diffusion Guidance Is a Controllable Policy Improvement Operator
Diffusion Guidance Is a Controllable Policy Improvement Operator
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
283
11
0
29 May 2025
Convergent Functions, Divergent Forms
Convergent Functions, Divergent Forms
Hyeonseong Jeon
Ainaz Eftekhar
Aaron Walsman
Kuo-Hao Zeng
Ali Farhadi
Ranjay Krishna
204
3
0
27 May 2025
Deep Actor-Critics with Tight Risk Certificates
Deep Actor-Critics with Tight Risk Certificates
Bahareh Tasdighi
Manuel Haussmann
Yi-Shan Wu
A. Masegosa
M. Kandemir
UQCV
367
0
0
26 May 2025
Accelerating Nash Learning from Human Feedback via Mirror Prox
Accelerating Nash Learning from Human Feedback via Mirror Prox
D. Tiapkin
Daniele Calandriello
Denis Belomestny
Eric Moulines
Alexey Naumov
Kashif Rasul
Michal Valko
Pierre Ménard
243
3
0
26 May 2025
MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection
MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection
Yinuo Xue
Eric Spero
Yun Sing Koh
Giovanni Russello
AAML
264
9
0
26 May 2025
Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network
Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network
Bingdong Li
Mei Jiang
Hong Qian
Shengcai Liu
W. Hong
Peng Yang
375
1
0
26 May 2025
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Improving Value Estimation Critically Enhances Vanilla Policy Gradient
Tao Wang
Ruipeng Zhang
Sicun Gao
OffRL
196
2
0
25 May 2025
AmorLIP: Efficient Language-Image Pretraining via Amortization
AmorLIP: Efficient Language-Image Pretraining via Amortization
Haotian Sun
Yitong Li
Yuchen Zhuang
Niao He
Hanjun Dai
Bo Dai
VLM
364
1
0
25 May 2025
Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning
Reduce Computational Cost In Deep Reinforcement Learning Via Randomized Policy Learning
Zhuochen Liu
Rahul Jain
Quan Nguyen
172
0
0
25 May 2025
Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos
Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos
Andrea Ramazzina
Vittorio Giammarino
Matteo El-Hariry
Mario Bijelic
VGenAAML
190
1
0
24 May 2025
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
Federico Zocco
Andrea Corti
Monica Malvezzi
AI4CE
336
1
0
24 May 2025
DiffusionRL: Efficient Training of Diffusion Policies for Robotic Grasping Using RL-Adapted Large-Scale Datasets
DiffusionRL: Efficient Training of Diffusion Policies for Robotic Grasping Using RL-Adapted Large-Scale Datasets
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
208
1
0
24 May 2025
Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space
Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space
Jinrong Yang
Kexun Chen
Zhuoling Li
Shengkai Wu
Yong Zhao
...
Chaohui Shang
Meiyu Zhi
Linfeng Gao
Mingshan Sun
Hui Cheng
245
1
0
23 May 2025
Distances for Markov chains from sample streams
Distances for Markov chains from sample streams
Sergio Calo
Anders Jonsson
Gergely Neu
Ludovic Schwartz
Javier Segovia-Aguas
194
1
0
23 May 2025
Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning
Solving General-Utility Markov Decision Processes in the Single-Trial Regime with Online Planning
Pedro P. Santos
Alberto Sardinha
Francisco S. Melo
91
0
0
21 May 2025
GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement Learning
GCNT: Graph-Based Transformer Policies for Morphology-Agnostic Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Yingbo Luo
Meibao Yao
Xueming Xiao
311
4
0
21 May 2025
Embedded Mean Field Reinforcement Learning for Perimeter-defense Game
Embedded Mean Field Reinforcement Learning for Perimeter-defense Game
Li Wang
Xin Yu
Xuxin Lv
Gangzheng Ai
Wenjun Wu
AAML
230
0
0
20 May 2025
A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut
A Dataless Reinforcement Learning Approach to Rounding Hyperplane Optimization for Max-Cut
Gabriel Malikal
Ismail Alkhouri
Alvaro Velasquez
Adam M Alessio
S. Ravishankar
347
0
0
19 May 2025
Multi-parameter Control for the $(1+(λ,λ))$-GA on OneMax via Deep Reinforcement Learning
Multi-parameter Control for the (1+(λ,λ))(1+(λ,λ))(1+(λ,λ))-GA on OneMax via Deep Reinforcement LearningFoundations of Genetic Algorithms (FOGA), 2025
Tai Nguyen
Phong Le
Carola Doerr
Nguyen Dang
372
0
0
19 May 2025
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
Gang Li
Ming Lin
Tomer Galanti
Zhengzhong Tu
Tianbao Yang
486
8
0
18 May 2025
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Multi-CALF: A Policy Combination Approach with Statistical Guarantees
Georgiy Malaniya
Anton Bolychev
Grigory Yaremenko
Anastasia Krasnaya
Pavel Osinenko
227
0
0
18 May 2025
SAINT: Attention-Based Policies for Discrete Combinatorial Action Spaces
SAINT: Attention-Based Policies for Discrete Combinatorial Action Spaces
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
214
0
0
17 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
425
2
0
16 May 2025
Zero-Shot Visual Generalization in Robot Manipulation
Zero-Shot Visual Generalization in Robot Manipulation
Sumeet Batra
Gaurav Sukhatme
227
3
0
16 May 2025
Bi-Level Policy Optimization with Nyström Hypergradients
Bi-Level Policy Optimization with Nyström Hypergradients
Arjun Prakash
Naicheng He
Denizalp Goktas
Amy Greenwald
229
0
0
16 May 2025
Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
Wenrui Cai
Chengyu Wang
Junbing Yan
Jun Huang
Xiangzhong Fang
LRM
162
0
0
16 May 2025
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks
Feiran You
Hongyang Du
OffRLLRM
230
6
0
16 May 2025
GLOVA: Global and Local Variation-Aware Analog Circuit Design with Risk-Sensitive Reinforcement Learning
GLOVA: Global and Local Variation-Aware Analog Circuit Design with Risk-Sensitive Reinforcement LearningDesign Automation Conference (DAC), 2025
Dongjun Kim
Junwoo Park
Chaehyeon Shin
Jaeheon Jung
Kyungho Shin
...
Sanghyuk Heo
Woongrae Kim
Inchul Jeong
Joohwan Cho
Jongsun Park
185
2
0
16 May 2025
Modular Robot Control with Motor Primitives
Modular Robot Control with Motor Primitives
Moses C. Nah
Johannes Lachner
Neville Hogan
315
2
0
15 May 2025
Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation
Knowledge capture, adaptation and composition (KCAC): A framework for cross-task curriculum learning in robotic manipulation
Xinrui Wang
Yan Jin
331
0
0
15 May 2025
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation
Accelerating Visual-Policy Learning through Parallel Differentiable Simulation
Haoxiang You
Yilang Liu
Ian Abraham
394
0
0
15 May 2025
Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections
Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections
Pankaj Kumar
Aditya Mishra
Pranamesh Chakraborty
Subrahmanya Swamy Peruru
217
0
0
13 May 2025
Deep Reinforcement Learning for Power Grid Multi-Stage Cascading Failure Mitigation
Deep Reinforcement Learning for Power Grid Multi-Stage Cascading Failure Mitigation
Bo Meng
Chenghao Xu
Yongli Zhu
AI4CE
135
0
0
13 May 2025
MA-ROESL: Motion-aware Rapid Reward Optimization for Efficient Robot Skill Learning from Single Videos
MA-ROESL: Motion-aware Rapid Reward Optimization for Efficient Robot Skill Learning from Single Videos
Xinyu Wang
Xinming Zhang
Yanjun Chen
Xiaoyu Shen
Wei Zhang
244
0
0
13 May 2025
Adaptive Security Policy Management in Cloud Environments Using Reinforcement Learning
Adaptive Security Policy Management in Cloud Environments Using Reinforcement Learning
Muhammad Saqib
Dipkumar Mehta
Fnu Yashu
Shubham Malhotra
104
2
0
13 May 2025
Learning Value of Information towards Joint Communication and Control in 6G V2X
Learning Value of Information towards Joint Communication and Control in 6G V2X
Lei Lei
K. Zheng
Xuemin
Shen
349
4
0
11 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian Sun
Xinhu Zheng
Gang Wang
AI4CE
541
24
0
08 May 2025
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Graph Neural Network Aided Deep Reinforcement Learning for Resource Allocation in Dynamic Terahertz UAV Networks
Zhifeng Hu
Chong Han
246
1
0
08 May 2025
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Abdulaziz Almuzairee
Rohan Patil
Dwait Bhatt
Henrik I. Christensen
364
1
0
07 May 2025
Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning
Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning
Hao Peng
Xiang Huang
Shuo Sun
Ruitong Zhang
Philip S. Yu
221
0
0
07 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Baida Zhang
Yakai Chen
Huichun Li
Zhenghu Zu
449
0
0
07 May 2025
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
Jake Grigsby
Yuke Zhu
Michael S Ryoo
Juan Carlos Niebles
OffRLVLM
334
2
0
06 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
327
2
0
06 May 2025
Joint Resource Management for Energy-efficient UAV-assisted SWIPT-MEC: A Deep Reinforcement Learning Approach
Joint Resource Management for Energy-efficient UAV-assisted SWIPT-MEC: A Deep Reinforcement Learning ApproachIEEE Internet of Things Journal (IEEE IoT J.), 2025
Yue Chen
Hui Kang
Jiahui Li
Geng Sun
Boxiong Wang
Jiacheng Wang
Cong Liang
Shuang Liang
Dusit Niyato
488
7
0
06 May 2025
Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning
Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill LearningIEEE International Conference on Robotics and Automation (ICRA), 2025
Changxin Huang
Junyang Liang
Yanbin Chang
Jingzhao Xu
Jianqiang Li
263
0
0
05 May 2025
Previous
123...567...949596
Next