ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning
v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXiv (abs)PDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
Directional Ensemble Aggregation for Actor-Critics
Directional Ensemble Aggregation for Actor-Critics
Nicklas Werge
Yi-Shan Wu
Bahareh Tasdighi
M. Kandemir
OffRL
182
0
0
31 Jul 2025
One-Step Flow Policy Mirror Descent
One-Step Flow Policy Mirror Descent
Tianyi Chen
Haitong Ma
Na Li
Kai Wang
Bo Dai
257
1
0
31 Jul 2025
RL as Regressor: A Reinforcement Learning Approach for Function Approximation
RL as Regressor: A Reinforcement Learning Approach for Function Approximation
Yongchao Huang
OffRL
17
0
0
31 Jul 2025
Deep Reinforcement Learning in Factor Investment
Deep Reinforcement Learning in Factor Investment
Junlin Liu
AIFin
74
0
0
30 Jul 2025
Model Predictive Adversarial Imitation Learning for Planning from Observation
Model Predictive Adversarial Imitation Learning for Planning from Observation
Tyler Han
Yanda Bao
Bhaumik Mehta
Gabriel Guo
Anubhav Vishwakarma
...
Sanghun Jung
Rosario Scalise
Jason Zhou
Bryan Xu
Byron Boots
134
1
0
29 Jul 2025
Geometry of Neural Reinforcement Learning in Continuous State and Action Spaces
Geometry of Neural Reinforcement Learning in Continuous State and Action SpacesInternational Conference on Learning Representations (ICLR), 2025
Saket Tiwari
Omer Gottesman
George Konidaris
226
3
0
28 Jul 2025
Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic
Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic
Molly Wang
Kin.K Leung
94
0
0
27 Jul 2025
ASNN: Learning to Suggest Neural Architectures from Performance Distributions
ASNN: Learning to Suggest Neural Architectures from Performance Distributions
Jinwook Hong
65
0
0
27 Jul 2025
Observations Meet Actions: Learning Control-Sufficient Representations for Robust Policy Generalization
Observations Meet Actions: Learning Control-Sufficient Representations for Robust Policy Generalization
Yuliang Gu
H. Cao
Marco Caccamo
N. Hovakimyan
OffRLBDL
200
0
0
25 Jul 2025
Simulation-Driven Reinforcement Learning in Queuing Network Routing Optimization
Simulation-Driven Reinforcement Learning in Queuing Network Routing Optimization
Fatima Al-Ani
Molly Wang
Jevon Charles
Aaron Ong
Joshua Forday
Vinayak Modi
40
1
0
24 Jul 2025
From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models
From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models
Ruxin Chen
Ruxin Chen
222
0
0
24 Jul 2025
HARLF: Hierarchical Reinforcement Learning and Lightweight LLM-Driven Sentiment Integration for Financial Portfolio Optimization
HARLF: Hierarchical Reinforcement Learning and Lightweight LLM-Driven Sentiment Integration for Financial Portfolio Optimization
Benjamin Coriat
Eric Benhamou
AIFin
101
1
0
24 Jul 2025
Confidence Calibration in Vision-Language-Action Models
Confidence Calibration in Vision-Language-Action Models
Thomas P. Zollo
R. Zemel
149
1
0
23 Jul 2025
Guided Reinforcement Learning for Omnidirectional 3D Jumping in Quadruped Robots
Guided Reinforcement Learning for Omnidirectional 3D Jumping in Quadruped Robots
Riccardo Bussola
Michele Focchi
Giulio Turrisi
Claudio Semini
Luigi Palopoli
270
2
0
22 Jul 2025
Robust Control with Gradient Uncertainty
Robust Control with Gradient Uncertainty
Qian Qi
96
0
0
20 Jul 2025
Federated Reinforcement Learning in Heterogeneous Environments
Federated Reinforcement Learning in Heterogeneous Environments
Ukjo Hwang
Songnam Hong
FedML
152
1
0
19 Jul 2025
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
Chongli Qin
Jost Tobias Springenberg
OffRL
209
12
0
17 Jul 2025
Relative Entropy Pathwise Policy Optimization
Relative Entropy Pathwise Policy Optimization
C. Voelcker
Axel Brunnbauer
Marcel Hussing
Michal Nauman
Pieter Abbeel
Eric Eaton
Radu Grosu
Amir-massoud Farahmand
Igor Gilitschenski
368
0
0
15 Jul 2025
Solving dynamic portfolio selection problems via score-based diffusion models
Solving dynamic portfolio selection problems via score-based diffusion models
Ahmad Aghapour
Erhan Bayraktar
Fengyi Yuan
DiffM
264
2
0
14 Jul 2025
Multimodal Visual Transformer for Sim2real Transfer in Visual Reinforcement Learning
Multimodal Visual Transformer for Sim2real Transfer in Visual Reinforcement Learning
Zichun Xu
Yuntao Li
Zhaomin Wang
Lei Zhuang
Guocai Yang
Jingdong Zhao
MDE
280
0
0
12 Jul 2025
Reinforcement Learning with Action Chunking
Reinforcement Learning with Action Chunking
Qiyang Li
Zhiyuan Zhou
Sergey Levine
OffRLOnRL
391
24
0
10 Jul 2025
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
David Bossens
Atsushi Nitanda
371
0
0
29 Jun 2025
An Introduction to Zero-Order Optimization Techniques for Robotics
An Introduction to Zero-Order Optimization Techniques for Robotics
Armand Jordana
J. Zhang
Joseph Amigo
Ludovic Righetti
155
2
0
27 Jun 2025
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
Prajwal Koirala
Cody Fleming
OffRL
319
4
0
26 Jun 2025
CyGym: A Simulation-Based Game-Theoretic Analysis Framework for Cybersecurity
CyGym: A Simulation-Based Game-Theoretic Analysis Framework for Cybersecurity
Michael Lanier
Yevgeniy Vorobeychik
AAML
137
1
0
26 Jun 2025
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Li Shen
Pierre-Luc Bacon
Dacheng Tao
OffRL
182
6
0
20 Jun 2025
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
304
0
0
18 Jun 2025
Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Roger Creus Castanyer
J. Obando-Ceron
Lu Li
Pierre-Luc Bacon
Glen Berseth
Aaron Courville
Pablo Samuel Castro
216
4
0
18 Jun 2025
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
Andrew Wagenmaker
Mitsuhiko Nakamoto
Yunchu Zhang
S. Park
Waleed Yagoub
Anusha Nagabandi
Abhishek Gupta
Sergey Levine
OffRL
325
26
0
18 Jun 2025
Common Benchmarks Undervalue the Generalization Power of Programmatic Policies
Common Benchmarks Undervalue the Generalization Power of Programmatic Policies
Amirhossein Rajabpour
Kiarash Aghakasiri
Sandra Zilles
Levi H. S. Lelis
OffRL
186
0
0
17 Jun 2025
Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy
Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion PolicySymposium on Software Performance (SP), 2025
Amornyos Horprasert
Esa Apriaskar
Xingyu Liu
Lanlan Su
Lyudmila S. Mihaylova
147
1
0
16 Jun 2025
Touch begins where vision ends: Generalizable policies for contact-rich manipulation
Touch begins where vision ends: Generalizable policies for contact-rich manipulation
Zifan Zhao
Siddhant Haldar
Jinda Cui
Lerrel Pinto
Raunaq M. Bhirangi
OffRL
257
4
0
16 Jun 2025
Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning
Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning
Stella C. Dong
James R. Finlay
139
2
0
16 Jun 2025
Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning
Sara Rajaram
R. J. Cotton
Fabian H. Sinz
181
1
0
14 Jun 2025
Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in Robopatients
Palpation Alters Auditory Pain Expressions with Gender-Specific Variations in RobopatientsIEEE Transactions on Medical Robotics and Bionics (TMRB), 2025
Chapa Sirithunge
Yue Xie
Saitarun Nadipineni
Fumiya Iida
Thilina Dulantha Lalitharatne
141
0
0
13 Jun 2025
Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems
Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems
Zhipeng Bao
Qianwen Li
245
0
0
13 Jun 2025
Wasserstein Barycenter Soft Actor-Critic
Wasserstein Barycenter Soft Actor-Critic
Zahra Shahrooei
Ali Baheri
OffRL
277
1
0
11 Jun 2025
GPS Spoofing Attacks on AI-based Navigation Systems with Obstacle Avoidance in UAV
Ji Hyuk Jung
Mi Yeon Hong
Ji Won Yoon
AAML
114
1
0
10 Jun 2025
Uncovering the Computational Roles of Nonlinearity in Sequence Modeling Using Almost-Linear RNNs
Uncovering the Computational Roles of Nonlinearity in Sequence Modeling Using Almost-Linear RNNs
Manuel Brenner
G. Koppe
203
0
0
09 Jun 2025
Monotone and Conservative Policy Iteration Beyond the Tabular Case
Monotone and Conservative Policy Iteration Beyond the Tabular Case
Eshwar S. R.
Gugan Thoppe
Ananyabrata Barua
Aditya Gopalan
Gal Dalal
182
1
0
08 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
299
0
0
06 Jun 2025
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
Motoki Omura
Kazuki Ota
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
298
0
0
06 Jun 2025
Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions
Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual DistractionsAAAI Conference on Artificial Intelligence (AAAI), 2023
Jeongsoo Ha
Kyungsoo Kim
Yusung Kim
OffRLVLM
177
10
0
05 Jun 2025
When Maximum Entropy Misleads Policy Optimization
When Maximum Entropy Misleads Policy Optimization
Ruipeng Zhang
Ya-Chien Chang
Sicun Gao
166
6
0
05 Jun 2025
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite IndividualsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yangyang Zhao
Ben Niu
L. Qin
Shihan Wang
242
3
0
04 Jun 2025
Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration
Autonomous Vehicle Lateral Control Using Deep Reinforcement Learning with MPC-PID Demonstration
Chengdong Wu
Sven Kirchner
Nils Purschke
Alois Knoll
172
2
0
04 Jun 2025
A Novel Deep Reinforcement Learning Method for Computation Offloading in Multi-User Mobile Edge Computing with Decentralization
A Novel Deep Reinforcement Learning Method for Computation Offloading in Multi-User Mobile Edge Computing with DecentralizationInternational Conference on Autonomic and Trusted Computing (ATC), 2024
Nguyen Chi Long
Trinh Van Chien
Ta Hai Tung
V. Nguyen
Trong-Minh Hoang
Nguyen Ngoc Hai Dang
126
0
0
03 Jun 2025
Data-assimilated model-informed reinforcement learning
Data-assimilated model-informed reinforcement learning
D. E. Ozan
Andrea Nóvoa
Georgios Rigas
Luca Magri
AI4CE
313
1
0
02 Jun 2025
Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
Yixian Zhang
Huaze Tang
Changxu Wei
Wenbo Ding
174
0
0
02 Jun 2025
Optimistic critics can empower small actors
Optimistic critics can empower small actors
Olya Mastikhina
Dhruv Sreenivas
Pablo Samuel Castro
517
3
0
01 Jun 2025
Previous
123456...949596
Next