ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning
v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXiv (abs)PDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,794 papers shown
VisuoSpatial Foresight for Physical Sequential Fabric Manipulation
VisuoSpatial Foresight for Physical Sequential Fabric ManipulationAutonomous Robots (Auton. Robots), 2021
Ryan Hoque
Daniel Seita
Ashwin Balakrishna
Aditya Ganapathi
A. Tanwani
Nawid Jamali
K. Yamane
Soshi Iba
Ken Goldberg
148
43
0
19 Feb 2021
Decentralized Deterministic Multi-Agent Reinforcement Learning
Decentralized Deterministic Multi-Agent Reinforcement LearningIEEE Conference on Decision and Control (CDC), 2021
Antoine Grosnit
D. Cai
L. Wynter
OffRL
178
7
0
19 Feb 2021
State Entropy Maximization with Random Encoders for Efficient
  Exploration
State Entropy Maximization with Random Encoders for Efficient ExplorationInternational Conference on Machine Learning (ICML), 2021
Younggyo Seo
Lili Chen
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
258
146
0
18 Feb 2021
Reinforcement Learning for Datacenter Congestion Control
Reinforcement Learning for Datacenter Congestion ControlAAAI Conference on Artificial Intelligence (AAAI), 2021
Chen Tessler
Yuval Shpigelman
Gal Dalal
Amit Mandelbaum
Doron Haritan Kazakov
Benjamin Fuhrer
Gal Chechik
Shie Mannor
216
42
0
18 Feb 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
Finite-Sample Analysis of Off-Policy Natural Actor-Critic AlgorithmInternational Conference on Machine Learning (ICML), 2021
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CMLOffRL
250
32
0
18 Feb 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Continuous Doubly Constrained Batch Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
530
33
0
18 Feb 2021
Learning Memory-Dependent Continuous Control from Demonstrations
Learning Memory-Dependent Continuous Control from Demonstrations
Siqing Hou
Dongqi Han
Jun Tani
73
0
0
18 Feb 2021
Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in
  Indoor Graphics Scenes
Multi-Agent Reinforcement Learning of 3D Furniture Layout Simulation in Indoor Graphics Scenes
Xinhan Di
Pengqian Yu
AI4CE3DV
151
12
0
18 Feb 2021
SCAPE: Learning Stiffness Control from Augmented Position Control
  Experiences
SCAPE: Learning Stiffness Control from Augmented Position Control ExperiencesConference on Robot Learning (CoRL), 2021
Mincheol Kim
S. Niekum
A. Deshpande
205
8
0
16 Feb 2021
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade
  Execution
TradeR: Practical Deep Hierarchical Reinforcement Learning for Trade Execution
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
OffRL
104
4
0
16 Feb 2021
Model-based Meta Reinforcement Learning using Graph Structured Surrogate
  Models
Model-based Meta Reinforcement Learning using Graph Structured Surrogate ModelsInternational Conference on Machine Learning (ICML), 2021
Q. Wang
H. V. Hoof
OffRL
140
23
0
16 Feb 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement
  Learning Agents
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning AgentsNeural Information Processing Systems (NeurIPS), 2021
Wei Qiu
Xinrun Wang
Runsheng Yu
Xu He
Rongpin Wang
Bo An
S. Obraztsova
Zinovi Rabinovich
157
58
0
16 Feb 2021
Transferring Domain Knowledge with an Adviser in Continuous Tasks
Transferring Domain Knowledge with an Adviser in Continuous TasksPacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2021
Rukshan Wijesinghe
Kasun Vithanage
Dumindu Tissera
A. Xavier
Subha Fernando
Jayathu Samarawickrama
CLL
102
0
0
16 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems
A Survey of Machine Learning for Computer Architecture and SystemsACM Computing Surveys (CSUR), 2021
Nan Wu
Yuan Xie
AI4TSAI4CE
249
186
0
16 Feb 2021
Learning image quality assessment by reinforcing task amenable data
  selection
Learning image quality assessment by reinforcing task amenable data selectionInformation Processing in Medical Imaging (IPMI), 2021
Shaheer U. Saeed
Yunguan Fu
Zachary Michael Cieman Baum
Qianye Yang
M. Rusu
Richard E. Fan
G. Sonn
D. Barratt
Yipeng Hu
120
16
0
15 Feb 2021
Intelligent Electric Vehicle Charging Recommendation Based on
  Multi-Agent Reinforcement Learning
Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement LearningThe Web Conference (WWW), 2021
Weijiao Zhang
Hao Liu
Fan Wang
Tong Xu
Haoran Xin
Dejing Dou
Hui Xiong
137
99
0
15 Feb 2021
Reinforcement Learning for IoT Security: A Comprehensive Survey
Reinforcement Learning for IoT Security: A Comprehensive SurveyIEEE Internet of Things Journal (IEEE IoT Journal), 2021
Aashma Uprety
D. Rawat
AAML
204
152
0
14 Feb 2021
Resilient Machine Learning for Networked Cyber Physical Systems: A
  Survey for Machine Learning Security to Securing Machine Learning for CPS
Resilient Machine Learning for Networked Cyber Physical Systems: A Survey for Machine Learning Security to Securing Machine Learning for CPSIEEE Communications Surveys and Tutorials (COMST), 2021
Felix O. Olowononi
D. Rawat
Chunmei Liu
302
159
0
14 Feb 2021
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
Q-Value Weighted Regression: Reinforcement Learning with Limited DataIEEE International Joint Conference on Neural Network (IJCNN), 2021
Piotr Kozakowski
Lukasz Kaiser
Henryk Michalewski
Afroz Mohiuddin
Katarzyna Kañska
OffRL
170
6
0
12 Feb 2021
Deep Reinforcement Learning for Backup Strategies against Adversaries
Deep Reinforcement Learning for Backup Strategies against Adversaries
Pascal Debus
Nicolas Müller
Konstantin Böttinger
AAML
85
1
0
12 Feb 2021
Domain Adaptation In Reinforcement Learning Via Latent Unified State
  Representation
Domain Adaptation In Reinforcement Learning Via Latent Unified State RepresentationAAAI Conference on Artificial Intelligence (AAAI), 2021
Jinwei Xing
Takashi Nagata
Kexin Chen
Xinyun Zou
Emre Neftci
J. Krichmar
OOD
187
58
0
10 Feb 2021
Derivative-Free Reinforcement Learning: A Review
Derivative-Free Reinforcement Learning: A Review
Hong Qian
Yang Yu
OffRL
254
47
0
10 Feb 2021
Personalization for Web-based Services using Offline Reinforcement
  Learning
Personalization for Web-based Services using Offline Reinforcement LearningMachine-mediated learning (ML), 2021
P. Apostolopoulos
Zehui Wang
Hanson Wang
Chad Zhou
Kittipat Virochsiri
Norm Zhou
Igor L. Markov
OffRLOnRL
162
7
0
10 Feb 2021
Deep Reinforcement Learning with Symmetric Prior for Predictive Power
  Allocation to Mobile Users
Deep Reinforcement Learning with Symmetric Prior for Predictive Power Allocation to Mobile Users
Jianyu Zhao
Chenyang Yang
65
0
0
10 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
138
13
0
09 Feb 2021
Continuous-Time Model-Based Reinforcement Learning
Continuous-Time Model-Based Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Çağatay Yıldız
Markus Heinonen
Harri Lähdesmäki
OffRL
232
70
0
09 Feb 2021
Reverb: A Framework For Experience Replay
Reverb: A Framework For Experience Replay
Albin Cassirer
Gabriel Barth-Maron
E. Brevdo
Sabela Ramos
Toby Boyd
Thibault Sottiaux
M. Kroiss
VLMOffRL
151
42
0
09 Feb 2021
Adversarially Guided Actor-Critic
Adversarially Guided Actor-CriticInternational Conference on Learning Representations (ICLR), 2021
Yannis Flet-Berliac
Johan Ferret
Olivier Pietquin
Philippe Preux
Matthieu Geist
159
78
0
08 Feb 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A
  Focussed Mini-Review
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
169
181
0
08 Feb 2021
OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by
  Learning Distribution
OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning DistributionAAAI Conference on Artificial Intelligence (AAAI), 2021
Minfang Lu
Shuai Ning
Shuangrong Liu
Fengyang Sun
Bo Zhang
Bo Yang
Linshan Wang
362
6
0
07 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Tactical Optimism and Pessimism for Deep Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Sai Li
321
70
0
07 Feb 2021
A Hybrid Approach for Reinforcement Learning Using Virtual Policy
  Gradient for Balancing an Inverted Pendulum
A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum
Dylan Bates
46
5
0
06 Feb 2021
MSPM: A Modularized and Scalable Multi-Agent Reinforcement
  Learning-based System for Financial Portfolio Management
MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio ManagementPLoS ONE (PLOS ONE), 2021
Zhenhan Huang
F. Tanaka
AIFin
277
35
0
06 Feb 2021
Corner Case Generation and Analysis for Safety Assessment of Autonomous
  Vehicles
Corner Case Generation and Analysis for Safety Assessment of Autonomous VehiclesTransportation Research Record (TRR), 2021
Haowei Sun
Shuo Feng
Xintao Yan
Henry X. Liu
AAML
163
63
0
06 Feb 2021
Topology-Aware Network Pruning using Multi-stage Graph Embedding and
  Reinforcement Learning
Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021
Sixing Yu
Arya Mazaheri
Ali Jannesari
236
51
0
05 Feb 2021
A Survey on Mathematical Aspects of Machine Learning in GeoPhysics: The
  Cases of Weather Forecast, Wind Energy, Wave Energy, Oil and Gas Exploration
A Survey on Mathematical Aspects of Machine Learning in GeoPhysics: The Cases of Weather Forecast, Wind Energy, Wave Energy, Oil and Gas ExplorationMediterranean Conference on Embedded Computing (MECO), 2021
Miroslav Kosanic
V. Milutinovic
AI4CE
114
3
0
05 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense
  and dynamic scenarios
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios
Chengmin Zhou
Bingding Huang
Pasi Fränti
107
1
0
05 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
285
618
0
04 Feb 2021
A review of motion planning algorithms for intelligent robotics
A review of motion planning algorithms for intelligent robotics
Chengmin Zhou
Bingding Huang
Pasi Fränti
129
5
0
04 Feb 2021
Improved Cooperation by Exploiting a Common Signal
Improved Cooperation by Exploiting a Common SignalAutonomous Agents and Multi-Agent Systems (AAMAS), 2021
Panayiotis Danassis
Zeki Doruk Erden
Boi Faltings
133
2
0
03 Feb 2021
MolGrow: A Graph Normalizing Flow for Hierarchical Molecular Generation
MolGrow: A Graph Normalizing Flow for Hierarchical Molecular GenerationAAAI Conference on Artificial Intelligence (AAAI), 2021
Maksim Kuznetsov
Daniil Polykovskiy
204
53
0
03 Feb 2021
Policy Analysis using Synthetic Controls in Continuous-Time
Policy Analysis using Synthetic Controls in Continuous-TimeInternational Conference on Machine Learning (ICML), 2021
Alexis Bellot
M. Schaar
OffRL
138
18
0
02 Feb 2021
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against
  Metamorphic Malware
DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic MalwareDefence Science Journal (DSJ), 2021
Mohit Sewak
S. K. Sahay
Hemant Rathore
129
16
0
01 Feb 2021
Hybrid Beamforming for mmWave MU-MISO Systems Exploiting Multi-agent
  Deep Reinforcement Learning
Hybrid Beamforming for mmWave MU-MISO Systems Exploiting Multi-agent Deep Reinforcement LearningIEEE Wireless Communications Letters (WCL), 2021
Qisheng Wang
Xiao Li
Shi Jin
Yijian Chen
96
16
0
01 Feb 2021
Scalable Voltage Control using Structure-Driven Hierarchical Deep
  Reinforcement Learning
Scalable Voltage Control using Structure-Driven Hierarchical Deep Reinforcement Learning
Sayak Mukherjee
Renke Huang
Qiuhua Huang
T. Vu
Tianzhixi Yin
109
7
0
29 Jan 2021
Predicting Nanorobot Shapes via Generative Models
Predicting Nanorobot Shapes via Generative Models
Emma Benjaminson
Rebecca E. Taylor
Matthew Travers
97
0
0
29 Jan 2021
OffCon$^3$: What is state of the art anyway?
OffCon3^33: What is state of the art anyway?
Philip J. Ball
Stephen J. Roberts
OffRL
161
8
0
27 Jan 2021
Learning to falsify automated driving vehicles with prior knowledge
Learning to falsify automated driving vehicles with prior knowledgeIFAC-PapersOnLine (IFAC-PapersOnLine), 2021
Andrea Favrin
Vladislav Nenchev
Angelo Cenedese
110
8
0
25 Jan 2021
Breaking the Deadly Triad with a Target Network
Breaking the Deadly Triad with a Target NetworkInternational Conference on Machine Learning (ICML), 2021
Shangtong Zhang
Hengshuai Yao
Shimon Whiteson
AAML
734
56
0
21 Jan 2021
Robust Reinforcement Learning on State Observations with Learned Optimal
  Adversary
Robust Reinforcement Learning on State Observations with Learned Optimal AdversaryInternational Conference on Learning Representations (ICLR), 2021
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
286
193
0
21 Jan 2021
Previous
123...585960...949596
Next