Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
One-Shot Imitation Learning
Yan Duan
Marcin Andrychowicz
Bradly C. Stadie
Jonathan Ho
Jonas Schneider
Ilya Sutskever
Pieter Abbeel
Wojciech Zaremba
OffRL
316
722
0
21 Mar 2017
Learning to Navigate Cloth using Haptics
Alexander Clegg
Wenhao Yu
Zackory M. Erickson
Jie Tan
Chenxi Liu
Greg Turk
218
23
0
20 Mar 2017
Vision-based Robotic Arm Imitation by Human Gesture
Cheng Xuan
Zhiqiang Tang
Jinxin Xu
57
0
0
15 Mar 2017
Sensor Fusion for Robot Control through Deep Reinforcement Learning
Steven Bohez
Tim Verbelen
E. D. Coninck
B. Vankeirsbilck
Pieter Simoens
Bart Dhoedt
SSL
113
29
0
13 Mar 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
Yevgen Chebotar
Karol Hausman
Marvin Zhang
Gaurav Sukhatme
S. Schaal
Sergey Levine
321
163
0
08 Mar 2017
Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning
Abhishek Gupta
Coline Devin
YuXuan Liu
Pieter Abbeel
Sergey Levine
220
279
0
08 Mar 2017
Learning a Unified Control Policy for Safe Falling
Visak C. V. Kumar
Sehoon Ha
Karen Liu
98
23
0
08 Mar 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
310
968
0
08 Mar 2017
Towards Generalization and Simplicity in Continuous Control
Aravind Rajeswaran
Kendall Lowrey
E. Todorov
Sham Kakade
OffRL
245
287
0
08 Mar 2017
Revisiting stochastic off-policy action-value gradients
Yemi Okesanjo
Victor Kofia
OffRL
53
0
0
06 Mar 2017
Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation
Ashvin Nair
Dian Chen
Pulkit Agrawal
Phillip Isola
Pieter Abbeel
Jitendra Malik
Sergey Levine
SSL
227
324
0
06 Mar 2017
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning
Justin Fu
John D. Co-Reyes
Sergey Levine
OffRL
259
160
0
03 Mar 2017
FeUdal Networks for Hierarchical Reinforcement Learning
A. Vezhnevets
Simon Osindero
Tom Schaul
N. Heess
Max Jaderberg
David Silver
Koray Kavukcuoglu
FedML
285
990
0
03 Mar 2017
Deep Predictive Policy Training using Reinforcement Learning
Ali Ghadirzadeh
A. Maki
Danica Kragic
Mårten Björkman
156
132
0
02 Mar 2017
Reinforcement Learning for Pivoting Task
Rika Antonova
S. Cruciani
Christian Smith
Danica Kragic
139
70
0
01 Mar 2017
Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation
L. Tai
Giuseppe Paolo
Ming-Yuan Liu
463
765
0
01 Mar 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
484
514
0
28 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
International Conference on Machine Learning (ICML), 2017
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
416
1,495
0
27 Feb 2017
Learning Control for Air Hockey Striking using Deep Reinforcement Learning
Ayal Taitler
N. Shimkin
152
11
0
26 Feb 2017
Robot gains Social Intelligence through Multimodal Deep Reinforcement Learning
IEEE-RAS International Conference on Humanoid Robots (Humanoids), 2016
A. H. Qureshi
Yutaka Nakamura
Yuichiro Yoshikawa
H. Ishiguro
97
95
0
24 Feb 2017
Towards a Common Implementation of Reinforcement Learning for Multiple Robotic Tasks
Expert Systems with Applications (ESWA), 2017
Angel Martínez-Tenor
Juan-Antonio Fernández-Madrigal
A. Cruz-Martín
Javier González Jiménez
OffRL
100
32
0
21 Feb 2017
Learning to Multi-Task by Active Sampling
International Conference on Learning Representations (ICLR), 2017
Sahil Sharma
Ashutosh Jha
Parikshit Hegde
Balaraman Ravindran
247
21
0
20 Feb 2017
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
365
331
0
08 Feb 2017
Trainable Greedy Decoding for Neural Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017
Jiatao Gu
Dong Wang
Victor O.K. Li
249
77
0
08 Feb 2017
Adversarial Attacks on Neural Network Policies
International Conference on Learning Representations (ICLR), 2017
Sandy Huang
Nicolas Papernot
Ian Goodfellow
Yan Duan
Pieter Abbeel
MLAU
AAML
303
908
0
08 Feb 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
G. Kahn
Adam R. Villaflor
Vitchyr H. Pong
Pieter Abbeel
Sergey Levine
205
326
0
03 Feb 2017
Deep Reinforcement Learning for Robotic Manipulation-The state of the art
S. Amarjyoti
81
71
0
31 Jan 2017
Learning Light Transport the Reinforced Way
Ken Dahm
A. Keller
95
67
0
25 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
959
1,742
0
25 Jan 2017
Efficient iterative policy optimization
Nicolas Le Roux
96
8
0
28 Dec 2016
The Predictron: End-To-End Learning and Planning
International Conference on Machine Learning (ICML), 2016
David Silver
H. V. Hasselt
Matteo Hessel
Tom Schaul
A. Guez
...
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
T. Degris
245
300
0
28 Dec 2016
A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation
L. Tai
Jingwei Zhang
Ming-Yuan Liu
Joschka Boedecker
Wolfram Burgard
OffRL
304
78
0
21 Dec 2016
Deep Reinforcement Learning with Successor Features for Navigation across Similar Environments
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2016
Jingwei Zhang
Jost Tobias Springenberg
Joschka Boedecker
Wolfram Burgard
218
305
0
16 Dec 2016
Hierarchy through Composition with Linearly Solvable Markov Decision Processes
Andrew M. Saxe
A. Earle
Benjamin Rosman
42
1
0
08 Dec 2016
Cryptocurrency Portfolio Management with Deep Reinforcement Learning
Zhengyao Jiang
Jinjun Liang
AIFin
313
221
0
05 Dec 2016
Combining Deep Reinforcement Learning and Safety Based Control for Autonomous Driving
Xincheng Xiong
Jianqiang Wang
Fang Zhang
Keqiang Li
160
68
0
01 Dec 2016
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
249
29
0
28 Nov 2016
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU
Mohammad Babaeizadeh
I. Frosio
Stephen Tyree
Jason Clemons
Jan Kautz
OffRL
231
286
0
18 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
392
825
0
15 Nov 2016
Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control
Natasha Jaques
S. Gu
Dzmitry Bahdanau
José Miguel Hernández-Lobato
Richard Turner
Douglas Eck
573
201
0
09 Nov 2016
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
303
1,098
0
09 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
309
357
0
07 Nov 2016
Designing Neural Network Architectures using Reinforcement Learning
Bowen Baker
O. Gupta
Nikhil Naik
Ramesh Raskar
347
1,530
0
07 Nov 2016
Learning to Act by Predicting the Future
Alexey Dosovitskiy
V. Koltun
285
289
0
06 Nov 2016
Combining policy gradient and Q-learning
Brendan O'Donoghue
Rémi Munos
Koray Kavukcuoglu
Volodymyr Mnih
OffRL
OnRL
350
143
0
05 Nov 2016
Sample Efficient Actor-Critic with Experience Replay
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas
390
796
0
03 Nov 2016
Learning Locomotion Skills Using DeepRL: Does the Choice of Action Space Matter?
Xue Bin Peng
M. van de Panne
257
196
0
03 Nov 2016
Deep Learning Approximation for Stochastic Control Problems
Jiequn Han
E. Weinan
BDL
145
214
0
02 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
246
12
0
01 Nov 2016
Learning and Transfer of Modulated Locomotor Controllers
N. Heess
Greg Wayne
Yuval Tassa
Timothy Lillicrap
Martin Riedmiller
David Silver
178
218
0
17 Oct 2016