ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1509.02971
  4. Cited By
Continuous control with deep reinforcement learning
v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
ArXiv (abs)PDFHTML

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown
RAIL: Risk-Averse Imitation Learning
RAIL: Risk-Averse Imitation Learning
Anirban Santara
A. Naik
Balaraman Ravindran
Dipankar Das
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
266
19
0
20 Jul 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
242
587
0
19 Jul 2017
Reverse Curriculum Generation for Reinforcement Learning
Reverse Curriculum Generation for Reinforcement Learning
Carlos Florensa
David Held
Markus Wulfmeier
Michael Zhang
Pieter Abbeel
368
487
0
17 Jul 2017
Control of a Quadrotor with Reinforcement Learning
Control of a Quadrotor with Reinforcement Learning
Jemin Hwangbo
Inkyu Sa
Roland Siegwart
Marco Hutter
214
533
0
17 Jul 2017
The Intentional Unintentional Agent: Learning to Solve Many Continuous
  Control Tasks Simultaneously
The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously
Serkan Cabi
Sergio Gomez Colmenarejo
Matthew W. Hoffman
Misha Denil
Ziyun Wang
Nando de Freitas
179
31
0
11 Jul 2017
Learning Heuristic Search via Imitation
Learning Heuristic Search via Imitation
M. Bhardwaj
Sanjiban Choudhury
Sebastian Scherer
151
90
0
10 Jul 2017
Vision-Based Multi-Task Manipulation for Inexpensive Robots Using
  End-To-End Learning from Demonstration
Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration
Rouhollah Rahmatizadeh
P. Abolghasemi
Ladislau Bölöni
Sergey Levine
288
271
0
10 Jul 2017
Robust Imitation of Diverse Behaviors
Robust Imitation of Diverse Behaviors
Ziyun Wang
J. Merel
Scott E. Reed
Greg Wayne
Nando de Freitas
N. Heess
251
215
0
10 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
483
979
0
07 Jul 2017
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
209
113
0
06 Jul 2017
Hindsight Experience Replay
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
705
2,555
0
05 Jul 2017
Hashing over Predicted Future Frames for Informed Exploration of Deep
  Reinforcement Learning
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning
Haiyan Yin
Jianda Chen
Sinno Jialin Pan
67
5
0
03 Jul 2017
Teacher-Student Curriculum Learning
Teacher-Student Curriculum Learning
Tambet Matiisen
Avital Oliver
Taco S. Cohen
John Schulman
ODL
465
427
0
01 Jul 2017
Noisy Networks for Exploration
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
330
967
0
30 Jun 2017
A Deep Reinforcement Learning Framework for the Financial Portfolio
  Management Problem
A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem
Zhengyao Jiang
Dixing Xu
Jinjun Liang
OOD
266
404
0
30 Jun 2017
Path Integral Networks: End-to-End Differentiable Optimal Control
Path Integral Networks: End-to-End Differentiable Optimal Control
Masashi Okada
Luca Rigazio
T. Aoshima
PINN
196
59
0
29 Jun 2017
Learning to Learn: Meta-Critic Networks for Sample Efficient Learning
Learning to Learn: Meta-Critic Networks for Sample Efficient Learning
Flood Sung
Li Zhang
Tao Xiang
Timothy M. Hospedales
Yongxin Yang
OffRL
133
136
0
29 Jun 2017
Programmable Agents
Programmable Agents
Misha Denil
Sergio Gomez Colmenarejo
Serkan Cabi
D. Saxton
Nando de Freitas
AI4CE
176
43
0
20 Jun 2017
Expected Policy Gradients
Expected Policy Gradients
K. Ciosek
Shimon Whiteson
380
59
0
15 Jun 2017
Data-Efficient Policy Evaluation Through Behavior Policy Search
Data-Efficient Policy Evaluation Through Behavior Policy SearchInternational Conference on Machine Learning (ICML), 2017
Josiah P. Hanna
Philip S. Thomas
Peter Stone
S. Niekum
OffRL
178
46
0
12 Jun 2017
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with
  Deep Multi-agent Reinforcement Learning
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning
Hangyu Mao
Zhibo Gong
Yan Ni
Zhen Xiao
225
46
0
10 Jun 2017
Unlocking the Potential of Simulators: Design with RL in Mind
Unlocking the Potential of Simulators: Design with RL in Mind
Rika Antonova
S. Cruciani
120
2
0
08 Jun 2017
Generalized Value Iteration Networks: Life Beyond Lattices
Generalized Value Iteration Networks: Life Beyond LatticesAAAI Conference on Artificial Intelligence (AAAI), 2017
Sufeng Niu
Siheng Chen
Hanyu Guo
Colin Targonski
M. C. Smith
J. Kovacevic
GNN
239
57
0
08 Jun 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive EnvironmentsNeural Information Processing Systems (NeurIPS), 2017
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
757
5,326
0
07 Jun 2017
Parameter Space Noise for Exploration
Parameter Space Noise for ExplorationInternational Conference on Learning Representations (ICLR), 2017
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
246
655
0
06 Jun 2017
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient
  Estimation for Deep Reinforcement Learning
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2017
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Bernhard Schölkopf
Sergey Levine
OffRL
175
172
0
01 Jun 2017
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time
  Budget
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
90
2
0
31 May 2017
Constrained Policy Optimization
Constrained Policy OptimizationInternational Conference on Machine Learning (ICML), 2017
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
1.3K
1,571
0
30 May 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Learning End-to-end Multimodal Sensor Policies for Autonomous NavigationConference on Robot Learning (CoRL), 2017
Guan-Horng Liu
Avinash Siravuru
Sai P. Selvaraj
Manuela Veloso
George Kantor
170
71
0
30 May 2017
The Marginal Value of Adaptive Gradient Methods in Machine Learning
The Marginal Value of Adaptive Gradient Methods in Machine Learning
Ashia Wilson
Rebecca Roelofs
Mitchell Stern
Nathan Srebro
Benjamin Recht
ODL
396
1,108
0
23 May 2017
Visual Semantic Planning using Deep Successor Representations
Visual Semantic Planning using Deep Successor Representations
Yuke Zhu
Daniel Gordon
Eric Kolve
Dieter Fox
Li Fei-Fei
Abhinav Gupta
Roozbeh Mottaghi
Ali Farhadi
199
142
0
23 May 2017
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep
  Reinforcement Learning
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning
Sahil Sharma
J. GirishRaguvir
S. Ramesh
Balaraman Ravindran
93
6
0
21 May 2017
Learning to Factor Policies and Action-Value Functions: Factored Action
  Space Representations for Deep Reinforcement learning
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
141
37
0
20 May 2017
Automatic Goal Generation for Reinforcement Learning Agents
Automatic Goal Generation for Reinforcement Learning Agents
Carlos Florensa
David Held
Xinyang Geng
Pieter Abbeel
523
553
0
17 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRMSSL
553
2,728
0
15 May 2017
Discrete Sequential Prediction of Continuous Actions for Deep RL
Discrete Sequential Prediction of Continuous Actions for Deep RL
Luke Metz
Julian Ibarz
Navdeep Jaitly
James Davidson
BDLOffRL
330
132
0
14 May 2017
Metacontrol for Adaptive Imagination-Based Optimization
Metacontrol for Adaptive Imagination-Based Optimization
Jessica B. Hamrick
A. J. Ballard
Razvan Pascanu
Oriol Vinyals
N. Heess
Peter W. Battaglia
152
69
0
07 May 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural
  Networks for Environmental Awareness
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness
Nikolai Smolyanskiy
A. Kamenev
Jeffrey Smith
Stan Birchfield
343
233
0
07 May 2017
On Improving Deep Reinforcement Learning for POMDPs
On Improving Deep Reinforcement Learning for POMDPs
Q. Hu
Xin Li
Pascal Poupart
Guanghui Miao
293
134
0
26 Apr 2017
Inception Recurrent Convolutional Neural Network for Object Recognition
Inception Recurrent Convolutional Neural Network for Object Recognition
Md. Zahangir Alom
Mahmudul Hasan
C. Yakopcic
T. Taha
158
91
0
25 Apr 2017
The Reactor: A fast and sample-efficient Actor-Critic agent for
  Reinforcement Learning
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A. Gruslys
Will Dabney
M. G. Azar
Bilal Piot
Marc G. Bellemare
Rémi Munos
144
59
0
15 Apr 2017
Virtual to Real Reinforcement Learning for Autonomous Driving
Virtual to Real Reinforcement Learning for Autonomous Driving
Xinlei Pan
Yurong You
Ziyan Wang
Cewu Lu
OffRL
362
350
0
13 Apr 2017
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
I. Popov
N. Heess
Timothy Lillicrap
Agrim Gupta
Gabriel Barth-Maron
Matej Vecerík
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
OffRL
281
274
0
10 Apr 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
273
373
0
10 Apr 2017
Learned Watershed: End-to-End Learning of Seeded Segmentation
Learned Watershed: End-to-End Learning of Seeded Segmentation
Steffen Wolf
Lukas Schott
Ullrich Kothe
Fred Hamprecht
185
37
0
07 Apr 2017
Learning Visual Servoing with Deep Features and Fitted Q-Iteration
Learning Visual Servoing with Deep Features and Fitted Q-Iteration
Alex X. Lee
Sergey Levine
Pieter Abbeel
SSL
133
74
0
31 Mar 2017
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level
  Coordination in Learning to Play StarCraft Combat Games
Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games
Peng Peng
Ying Wen
Yaodong Yang
Quan Yuan
Zhenkun Tang
Haitao Long
Jun Wang
344
357
0
29 Mar 2017
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Deep Deterministic Policy Gradient for Urban Traffic Light Control
Noe Casas
123
180
0
27 Mar 2017
InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations
InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations
Yunzhu Li
Jiaming Song
Stefano Ermon
160
44
0
26 Mar 2017
Combining Neural Networks and Tree Search for Task and Motion Planning
  in Challenging Environments
Combining Neural Networks and Tree Search for Task and Motion Planning in Challenging Environments
Chris Paxton
Vasumathi Raman
Gregory Hager
Marin Kobilarov
177
127
0
22 Mar 2017
Previous
123...93949596
Next