Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1512.04455
Cited By
Memory-based control with recurrent neural networks
14 December 2015
N. Heess
Jonathan J. Hunt
Timothy Lillicrap
David Silver
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Memory-based control with recurrent neural networks"
29 / 129 papers shown
Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
Thanh Thi Nguyen
Ngoc Duy Nguyen
S. Nahavandi
294
928
0
31 Dec 2018
Learning to Communicate: A Machine Learning Framework for Heterogeneous Multi-Agent Robotic Systems
Hyung-Jin Yoon
Huaiyu Chen
Kehan Long
Heling Zhang
Aditya Gahlawat
Donghwan Lee
N. Hovakimyan
95
6
0
13 Dec 2018
Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process
Hyung-Jin Yoon
Donghwan Lee
N. Hovakimyan
178
10
0
17 Sep 2018
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDL
OffRL
320
284
0
06 Jun 2018
Policy Gradients for Contextual Recommendations
Feiyang Pan
Qingpeng Cai
Pingzhong Tang
Fuzhen Zhuang
Qing He
OffRL
195
44
0
12 Feb 2018
Rover Descent: Learning to optimize by learning to navigate on prototypical loss surfaces
Louis Faury
Flavian Vasile
148
2
0
22 Jan 2018
Sim2Real View Invariant Visual Servoing by Recurrent Control
Fereshteh Sadeghi
Alexander Toshev
Eric Jang
Sergey Levine
134
102
0
20 Dec 2017
Regret Minimization for Partially Observable Deep Reinforcement Learning
Peter H. Jin
Kurt Keutzer
Sergey Levine
131
53
0
31 Oct 2017
Consequentialist conditional cooperation in social dilemmas with imperfect information
A. Peysakhovich
Adam Lerer
178
65
0
19 Oct 2017
Sim-to-Real Transfer of Robotic Control with Dynamics Randomization
Xue Bin Peng
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
527
1,527
0
18 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning
A. Kume
Eiichi Matsumoto
K. Takahashi
W. Ko
Jethro Tan
146
12
0
17 Oct 2017
Recurrent Deterministic Policy Gradient Method for Bipedal Locomotion on Rough Terrain Challenge
Doo Re Song
Chuanyu Yang
C. McGreavy
Zhibin Li
343
32
0
08 Oct 2017
Guided Deep Reinforcement Learning for Swarm Systems
Maximilian Hüttenrauch
Adrian Šošić
Gerhard Neumann
153
145
0
18 Sep 2017
Reinforcement Mechanism Design for e-commerce
Qingpeng Cai
Aris Filos-Ratsikas
Pingzhong Tang
Yiwei Zhang
98
4
0
25 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
389
2,830
0
19 Aug 2017
Neural Network Memory Architectures for Autonomous Robot Navigation
Steven W. Chen
Nikolay Atanasov
Arbaaz Khan
Konstantinos Karydis
Daniel D. Lee
Vijay Kumar
157
7
0
23 May 2017
Navigating Occluded Intersections with Autonomous Vehicles using Deep Reinforcement Learning
David Isele
Reza Rahimi
Akansel Cosgun
K. Subramanian
K. Fujimura
189
136
0
02 May 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
309
968
0
08 Mar 2017
Cognitive Mapping and Planning for Visual Navigation
International Journal of Computer Vision (IJCV), 2017
Saurabh Gupta
Varun Tolani
James Davidson
Sergey Levine
Rahul Sukthankar
Jitendra Malik
406
741
0
13 Feb 2017
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
360
330
0
08 Feb 2017
Imitating Driver Behavior with Generative Adversarial Networks
Alex Kuefler
Jeremy Morton
T. Wheeler
Mykel Kochenderfer
GAN
258
427
0
24 Jan 2017
Reinforcement Learning With Temporal Logic Rewards
Xiao Li
C. Vasile
C. Belta
241
244
0
11 Dec 2016
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
248
28
0
28 Nov 2016
Memory Lens: How Much Memory Does an Agent Use?
Christoph Dann
Katja Hofmann
Sebastian Nowozin
OffRL
121
3
0
21 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
242
12
0
01 Nov 2016
Actor-critic versus direct policy search: a comparison based on sample complexity
Arnaud de Froissard de Broissia
Olivier Sigaud
119
13
0
29 Jun 2016
Review of state-of-the-arts in artificial intelligence with application to AI safety problem
V. Shakirov
143
10
0
11 May 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
472
1,765
0
22 Apr 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
966
14,671
0
09 Sep 2015
Previous
1
2
3