ResearchTrend.AI
Continuous Deep Q-Learning with Model-based Acceleration


2 March 2016
S. Gu
Timothy Lillicrap
Ilya Sutskever
Sergey Levine

Papers citing "Continuous Deep Q-Learning with Model-based Acceleration"

35 / 185 papers shown
Synthesizing Neural Network Controllers with Probabilistic Model based Reinforcement Learning
J. A. G. Higuera
David Meger
Gregory Dudek
BDL
22
39
0
06 Mar 2018
Deep Reinforcement Learning for Sponsored Search Real-time Bidding
Jun Zhao
Guang Qiu
Ziyu Guan
Wei Zhao
Xiaofei He
21
81
0
01 Mar 2018
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
13
317
0
28 Feb 2018
Investigating Human Priors for Playing Video Games
Rachit Dubey
Pulkit Agrawal
Deepak Pathak
Thomas Griffiths
Alexei A. Efros
OffRL
35
144
0
28 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
30
126
0
27 Feb 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
34
316
0
26 Feb 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
66
238
0
25 Feb 2018
Deep Reinforcement Learning using Capsules in Advanced Game Environments
Per-Arne Andersen
18
16
0
29 Jan 2018
Experience-driven Networking: A Deep Reinforcement Learning based Approach
Zhiyuan Xu
Jian Tang
Jingsong Meng
Weiyi Zhang
Yanzhi Wang
C. Liu
Dejun Yang
OffRL
29
359
0
17 Jan 2018
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
50
51
0
10 Jan 2018
Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
Stephen Tu
Benjamin Recht
OffRL
37
130
0
22 Dec 2017
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
22
260
0
24 Nov 2017
Data Augmentation Generative Adversarial Networks
Antreas Antoniou
Amos Storkey
Harrison Edwards
MedIm
GAN
92
1,067
0
12 Nov 2017
Applications of Deep Learning and Reinforcement Learning to Biological Data
M. S. M. Mahmud
M. S. Kaiser
Amir Hussain
S. Vassanelli
OffRL
AI4CE
41
641
0
10 Nov 2017
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
Gregory Farquhar
Tim Rocktäschel
Maximilian Igl
Shimon Whiteson
OffRL
25
71
0
31 Oct 2017
Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds
Zhiyu Lin
Brent Harrison
A. Keech
Mark O. Riedl
20
37
0
12 Sep 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
54
551
0
19 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
107
2,297
0
05 Jul 2017
Learning End-to-end Multimodal Sensor Policies for Autonomous Navigation
Guan-Horng Liu
Avinash Siravuru
Sai P. Selvaraj
Manuela Veloso
George Kantor
13
69
0
30 May 2017
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation
I. Popov
N. Heess
Timothy Lillicrap
Roland Hafner
Gabriel Barth-Maron
Matej Vecerík
Thomas Lampe
Yuval Tassa
Tom Erez
Martin Riedmiller
OffRL
31
263
0
10 Apr 2017
Combining Neural Networks and Tree Search for Task and Motion Planning in Challenging Environments
Chris Paxton
Vasumathi Raman
Gregory Hager
Marin Kobilarov
24
123
0
22 Mar 2017
Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning
Yevgen Chebotar
Karol Hausman
Marvin Zhang
Gaurav Sukhatme
S. Schaal
Sergey Levine
37
159
0
08 Mar 2017
Deep Reinforcement Learning for Robotic Manipulation-The state of the art
S. Amarjyoti
23
65
0
31 Jan 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,505
0
25 Jan 2017
A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games
Felix Leibfried
Nate Kushman
Katja Hofmann
46
43
0
21 Nov 2016
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Sergey Levine
OffRL
BDL
32
343
0
07 Nov 2016
Towards Lifelong Self-Supervision: A Deep Learning Direction for Robotics
J. M. Wong
27
11
0
01 Nov 2016
Learning and Transfer of Modulated Locomotor Controllers
N. Heess
Greg Wayne
Yuval Tassa
Timothy Lillicrap
Martin Riedmiller
David Silver
35
207
0
17 Oct 2016
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
39
532
0
13 Oct 2016
Towards Cognitive Exploration through Deep Reinforcement Learning for Mobile Robots
L. Tai
Ming Liu
21
103
0
06 Oct 2016
Input Convex Neural Networks
Brandon Amos
Lei Xu
J. Zico Kolter
187
603
0
22 Sep 2016
Towards Deep Symbolic Reinforcement Learning
M. Garnelo
Kai Arulkumaran
Murray Shanahan
30
225
0
18 Sep 2016
Actor-critic versus direct policy search: a comparison based on sample complexity
Arnaud de Froissard de Broissia
Olivier Sigaud
28
12
0
29 Jun 2016
Faster Eigenvector Computation via Shift-and-Invert Preconditioning
Dan Garber
Laurent Dinh
Chi Jin
Jascha Narain Sohl-Dickstein
Samy Bengio
Praneeth Netrapalli
Aaron Sidford
97
3,649
0
26 May 2016