ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.09142
  4. Cited By
Learning Continuous Control Policies by Stochastic Value Gradients

Learning Continuous Control Policies by Stochastic Value Gradients

30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
ArXivPDFHTML

Papers citing "Learning Continuous Control Policies by Stochastic Value Gradients"

50 / 329 papers shown
Title
Model-Based Reinforcement Learning via Meta-Policy Optimization
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
30
225
0
14 Sep 2018
Model-Based Regularization for Deep Reinforcement Learning with
  Transcoder Networks
Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks
Felix Leibfried
Peter Vrancx
OffRL
6
7
0
06 Sep 2018
SOLAR: Deep Structured Representations for Model-Based Reinforcement
  Learning
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
Marvin Zhang
Sharad Vikram
Laura M. Smith
Pieter Abbeel
Matthew J. Johnson
Sergey Levine
OffRL
23
41
0
28 Aug 2018
Structured Neural Network Dynamics for Model-based Control
Structured Neural Network Dynamics for Model-based Control
Alexander Broad
Ian Abraham
Todd D. Murphey
B. Argall
16
5
0
03 Aug 2018
Deterministic Policy Gradients With General State Transitions
Deterministic Policy Gradients With General State Transitions
Qingpeng Cai
Ling Pan
Pingzhong Tang
OffRL
19
2
0
10 Jul 2018
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value
  Expansion
Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Jacob Buckman
Danijar Hafner
George Tucker
E. Brevdo
Honglak Lee
22
329
0
04 Jul 2018
Guided evolutionary strategies: Augmenting random search with surrogate
  gradients
Guided evolutionary strategies: Augmenting random search with surrogate gradients
Niru Maheswaranathan
Luke Metz
George Tucker
Dami Choi
Jascha Narain Sohl-Dickstein
25
20
0
26 Jun 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
471
0
14 Jun 2018
Boredom-driven curious learning by Homeo-Heterostatic Value Gradients
Boredom-driven curious learning by Homeo-Heterostatic Value Gradients
Yen Yu
A. Chang
Ryota Kanai
12
9
0
05 Jun 2018
Graph networks as learnable physics engines for inference and control
Graph networks as learnable physics engines for inference and control
Alvaro Sanchez-Gonzalez
N. Heess
Jost Tobias Springenberg
J. Merel
Martin Riedmiller
R. Hadsell
Peter W. Battaglia
GNN
AI4CE
PINN
OCL
42
595
0
04 Jun 2018
Learning Real-World Robot Policies by Dreaming
Learning Real-World Robot Policies by Dreaming
A. Piergiovanni
Alan Wu
Michael S. Ryoo
29
31
0
20 May 2018
Episodic Memory Deep Q-Networks
Episodic Memory Deep Q-Networks
Zichuan Lin
Tianqi Zhao
Guangwen Yang
Lintao Zhang
OffRL
24
85
0
19 May 2018
Deep Reinforcement Learning to Acquire Navigation Skills for
  Wheel-Legged Robots in Complex Environments
Deep Reinforcement Learning to Acquire Navigation Skills for Wheel-Legged Robots in Complex Environments
Xi Chen
Ali Ghadirzadeh
John Folkesson
Patric Jensfelt
22
44
0
27 Apr 2018
Lipschitz Continuity in Model-based Reinforcement Learning
Lipschitz Continuity in Model-based Reinforcement Learning
Kavosh Asadi
Dipendra Kumar Misra
Michael L. Littman
KELM
43
150
0
19 Apr 2018
The Limits and Potentials of Deep Learning for Robotics
The Limits and Potentials of Deep Learning for Robotics
Niko Sünderhauf
Oliver Brock
Walter J. Scheirer
R. Hadsell
Dieter Fox
...
B. Upcroft
Pieter Abbeel
Wolfram Burgard
Michael Milford
Peter Corke
17
522
0
18 Apr 2018
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
23
68
0
02 Apr 2018
Synthesizing Neural Network Controllers with Probabilistic Model based
  Reinforcement Learning
Synthesizing Neural Network Controllers with Probabilistic Model based Reinforcement Learning
J. A. G. Higuera
David Meger
Gregory Dudek
BDL
22
39
0
06 Mar 2018
Smoothed Action Value Functions for Learning Gaussian Policies
Smoothed Action Value Functions for Learning Gaussian Policies
Ofir Nachum
Mohammad Norouzi
George Tucker
Dale Schuurmans
10
28
0
06 Mar 2018
Model-Based Value Estimation for Efficient Model-Free Reinforcement
  Learning
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
13
317
0
28 Feb 2018
Model-Ensemble Trust-Region Policy Optimization
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
9
449
0
28 Feb 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
44
445
0
28 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
30
126
0
27 Feb 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
34
316
0
26 Feb 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
66
238
0
25 Feb 2018
Fourier Policy Gradients
Fourier Policy Gradients
M. Fellows
K. Ciosek
Shimon Whiteson
35
15
0
19 Feb 2018
Learning Parametric Closed-Loop Policies for Markov Potential Games
Learning Parametric Closed-Loop Policies for Markov Potential Games
Sergio Valcarcel Macua
Javier Zazo
S. Zazo
23
46
0
03 Feb 2018
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With
  Expert Demonstrations
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations
Xiaoqin Zhang
Huimin Ma
OffRL
35
38
0
31 Jan 2018
Model-Based Action Exploration for Learning Dynamic Motion Skills
Model-Based Action Exploration for Learning Dynamic Motion Skills
Glen Berseth
M. van de Panne
33
0
0
11 Jan 2018
Expected Policy Gradients for Reinforcement Learning
Expected Policy Gradients for Reinforcement Learning
K. Ciosek
Shimon Whiteson
50
51
0
10 Jan 2018
Building Generalizable Agents with a Realistic and Rich 3D Environment
Building Generalizable Agents with a Realistic and Rich 3D Environment
Yi Wu
Yuxin Wu
Georgia Gkioxari
Yuandong Tian
3DV
62
338
0
07 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
28
8,170
0
04 Jan 2018
Deterministic Policy Optimization by Combining Pathwise and Score
  Function Estimators for Discrete Action Spaces
Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces
Daniel Levy
Stefano Ermon
21
4
0
21 Nov 2017
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Marc Lanctot
V. Zambaldi
A. Gruslys
Angeliki Lazaridou
K. Tuyls
Julien Perolat
David Silver
T. Graepel
50
628
0
02 Nov 2017
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep
  Reinforcement Learning
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
Gregory Farquhar
Tim Rocktaschel
Maximilian Igl
Shimon Whiteson
OffRL
25
71
0
31 Oct 2017
Flow: A Modular Learning Framework for Mixed Autonomy Traffic
Flow: A Modular Learning Framework for Mixed Autonomy Traffic
Cathy Wu
Abdul Rahman Kreidieh
Kanaad Parvate
Eugene Vinitsky
Alexandre M. Bayen
13
153
0
16 Oct 2017
Expanding Motor Skills through Relay Neural Networks
Expanding Motor Skills through Relay Neural Networks
Visak C. V. Kumar
Sehoon Ha
Chenxi Liu
20
2
0
22 Sep 2017
MBMF: Model-Based Priors for Model-Free Reinforcement Learning
MBMF: Model-Based Priors for Model-Free Reinforcement Learning
Somil Bansal
Roberto Calandra
Kurtland Chua
Sergey Levine
Claire Tomlin
OffRL
11
36
0
10 Sep 2017
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
65
2,780
0
19 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with
  Model-Free Fine-Tuning
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
14
965
0
08 Aug 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
43
411
0
26 Jul 2017
Learning model-based planning from scratch
Learning model-based planning from scratch
Razvan Pascanu
Yujia Li
Oriol Vinyals
N. Heess
Lars Buesing
S. Racanière
David P. Reichert
T. Weber
Daan Wierstra
Peter W. Battaglia
LM&Ro
42
97
0
19 Jul 2017
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical
  Systems
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems
James Harrison
Animesh Garg
Boris Ivanovic
Yuke Zhu
Silvio Savarese
Li Fei-Fei
Marco Pavone
13
25
0
15 Jul 2017
Value Prediction Network
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
29
331
0
11 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
Expected Policy Gradients
Expected Policy Gradients
K. Ciosek
Shimon Whiteson
19
57
0
15 Jun 2017
Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known
  Dynamics
Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics
Tomoki Nishi
Prashant Doshi
Michael R. James
Danil Prokhorov
22
5
0
04 Jun 2017
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient
  Estimation for Deep Reinforcement Learning
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Bernhard Schölkopf
Sergey Levine
OffRL
23
164
0
01 Jun 2017
Non-Markovian Control with Gated End-to-End Memory Policy Networks
Non-Markovian Control with Gated End-to-End Memory Policy Networks
J. Perez
T. Silander
OffRL
15
6
0
31 May 2017
Guide Actor-Critic for Continuous Control
Guide Actor-Critic for Continuous Control
Voot Tangkaratt
A. Abdolmaleki
Masashi Sugiyama
24
17
0
22 May 2017
Metacontrol for Adaptive Imagination-Based Optimization
Metacontrol for Adaptive Imagination-Based Optimization
Jessica B. Hamrick
A. J. Ballard
Razvan Pascanu
Oriol Vinyals
N. Heess
Peter W. Battaglia
27
69
0
07 May 2017
Previous
1234567
Next