Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.09142
Cited By
Learning Continuous Control Policies by Stochastic Value Gradients
30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Continuous Control Policies by Stochastic Value Gradients"
50 / 329 papers shown
Title
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Martin Riedmiller
Jost Tobias Springenberg
Roland Hafner
N. Heess
OffRL
26
17
0
23 Aug 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
115
0
19 Aug 2021
A general class of surrogate functions for stable and efficient reinforcement learning
Sharan Vaswani
Olivier Bachem
Simone Totaro
Robert Mueller
Shivam Garg
M. Geist
Marlos C. Machado
Pablo Samuel Castro
Nicolas Le Roux
OffRL
32
15
0
12 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
31
38
0
31 Jul 2021
High-Accuracy Model-Based Reinforcement Learning, a Survey
Aske Plaat
W. Kosters
Mike Preuss
OffRL
27
37
0
17 Jul 2021
A Unified Off-Policy Evaluation Approach for General Value Function
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
18
2
0
06 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
30
141
0
01 Jul 2021
Mix and Mask Actor-Critic Methods
Dom Huh
21
1
0
24 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
32
29
0
16 Jun 2021
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
42
15
0
09 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
68
651
0
03 Jun 2021
Hierarchical Representation Learning for Markov Decision Processes
Lorenzo Steccanella
Simone Totaro
Anders Jonsson
28
4
0
03 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
130
0
25 May 2021
Acting upon Imagination: when to trust imagined trajectories in model based reinforcement learning
Adrian Remonda
Eduardo E. Veas
Granit Luzhnica
22
3
0
12 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
27
2
0
08 May 2021
UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
Denis Belomestny
I. Levin
Eric Moulines
A. Naumov
S. Samsonov
V. Zorina
OffRL
16
0
0
05 May 2021
Discovering Diverse Athletic Jumping Strategies
Zhiqi Yin
Zeshi Yang
M. van de Panne
KangKang Yin
42
46
0
02 May 2021
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks
Omid Esrafilian
Harald Bayerlein
David Gesbert
24
6
0
21 Apr 2021
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
Luis Pineda
Brandon Amos
Amy Zhang
Nathan Lambert
Roberto Calandra
OffRL
30
46
0
20 Apr 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Wenzhen Huang
Qiyue Yin
Junge Zhang
Kaiqi Huang
31
4
0
09 Apr 2021
Scalable Visual Attribute Extraction through Hidden Layers of a Residual ConvNet
Andres Baloian
Garrett A. Warnell
J. M. Saavedra
FAtt
30
1
0
31 Mar 2021
Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning
Edward W. Hill
M. Bardoscia
A. Turrell
6
24
0
31 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
21
2
0
26 Mar 2021
Adversarial Imitation Learning with Trajectorial Augmentation and Correction
Dafni Antotsiou
C. Ciliberto
Tae-Kyun Kim
14
10
0
25 Mar 2021
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
Allan Zhou
Archit Sharma
Chelsea Finn
OffRL
6
3
0
24 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
50
176
0
10 Mar 2021
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
19
6
0
09 Mar 2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
N. Lazić
Dong Yin
Yasin Abbasi-Yadkori
Csaba Szepesvári
OffRL
6
17
0
25 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
18
12
0
23 Feb 2021
Decaying Clipping Range in Proximal Policy Optimization
Mónika Farsang
Luca Szegletes
OffRL
18
4
0
20 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
25
12
0
09 Feb 2021
OffCon
3
^3
3
: What is state of the art anyway?
Philip J. Ball
Stephen J. Roberts
OffRL
23
8
0
27 Jan 2021
Portfolio Optimization with 2D Relative-Attentional Gated Transformer
Tae Wan Kim
Matloob Khushi
AI4TS
36
12
0
27 Dec 2020
Learning How to Solve Bubble Ball
Hotae Lee
Monimoy Bujarbaruah
Francesco Borrelli
AI4CE
13
0
0
20 Nov 2020
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Thomas Mesnard
T. Weber
Fabio Viola
S. Thakoor
Alaa Saade
...
A. Guez
Éric Moulines
Marcus Hutter
Lars Buesing
Rémi Munos
CML
OffRL
14
55
0
18 Nov 2020
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
30
65
0
08 Nov 2020
Bayes-Adaptive Deep Model-Based Policy Optimisation
Tai Hoang
Ngo Anh Vien
BDL
24
1
0
29 Oct 2020
Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations
Kalesha Bullard
Franziska Meier
Douwe Kiela
Joelle Pineau
Jakob N. Foerster
31
17
0
29 Oct 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
18
34
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
71
17
0
23 Oct 2020
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
63
21
0
20 Oct 2020
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
22
15
0
12 Oct 2020
Active Feature Acquisition with Generative Surrogate Models
Yang Li
Junier B. Oliva
RALM
TPM
30
37
0
06 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
23
7
0
04 Oct 2020
Attractor Selection in Nonlinear Energy Harvesting Using Deep Reinforcement Learning
Xue-She Wang
B. Mann
13
23
0
03 Oct 2020
Importance Weighted Policy Learning and Adaptation
Alexandre Galashov
Jakub Sygnowski
Guillaume Desjardins
Jan Humplik
Leonard Hasenclever
Rae Jeong
Yee Whye Teh
N. Heess
OffRL
21
1
0
10 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control
V. M. Alvarez
R. Rosca
Cristian G. Falcutescu
11
10
0
09 Sep 2020
A Hybrid PAC Reinforcement Learning Algorithm
A. Zehfroosh
H. Tanner
20
0
0
05 Sep 2020
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
24
71
0
28 Aug 2020
Previous
1
2
3
4
5
6
7
Next