arXiv: 1510.09142
Learning Continuous Control Policies by Stochastic Value Gradients


30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez

Papers citing "Learning Continuous Control Policies by Stochastic Value Gradients"

50 / 329 papers shown
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Martin Riedmiller
Jost Tobias Springenberg
Roland Hafner
N. Heess
OffRL
26
17
0
23 Aug 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
115
0
19 Aug 2021
A general class of surrogate functions for stable and efficient reinforcement learning
Sharan Vaswani
Olivier Bachem
Simone Totaro
Robert Mueller
Shivam Garg
M. Geist
Marlos C. Machado
Pablo Samuel Castro
Nicolas Le Roux
OffRL
32
15
0
12 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
31
38
0
31 Jul 2021
High-Accuracy Model-Based Reinforcement Learning, a Survey
Aske Plaat
W. Kosters
Mike Preuss
OffRL
27
37
0
17 Jul 2021
A Unified Off-Policy Evaluation Approach for General Value Function
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
18
2
0
06 Jul 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
30
141
0
01 Jul 2021
Mix and Mask Actor-Critic Methods
Dom Huh
21
1
0
24 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
32
29
0
16 Jun 2021
Bayesian Bellman Operators
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
42
15
0
09 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
68
651
0
03 Jun 2021
Hierarchical Representation Learning for Markov Decision Processes
Lorenzo Steccanella
Simone Totaro
Anders Jonsson
28
4
0
03 Jun 2021
From Motor Control to Team Play in Simulated Humanoid Football
Siqi Liu
Guy Lever
Zhe Wang
J. Merel
S. M. Ali Eslami
...
Tuomas Haarnoja
Brendan D. Tracey
K. Tuyls
T. Graepel
N. Heess
31
130
0
25 May 2021
Acting upon Imagination: when to trust imagined trajectories in model based reinforcement learning
Adrian Remonda
Eduardo E. Veas
Granit Luzhnica
22
3
0
12 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
27
2
0
08 May 2021
UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
Denis Belomestny
I. Levin
Eric Moulines
A. Naumov
S. Samsonov
V. Zorina
OffRL
16
0
0
05 May 2021
Discovering Diverse Athletic Jumping Strategies
Zhiqi Yin
Zeshi Yang
M. van de Panne
KangKang Yin
42
46
0
02 May 2021
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks
Omid Esrafilian
Harald Bayerlein
David Gesbert
24
6
0
21 Apr 2021
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
Luis Pineda
Brandon Amos
Amy Zhang
Nathan Lambert
Roberto Calandra
OffRL
30
46
0
20 Apr 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Wenzhen Huang
Qiyue Yin
Junge Zhang
Kaiqi Huang
31
4
0
09 Apr 2021
Scalable Visual Attribute Extraction through Hidden Layers of a Residual ConvNet
Andres Baloian
Garrett A. Warnell
J. M. Saavedra
FAtt
30
1
0
31 Mar 2021
Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning
Edward W. Hill
M. Bardoscia
A. Turrell
6
24
0
31 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
21
2
0
26 Mar 2021
Adversarial Imitation Learning with Trajectorial Augmentation and Correction
Dafni Antotsiou
C. Ciliberto
Tae-Kyun Kim
14
10
0
25 Mar 2021
Discriminator Augmented Model-Based Reinforcement Learning
Behzad Haghgoo
Allan Zhou
Archit Sharma
Chelsea Finn
OffRL
6
3
0
24 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
50
176
0
10 Mar 2021
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
19
6
0
09 Mar 2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
N. Lazić
Dong Yin
Yasin Abbasi-Yadkori
Csaba Szepesvári
OffRL
6
17
0
25 Feb 2021
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model
Yang Guan
Jingliang Duan
Shengbo Eben Li
Jie Li
Jianyu Chen
B. Cheng
OffRL
18
12
0
23 Feb 2021
Decaying Clipping Range in Proximal Policy Optimization
Mónika Farsang
Luca Szegletes
OffRL
18
4
0
20 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
25
12
0
09 Feb 2021
OffCon$^3$: What is state of the art anyway?
Philip J. Ball
Stephen J. Roberts
OffRL
23
8
0
27 Jan 2021
Portfolio Optimization with 2D Relative-Attentional Gated Transformer
Tae Wan Kim
Matloob Khushi
AI4TS
36
12
0
27 Dec 2020
Learning How to Solve Bubble Ball
Hotae Lee
Monimoy Bujarbaruah
Francesco Borrelli
AI4CE
13
0
0
20 Nov 2020
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Thomas Mesnard
T. Weber
Fabio Viola
S. Thakoor
Alaa Saade
...
A. Guez
Éric Moulines
Marcus Hutter
Lars Buesing
Rémi Munos
CML
OffRL
14
55
0
18 Nov 2020
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Velickovic
T. Weber
OffRL
30
65
0
08 Nov 2020
Bayes-Adaptive Deep Model-Based Policy Optimisation
Tai Hoang
Ngo Anh Vien
BDL
24
1
0
29 Oct 2020
Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations
Kalesha Bullard
Franziska Meier
Douwe Kiela
Joelle Pineau
Jakob N. Foerster
31
17
0
29 Oct 2020
Generative Temporal Difference Learning for Infinite-Horizon Prediction
Michael Janner
Igor Mordatch
Sergey Levine
AI4CE
18
34
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
71
17
0
23 Oct 2020
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
63
21
0
20 Oct 2020
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
22
15
0
12 Oct 2020
Active Feature Acquisition with Generative Surrogate Models
Yang Li
Junier B. Oliva
RALM
TPM
30
37
0
06 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
23
7
0
04 Oct 2020
Attractor Selection in Nonlinear Energy Harvesting Using Deep Reinforcement Learning
Xue-She Wang
B. Mann
13
23
0
03 Oct 2020
Importance Weighted Policy Learning and Adaptation
Alexandre Galashov
Jakub Sygnowski
Guillaume Desjardins
Jan Humplik
Leonard Hasenclever
Rae Jeong
Yee Whye Teh
N. Heess
OffRL
21
1
0
10 Sep 2020
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control
V. M. Alvarez
R. Rosca
Cristian G. Falcutescu
11
10
0
09 Sep 2020
A Hybrid PAC Reinforcement Learning Algorithm
A. Zehfroosh
H. Tanner
20
0
0
05 Sep 2020
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
24
71
0
28 Aug 2020