Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.09142
Cited By
Learning Continuous Control Policies by Stochastic Value Gradients
30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Continuous Control Policies by Stochastic Value Gradients"
50 / 329 papers shown
Title
Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains
Jingwei Zhang
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
A. Abdolmaleki
Dushyant Rao
N. Heess
Martin Riedmiller
39
5
0
24 Feb 2023
Stochastic Generative Flow Networks
L. Pan
Dinghuai Zhang
Moksh Jain
Longbo Huang
Yoshua Bengio
BDL
49
31
0
19 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
DiSProD: Differentiable Symbolic Propagation of Distributions for Planning
Palash Chatterjee
Ashutosh Chapagain
Weizhe (Wesley) Chen
Roni Khardon
11
1
0
03 Feb 2023
Extreme Q-Learning: MaxEnt RL without Entropy
Divyansh Garg
Joey Hejna
M. Geist
Stefano Ermon
OffRL
41
66
0
05 Jan 2023
Latent Variable Representation for Reinforcement Learning
Tongzheng Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
33
10
0
17 Dec 2022
Physics-Informed Model-Based Reinforcement Learning
Adithya Ramesh
Balaraman Ravindran
27
10
0
05 Dec 2022
Predictive Sampling: Real-time Behaviour Synthesis with MuJoCo
Taylor A. Howell
Nimrod Gileadi
S. Tunyasuvunakool
Kevin Zakka
Tom Erez
Yuval Tassa
30
76
0
01 Dec 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
28
12
0
04 Nov 2022
Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
Siddharth Nayak
Kenneth M. F. Choi
Wenqi Ding
Sydney I. Dolan
Karthik Gopalakrishnan
H. Balakrishnan
17
29
0
03 Nov 2022
Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification
Yang Guan
Liye Tang
Chuanxiao Li
Shengbo Eben Li
Yangang Ren
Junqing Wei
Bo Zhang
Ke Li
23
0
0
19 Oct 2022
Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm
Ashish Kumar Jayant
S. Bhatnagar
OffRL
21
38
0
14 Oct 2022
ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based Characters
Heyuan Yao
Zhenhua Song
Bin Chen
Libin Liu
DRL
VGen
16
41
0
12 Oct 2022
Training Efficient Controllers via Analytic Policy Gradient
Nina Wiedemann
Valentin Wüest
Antonio Loquercio
M. Müller
Dario Floreano
Davide Scaramuzza
OffRL
28
18
0
26 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
48
25
0
18 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
Shen Zhang
20
6
0
16 Sep 2022
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
35
3
0
24 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
28
3
0
23 Aug 2022
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktaschel
Edward Grefenstette
Yuandong Tian
OffRL
24
37
0
22 Aug 2022
MPC-based Imitation Learning for Safe and Human-like Autonomous Driving
F. S. Acerbo
Jan Swevers
Tinne Tuytelaars
Tong Duy Son
16
0
0
24 Jun 2022
Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang
Rui Zhao
Ziming Zhang
Yang Gao
26
1
0
22 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
53
101
0
19 Jun 2022
Autonomous Platoon Control with Integrated Deep Reinforcement Learning and Dynamic Programming
Tong Liu
Lei Lei
Kan Zheng
Kuan Zhang
13
26
0
15 Jun 2022
Open-Ended Learning Strategies for Learning Complex Locomotion Skills
Fangqin Zhou
Joaquin Vanschoren
36
2
0
14 Jun 2022
Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives
Dongwon Son
Myungsin Kim
Jaecheol Sim
Wonsik Shin
24
1
0
12 Jun 2022
A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud
Siqiao Xue
Chao Qu
Xiaoming Shi
Cong Liao
Shiyi Zhu
...
Yun Hu
Lei Lei
Yang Zheng
Jianguo Li
James Y. Zhang
62
38
0
31 May 2022
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation
Qingfeng Lan
Yangchen Pan
Jun Luo
A. R. Mahmood
OffRL
36
8
0
22 May 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
27
7
0
20 Apr 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
24
86
0
14 Apr 2022
Revisiting Model-based Value Expansion
Daniel Palenicek
M. Lutter
Jan Peters
27
2
0
28 Mar 2022
Investigating Compounding Prediction Errors in Learned Dynamics Models
Nathan Lambert
K. Pister
Roberto Calandra
AI4CE
22
27
0
17 Mar 2022
Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent Coordination
Derrik E. Asher
Anjon Basak
Rolando Fernandez
P. Sharma
Erin G. Zaroukian
...
Thomas Mahre
Gerardo Galindo
Luke Frerichs
J. Rogers
J. Fossaceca
AI4CE
11
5
0
17 Mar 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
GrASP: Gradient-Based Affordance Selection for Planning
Vivek Veeriah
Zeyu Zheng
Richard L. Lewis
Satinder Singh
25
4
0
08 Feb 2022
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
27
2
0
04 Feb 2022
Tutorial on amortized optimization
Brandon Amos
OffRL
78
43
0
01 Feb 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Xiaoyu Tan
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
30
17
0
29 Jan 2022
Joint Differentiable Optimization and Verification for Certified Reinforcement Learning
Yixuan Wang
S. Zhan
Zhilu Wang
Chao Huang
Zhaoran Wang
Zhuoran Yang
Qi Zhu
23
22
0
28 Jan 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
44
2
0
21 Jan 2022
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
27
30
0
16 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
16
18
0
01 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
21
99
0
19 Nov 2021
Kernel-based diffusion approximated Markov decision processes for autonomous navigation and control on unstructured terrains
Junhong Xu
Kai-Li Yin
Zheng Chen
Jason M. Gregory
Ethan Stump
Lantao Liu
37
2
0
16 Nov 2021
Physics-informed neural networks via stochastic Hamiltonian dynamics learning
Minh Nguyen
Chandrajit Bajaj
21
1
0
15 Nov 2021
Procedural Generalization by Planning with Self-Supervised World Models
Ankesh Anand
Jacob Walker
Yazhe Li
Eszter Vértes
Julian Schrittwieser
Sherjil Ozair
T. Weber
Jessica B. Hamrick
31
31
0
02 Nov 2021
Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning
John Harwell
Angel Sylvester
Aleksi Tukiainen
Enrique Munoz de Cote
28
4
0
23 Oct 2021
Gradient-Based Mixed Planning with Symbolic and Numeric Action Parameters
Kebing Jin
H. Zhuo
Zhanhao Xiao
Hai Wan
Subbarao Kambhampati
44
9
0
19 Oct 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
57
26
0
29 Sep 2021
Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients
Oliver Scheel
Luca Bergamini
Maciej Wołczyk
Bla.zej Osiñski
Peter Ondruska
37
104
0
27 Sep 2021
Previous
1
2
3
4
5
6
7
Next