ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08649
  4. Cited By
Exploring Model-based Planning with Policy Networks

Exploring Model-based Planning with Policy Networks

20 June 2019
Tingwu Wang
Jimmy Ba
ArXivPDFHTML

Papers citing "Exploring Model-based Planning with Policy Networks"

39 / 39 papers shown
Title
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
70
2
0
04 Feb 2025
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
27
28
0
14 Aug 2023
A Simple Decentralized Cross-Entropy Method
A Simple Decentralized Cross-Entropy Method
Zichen Zhang
Jun Jin
Martin Jägersand
Jun-Jie Luo
Dale Schuurmans
13
8
0
16 Dec 2022
Efficient Planning in a Compact Latent Action Space
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktaschel
Edward Grefenstette
Yuandong Tian
OffRL
16
36
0
22 Aug 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
41
101
0
19 Jun 2022
Integrating Symmetry into Differentiable Planning with Steerable
  Convolutions
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin G. Walters
Lawson L. S. Wong
20
7
0
08 Jun 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
36
219
0
09 Mar 2022
Inference of Affordances and Active Motor Control in Simulated Agents
Inference of Affordances and Active Motor Control in Simulated Agents
Fedor Scholz
Christian Gumbsch
S. Otte
Martin Volker Butz
AI4CE
16
5
0
23 Feb 2022
Reinforcement Learning from Demonstrations by Novel Interactive Expert
  and Application to Automatic Berthing Control Systems for Unmanned Surface
  Vessel
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel
Haoran Zhang
Chenkun Yin
Yanxin Zhang
S. Jin
Zhenxuan Li
OffRL
11
3
0
23 Feb 2022
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Deconstructing the Inductive Biases of Hamiltonian Neural Networks
Nate Gruver
Marc Finzi
Samuel Stanton
A. Wilson
AI4CE
13
39
0
10 Feb 2022
Tutorial on amortized optimization
Tutorial on amortized optimization
Brandon Amos
OffRL
75
43
0
01 Feb 2022
Generative Planning for Temporally Coordinated Exploration in
  Reinforcement Learning
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei-ping Xu
Haonan Yu
25
10
0
24 Jan 2022
Sample-Efficient Reinforcement Learning via Conservative Model-Based
  Actor-Critic
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
9
30
0
16 Dec 2021
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for
  Model-Based Reinforcement Learning
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning
Kevin Huang
Sahin Lale
Ugo Rosolia
Yuanyuan Shi
Anima Anandkumar
11
8
0
14 Dec 2021
Residual Pathway Priors for Soft Equivariance Constraints
Residual Pathway Priors for Soft Equivariance Constraints
Marc Finzi
Gregory W. Benton
A. Wilson
BDL
UQCV
24
50
0
02 Dec 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement
  Learning
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
34
16
0
07 Oct 2021
Evaluating model-based planning and planner amortization for continuous
  control
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
66
643
0
03 Jun 2021
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
Luis Pineda
Brandon Amos
Amy Zhang
Nathan Lambert
Roberto Calandra
OffRL
8
46
0
20 Apr 2021
Actionable Models: Unsupervised Offline Reinforcement Learning of
  Robotic Skills
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
Yevgen Chebotar
Karol Hausman
Yao Lu
Ted Xiao
Dmitry Kalashnikov
...
A. Irpan
Benjamin Eysenbach
Ryan C. Julian
Chelsea Finn
Sergey Levine
SSL
OffRL
18
145
0
15 Apr 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
16
2
0
26 Mar 2021
Where to go next: Learning a Subgoal Recommendation Policy for
  Navigation Among Pedestrians
Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians
B. Brito
Michael Everett
Jonathan P. How
Javier Alonso-Mora
76
57
0
25 Feb 2021
Model-Invariant State Abstractions for Model-Based Reinforcement
  Learning
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
19
24
0
19 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous
  Agents via Personalized Simulators
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
Model-based Policy Optimization with Unsupervised Model Adaptation
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
30
27
0
19 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
30
809
0
05 Oct 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
24
45
0
23 Aug 2020
Sample-efficient Cross-Entropy Method for Real-time Planning
Sample-efficient Cross-Entropy Method for Real-time Planning
Cristina Pinneri
Shambhuraj Sawant
Sebastian Blaes
Jan Achterhold
Joerg Stueckler
Michal Rolínek
Georg Martius
16
98
0
14 Aug 2020
Control as Hybrid Inference
Control as Hybrid Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
19
9
0
11 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
  Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
11
199
0
09 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control
Information Theoretic Regret Bounds for Online Nonlinear Control
Sham Kakade
A. Krishnamurthy
Kendall Lowrey
Motoya Ohnishi
Wen Sun
31
117
0
22 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
25
82
0
15 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization
  without Compounding Errors
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
9
4
0
08 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
27
86
0
16 May 2020
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Delay-Aware Model-Based Reinforcement Learning for Continuous Control
Baiming Chen
Mengdi Xu
Liang-Sheng Li
Ding Zhao
OffRL
29
63
0
11 May 2020
Model-Predictive Control via Cross-Entropy and Gradient-Based
  Optimization
Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization
Homanga Bharadhwaj
Kevin Xie
Florian Shkurti
11
49
0
19 Apr 2020
The Differentiable Cross-Entropy Method
The Differentiable Cross-Entropy Method
Brandon Amos
Denis Yarats
21
54
0
27 Sep 2019
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
131
928
0
07 Jul 2017
1