Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08253
Cited By
When to Trust Your Model: Model-Based Policy Optimization
19 June 2019
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"When to Trust Your Model: Model-Based Policy Optimization"
50 / 246 papers shown
Title
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
49
2
0
21 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
35
100
0
11 Jan 2022
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
32
4
0
07 Jan 2022
Multiagent Model-based Credit Assignment for Continuous Control
Dongge Han
Chris Xiaoxuan Lu
Tomasz P. Michalak
Michael Wooldridge
27
5
0
27 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
27
30
0
16 Dec 2021
CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning
Kevin Huang
Sahin Lale
Ugo Rosolia
Yuanyuan Shi
Anima Anandkumar
21
8
0
14 Dec 2021
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
41
8
0
13 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
16
1
0
06 Dec 2021
Residual Pathway Priors for Soft Equivariance Constraints
Marc Finzi
Gregory W. Benton
A. Wilson
BDL
UQCV
24
54
0
02 Dec 2021
Sample Efficient Imitation Learning via Reward Function Trained in Advance
Lihua Zhang
30
1
0
23 Nov 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics
Ingmar Schubert
Danny Driess
Ozgur S. Oguz
Marc Toussaint
OffRL
24
1
0
15 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
18
9
0
10 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
19
21
0
09 Nov 2021
Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning
Utkarsh Aashu Mishra
Soumya R. Samineni
Prakhar Goel
Chandravaran Kunjeti
Himanshu Lodha
Aman Singh
Aditya Sagi
S. Bhatnagar
Shishir Kolathaya
29
3
0
04 Nov 2021
Learning Robust Controllers Via Probabilistic Model-Based Policy Search
V. Charvet
B. S. Jensen
R. Murray-Smith
19
2
0
26 Oct 2021
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks
Giacomo Arcieri
David Wölfle
Eleni Chatzi
OffRL
27
5
0
25 Oct 2021
Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL
Aarush Gupta
30
0
0
23 Oct 2021
Safe Reinforcement Learning Using Robust Control Barrier Functions
Y. Emam
Gennaro Notomista
Paul Glotfelter
Z. Kira
M. Egerstedt
OffRL
24
39
0
11 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
47
17
0
07 Oct 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
32
15
0
05 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
13
105
0
05 Oct 2021
Solving the Real Robot Challenge using Deep Reinforcement Learning
Robert McCarthy
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
S. Redmond
29
11
0
30 Sep 2021
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
57
26
0
29 Sep 2021
Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control
Wanpeng Zhang
Xiaoyan Cao
Yaowen Yao
Zhicheng An
Xi Xiao
Dijun Luo
OffRL
38
18
0
26 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
31
38
0
31 Jul 2021
Few-shot Language Coordination by Modeling Theory of Mind
Hao Zhu
Graham Neubig
Yonatan Bisk
19
35
0
12 Jul 2021
IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control
Xiaoyan Cao
Yaowen Yao
Lanqing Li
Wanpeng Zhang
Zhicheng An
...
Li Xiao
Shihui Guo
Xiaoyu Cao
Meihong Wu
Dijun Luo
11
19
0
06 Jul 2021
Supervised Off-Policy Ranking
Yue Jin
Yue Zhang
Tao Qin
Xudong Zhang
Jian Yuan
Houqiang Li
Tie-Yan Liu
OffRL
32
5
0
03 Jul 2021
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
29
17
0
01 Jul 2021
Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming
Duo Xu
Faramarz Fekri
24
10
0
21 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
32
29
0
16 Jun 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRL
OnRL
44
162
0
09 Jun 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
52
35
0
06 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
71
651
0
03 Jun 2021
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks
Menghui Zhu
Minghuan Liu
Jian Shen
Zhicheng Zhang
Sheng Chen
Weinan Zhang
Deheng Ye
Yong Yu
Qiang Fu
Wei Yang
49
22
0
13 May 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
27
35
0
07 May 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
22
45
0
28 Apr 2021
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning
Luis Pineda
Brandon Amos
Amy Zhang
Nathan Lambert
Roberto Calandra
OffRL
33
46
0
20 Apr 2021
GEM: Group Enhanced Model for Learning Dynamical Control Systems
Philippe Hansen-Estruch
Wenling Shang
Lerrel Pinto
Pieter Abbeel
Stas Tiomkin
AI4CE
38
2
0
07 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Fundamental Challenges in Deep Learning for Stiff Contact Dynamics
Mihir Parmar
Mathew Halm
Michael Posa
29
36
0
29 Mar 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
21
2
0
26 Mar 2021
Learning Reactive and Predictive Differentiable Controllers for Switching Linear Dynamical Models
Saumya Saxena
A. LaGrassa
Oliver Kroemer
39
4
0
26 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
48
43
0
25 Mar 2021
Cloth Manipulation Planning on Basis of Mesh Representations with Incomplete Domain Knowledge and Voxel-to-Mesh Estimation
S. Arnold
Daisuke Tanaka
Kimitoshi Yamazaki
22
4
0
15 Mar 2021
Adapting User Interfaces with Model-based Reinforcement Learning
Kashyap Todi
G. Bailly
Luis A. Leiva
Antti Oulasvirta
19
87
0
11 Mar 2021
MAMBPO: Sample-efficient multi-robot reinforcement learning using learned world models
Daniel Willemsen
M. Coppola
Guido de Croon
24
30
0
05 Mar 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
42
68
0
23 Feb 2021
Previous
1
2
3
4
5
Next