1802.10592
Cited By
Model-Ensemble Trust-Region Policy Optimization
International Conference on Learning Representations (ICLR), 2018
28 February 2018
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
Papers citing
"Model-Ensemble Trust-Region Policy Optimization"
50 / 305 papers shown
Title
Balanced Product of Calibrated Experts for Long-Tailed Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Emanuel Sanchez Aimar
Arvi Jonnarth
Michael Felsberg
Marco Kuhlmann
179
37
0
10 Jun 2022
A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Jinpei Guo
Biwei Huang
Dacheng Tao
148
21
0
09 Jun 2022
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
235
13
0
09 Jun 2022
Offline Reinforcement Learning with Causal Structured World Models
Zhengbang Zhu
Xiong-Hui Chen
Hong Tian
Kun Zhang
Yang Yu
CML
OffRL
155
22
0
03 Jun 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
181
0
0
22 May 2022
Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Zhengyu Yang
Kan Ren
Xufang Luo
Minghuan Liu
Yuante Li
Jiang Bian
Weinan Zhang
Dongsheng Li
154
28
0
19 May 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
228
10
0
20 Apr 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
International Conference on Learning Representations (ICLR), 2022
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Lukasz Wawrzyniak
191
124
0
14 Apr 2022
Control-oriented meta-learning
Spencer M. Richards
Navid Azizan
Jean-Jacques E. Slotine
Marco Pavone
139
31
0
14 Apr 2022
Gradient-Based Trajectory Optimization With Learned Dynamics
IEEE International Conference on Robotics and Automation (ICRA), 2022
Bhavya Sukhija
Nathanael Kohler
Miguel Zamora
Simon Zimmermann
Sebastian Curi
Andreas Krause
Stelian Coros
245
13
0
09 Apr 2022
Revisiting Model-based Value Expansion
Daniel Palenicek
M. Lutter
Jan Peters
183
2
0
28 Mar 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Che Wang
Xufang Luo
George Andriopoulos
Dongsheng Li
OffRL
356
62
0
17 Feb 2022
Safe Reinforcement Learning by Imagining the Near Future
Neural Information Processing Systems (NeurIPS), 2022
G. Thomas
Yuping Luo
Tengyu Ma
OffRL
163
105
0
15 Feb 2022
Deep Ensembles Work, But Are They Necessary?
Neural Information Processing Systems (NeurIPS), 2022
Taiga Abe
E. Kelly Buchanan
Geoff Pleiss
R. Zemel
John P. Cunningham
OOD
UQCV
304
79
0
14 Feb 2022
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Michael Teng
M. van de Panne
Frank Wood
OOD
OffRL
100
1
0
06 Feb 2022
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective
Ryan K. Tan
Yang Liu
Lei Xie
258
2
0
21 Jan 2022
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
146
37
0
16 Dec 2021
Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
M. D. Bernardo
144
8
0
11 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
305
7
0
08 Dec 2021
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
182
3
0
06 Dec 2021
Continuous Control With Ensemble Deep Deterministic Policy Gradients
Piotr Januszewski
Mateusz Olko
M. Królikowski
J. Swiatkowski
Marcin Andrychowicz
Łukasz Kuciński
Piotr Miłoś
OffRL
82
12
0
30 Nov 2021
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Zhaolin Ren
Tianjun Zhang
Csaba Szepesvári
Bo Dai
213
22
0
22 Nov 2021
On Effective Scheduling of Model-based Reinforcement Learning
Hang Lai
Jian Shen
Weinan Zhang
Yimin Huang
Xingzhi Zhang
Ruiming Tang
Yong Yu
Zhenguo Li
204
22
0
16 Nov 2021
ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control
Xingshuai Huang
Di Wu
M. Jenkin
Benoit Boulet
186
15
0
15 Nov 2021
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention
Conference on Robot Learning (CoRL), 2021
Yunkun Xu
Zhen-yu Liu
Guifang Duan
Jiangcheng Zhu
X. Bai
Jianrong Tan
211
11
0
10 Nov 2021
Gradients are Not All You Need
Luke Metz
C. Freeman
S. Schoenholz
Tal Kachman
200
102
0
10 Nov 2021
Improving Hyperparameter Optimization by Planning Ahead
H. Jomaa
Jonas K. Falkner
Lars Schmidt-Thieme
135
0
0
15 Oct 2021
On-Policy Model Errors in Reinforcement Learning
Lukas P. Fröhlich
Maksym Lefarov
Melanie Zeilinger
Felix Berkenkamp
OnRL
187
6
0
15 Oct 2021
Using Human-Guided Causal Knowledge for More Generalized Robot Task Planning
Semir Tatlidil
Yanqi Liu
Emily Sheetz
R. I. Bahar
Steven Sloman
180
0
0
09 Oct 2021
Revisiting Design Choices in Offline Model-Based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2021
Cong Lu
Philip J. Ball
Jack Parker-Holder
Michael A. Osborne
Stephen J. Roberts
OffRL
217
59
0
08 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
221
16
0
05 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
288
142
0
05 Oct 2021
Cycle-Consistent World Models for Domain Independent Latent Imagination
Sidney Bender
Tim Joseph
Marius Zoellner
225
0
0
02 Oct 2021
Learning Dynamics Models for Model Predictive Agents
M. Lutter
Leonard Hasenclever
Arunkumar Byravan
Gabriel Dulac-Arnold
Piotr Trochim
N. Heess
J. Merel
Yuval Tassa
AI4CE
188
29
0
29 Sep 2021
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
129
10
0
22 Sep 2021
Dropout's Dream Land: Generalization from Learned Simulators to Reality
Zac Wellmer
James T. Kwok
SyDa
135
9
0
17 Sep 2021
Dynamics-Aware Quality-Diversity for Efficient Learning of Skill Repertoires
Bryan Lim
Luca Grillotti
Lorenzo Bernasconi
Antoine Cully
168
28
0
16 Sep 2021
Robust Stability of Neural Network-controlled Nonlinear Systems with Parametric Variability
Soumyabrata Talukder
Ratnesh Kumar
134
12
0
13 Sep 2021
Federated Ensemble Model-based Reinforcement Learning in Edge Computing
Jin Wang
Jia Hu
Jed Mills
Geyong Min
Ming Xia
FedML
169
28
0
12 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
91
3
0
04 Sep 2021
Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control
Asian Conference on Machine Learning (ACML), 2021
Wanpeng Zhang
Xiaoyan Cao
Yaowen Yao
Zhicheng An
Xi Xiao
Dijun Luo
OffRL
166
23
0
26 Aug 2021
Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Baiyu Peng
Jingliang Duan
Jianyu Chen
Shengbo Eben Li
Genjin Xie
Congsheng Zhang
Yang Guan
Yao Mu
Enxin Sun
95
26
0
26 Aug 2021
Model-Based Opponent Modeling
Xiaopeng Yu
Jiechuan Jiang
Wanpeng Zhang
Haobin Jiang
Zongqing Lu
OffRL
238
38
0
04 Aug 2021
MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning
Wanpeng Zhang
Xi Xiao
Yaowen Yao
Mingzhe Chen
Dijun Luo
OffRL
135
1
0
03 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Proceedings of the Royal Society A (Proc. R. Soc. A), 2021
Xin-Yang Liu
Jian-Xun Wang
AI4CE
237
48
0
31 Jul 2021
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
International Conference on Machine Learning (ICML), 2021
Yuda Song
Wen Sun
233
23
0
15 Jul 2021
Centralized Model and Exploration Policy for Multi-Agent RL
Qizhen Zhang
Chris Xiaoxuan Lu
Animesh Garg
Jakob N. Foerster
152
19
0
14 Jul 2021
A Survey of Uncertainty in Deep Neural Networks
J. Gawlikowski
Cedrique Rovile Njieutcheu Tassi
Mohsin Ali
Jongseo Lee
Matthias Humt
...
R. Roscher
Muhammad Shahzad
Wen Yang
R. Bamler
Xiaoxiang Zhu
BDL
UQCV
OOD
527
1,456
0
07 Jul 2021
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
Muhammad Rizki Maulana
W. Lee
143
2
0
05 Jul 2021
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
170
23
0
05 Jul 2021