Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1807.03858
Cited By
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
10 July 2018
Yuping Luo
Huazhe Xu
Yuanzhi Li
Yuandong Tian
Trevor Darrell
Tengyu Ma
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees"
47 / 47 papers shown
Title
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
93
1
0
26 Mar 2025
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
46
1
0
23 Aug 2024
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Hao-ming Lin
Wenhao Ding
Jian Chen
Laixi Shi
Jiacheng Zhu
Bo-wen Li
Ding Zhao
OffRL
CML
52
0
0
15 Jul 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
46
5
0
29 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
34
1
0
27 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
30
8
0
24 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
39
2
0
10 May 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
9
0
06 Jan 2024
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
30
8
0
15 Dec 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
22
3
0
17 May 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
31
1
0
09 Mar 2023
Learning Interaction-aware Motion Prediction Model for Decision-making in Autonomous Driving
Zhiyu Huang
Haochen Liu
Jingda Wu
Wenhui Huang
Chen Lv
31
17
0
08 Feb 2023
On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning
P. Becker
Gerhard Neumann
22
9
0
17 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement Learning
Tianying Ji
Yu-Juan Luo
Fuchun Sun
Mingxuan Jing
Fengxiang He
Wen-bing Huang
16
18
0
15 Oct 2022
Model-based Reinforcement Learning with Multi-step Plan Value Estimation
Hao-Chu Lin
Yihao Sun
Jiajin Zhang
Yang Yu
OffRL
24
7
0
12 Sep 2022
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games
Fivos Kalogiannis
Ioannis Anagnostides
Ioannis Panageas
Emmanouil-Vasileios Vlatakis-Gkaragkounis
Vaggos Chatziafratis
S. Stavroulakis
31
13
0
03 Aug 2022
Scalable Model-based Policy Optimization for Decentralized Networked Systems
Yali Du
Chengdong Ma
Yuchen Liu
Runji Lin
Hao Dong
Jun Wang
Yaodong Yang
25
8
0
13 Jul 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
44
101
0
19 Jun 2022
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
18
0
0
13 Jun 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
24
6
0
20 Apr 2022
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer
Xingyu Liu
Deepak Pathak
Kris M. Kitani
23
19
0
10 Feb 2022
Hyperparameter Selection Methods for Fitted Q-Evaluation with Error Guarantee
Kohei Miyaguchi
OffRL
33
1
0
07 Jan 2022
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
19
30
0
16 Dec 2021
ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control
Xingshuai Huang
Di Wu
M. Jenkin
Benoit Boulet
13
15
0
15 Nov 2021
Improving Hyperparameter Optimization by Planning Ahead
H. Jomaa
Jonas K. Falkner
Lars Schmidt-Thieme
22
0
0
15 Oct 2021
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML
FAtt
OffRL
21
2
0
06 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
27
15
0
05 Oct 2021
Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control
Wanpeng Zhang
Xiaoyan Cao
Yaowen Yao
Zhicheng An
Xi Xiao
Dijun Luo
OffRL
33
18
0
26 Aug 2021
Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control
Xin-Yang Liu
Jian-Xun Wang
AI4CE
23
38
0
31 Jul 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
19
35
0
07 May 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
A Tutorial on Sparse Gaussian Processes and Variational Inference
Felix Leibfried
Vincent Dutordoir
S. T. John
N. Durrande
GP
27
49
0
27 Dec 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
30
27
0
19 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
9
7
0
04 Oct 2020
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
29
45
0
23 Aug 2020
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Amy Zhang
Shagun Sodhani
Khimya Khetarpal
Joelle Pineau
29
5
0
14 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control
Sham Kakade
A. Krishnamurthy
Kendall Lowrey
Motoya Ohnishi
Wen Sun
31
117
0
22 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
25
82
0
15 Jun 2020
Maximum Entropy Model Rollouts: Fast Model Based Policy Optimization without Compounding Errors
Chi Zhang
S. Kuppannagari
Viktor Prasanna
17
4
0
08 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
33
86
0
16 May 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
21
32
0
07 Feb 2020
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
14
31
0
01 Nov 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
11
27
0
28 Oct 2019
Gradient-Aware Model-based Policy Search
P. DÓro
Alberto Maria Metelli
Andrea Tirinzoni
Matteo Papini
Marcello Restelli
19
34
0
09 Sep 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
22
84
0
10 Jun 2019
The Principle of Unchanged Optimality in Reinforcement Learning Generalization
A. Irpan
Xingyou Song
OffRL
25
7
0
02 Jun 2019
Combating the Compounding-Error Problem with a Multi-step Model
Kavosh Asadi
Dipendra Kumar Misra
Seungchan Kim
Michel L. Littman
LRM
14
55
0
30 May 2019
1