Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1802.10592
Cited By
v1
v2 (latest)
Model-Ensemble Trust-Region Policy Optimization
International Conference on Learning Representations (ICLR), 2018
28 February 2018
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Model-Ensemble Trust-Region Policy Optimization"
50 / 304 papers shown
Title
Controllable Flow Matching for Online Reinforcement Learning
Bin Wang
Boxiang Tao
Haifeng Jing
Hongbo Dou
Zijian Wang
52
0
0
10 Nov 2025
Cavity Duplexer Tuning with 1d Resnet-like Neural Networks
Anton Raskovalov
36
0
0
17 Oct 2025
First Order Model-Based RL through Decoupled Backpropagation
Joseph Amigo
Rooholla Khorrambakht
Elliot Chane-Sane
Nicolas Mansard
Ludovic Righetti
102
0
0
29 Aug 2025
Meta-reinforcement learning with minimum attention
Pilhwa Lee
Shashank Gupta
OffRL
160
0
0
22 May 2025
Sample-Efficient Reinforcement Learning of Koopman eNMPC
Computers and Chemical Engineering (CCE), 2025
Daniel Mayfrank
M. Velioglu
Alexander Mitsos
Manuel Dahmen
OffRL
259
0
0
24 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
909
31
0
10 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
Volkan Cevher
538
1
0
27 Feb 2025
Scalable Model Merging with Progressive Layer-wise Distillation
Jing Xu
Jiazheng Li
J.N. Zhang
MoMe
FedML
557
6
0
18 Feb 2025
Digital Twin Calibration with Model-Based Reinforcement Learning
Hua Zheng
Wei Xie
I. Ryzhov
Keilung Choy
290
0
0
04 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
181
0
0
31 Dec 2024
SimuDICE: Offline Policy Optimization Through World Model Updates and DICE Estimation
Catalin E. Brita
Stephan Bongers
F. Oliehoek
OffRL
201
0
0
09 Dec 2024
Understanding World or Predicting Future? A Comprehensive Survey of World Models
ACM Computing Surveys (ACM CSUR), 2024
Jingtao Ding
Yunke Zhang
Yu Shang
Yuheng Zhang
Zefang Zong
...
Fengli Xu
Yong Li
Chen Gao
Fengli Xu
Yong Li
VGen
SyDa
397
17
0
21 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
ACM Computing Surveys (ACM CSUR), 2024
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
233
13
0
08 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Wensen Mao
He Zhu
211
5
0
03 Nov 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
198
3
0
22 Oct 2024
Dual Action Policy for Robust Sim-to-Real Reinforcement Learning
International Conference on Artificial Neural Networks (ICANN), 2024
Ng Wen Zheng Terence
Chen Jianda
129
0
0
16 Oct 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter
Yansong Li
Zeyu Dong
Ertai Luo
Yu Wu
Shuo Wu
Shuo Han
109
3
0
16 Oct 2024
COSBO: Conservative Offline Simulation-Based Policy Optimization
E. Kargar
Ville Kyrki
OffRL
112
0
0
22 Sep 2024
Offline Model-Based Reinforcement Learning with Anti-Exploration
European Conference on Artificial Intelligence (ECAI), 2024
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
203
0
0
20 Aug 2024
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
295
14
0
26 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
280
7
0
23 Jun 2024
Learning to Play Atari in a World of Tokens
Pranav Agarwal
Sheldon Andrews
Samira Ebrahimi Kahou
OffRL
209
5
0
03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
299
5
0
29 May 2024
Efficient Multi-agent Reinforcement Learning by Planning
Qihan Liu
Jianing Ye
Xiaoteng Ma
Jun Yang
Bin Liang
Chongjie Zhang
167
14
0
20 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
216
6
0
07 May 2024
Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey
Maeva Guerrier
Hassan Fouad
Giovanni Beltrame
OffRL
197
6
0
22 Apr 2024
Robust Model Based Reinforcement Learning Using
L
1
\mathcal{L}_1
L
1
Adaptive Control
Minjun Sung
Sambhu H. Karumanchi
Aditya Gahlawat
N. Hovakimyan
171
1
0
21 Mar 2024
An Efficient Model-Based Approach on Learning Agile Motor Skills without Reinforcement
Hao-bin Shi
Tingguang Li
Qing Zhu
Jiapeng Sheng
Lei Han
Max Q.-H. Meng
174
3
0
04 Mar 2024
Model-based deep reinforcement learning for accelerated learning from flow simulations
Andre Weiner
Janis Geise
AI4CE
209
6
0
26 Feb 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks
Talha Bozkus
Urbashi Mitra
164
6
0
12 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
Talha Bozkus
Urbashi Mitra
OffRL
224
8
0
08 Feb 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2024
Zizhao Wang
Caroline Wang
Xuesu Xiao
Yuke Zhu
Peter Stone
OffRL
105
8
0
23 Jan 2024
Episodic Reinforcement Learning with Expanded State-reward Space
Dayang Liang
Yaru Zhang
Yunlong Liu
OffRL
126
4
0
19 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
299
17
0
06 Jan 2024
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
Jingpu Yang
Helin Wang
Qirui Zhao
Zhecheng Shi
Zirui Song
Miao Fang
281
3
0
26 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
278
30
0
15 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
189
4
0
30 Nov 2023
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Neural Information Processing Systems (NeurIPS), 2023
Kyowoon Lee
Seongun Kim
Jaesik Choi
DiffM
215
19
0
30 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
249
0
0
21 Oct 2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
International Conference on Learning Representations (ICLR), 2023
Xiyao Wang
Ruijie Zheng
Yanchao Sun
Ruonan Jia
Wichayaporn Wongkamjan
Huazhe Xu
Furong Huang
OffRL
243
16
0
11 Oct 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
233
2
0
24 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Neural Information Processing Systems (NeurIPS), 2023
Hai Zhang
Hang Yu
Siyue Tao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
221
12
0
22 Sep 2023
Introspective Deep Metric Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Cheng-Hao Wang
Wenzhao Zheng
Zheng Hua Zhu
Jie Zhou
Jiwen Lu
UQCV
201
18
0
11 Sep 2023
Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning
Marin Vlastelica
Sebastian Blaes
Cristina Pinneri
Georg Martius
110
2
0
11 Sep 2023
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning
European Symposium on Research in Computer Security (ESORICS), 2023
M. Rigaki
Sebastian Garcia
AAML
113
6
0
31 Aug 2023
Efficient Epistemic Uncertainty Estimation in Regression Ensemble Models Using Pairwise-Distance Estimators
Lucas Berry
David Meger
UD
323
3
0
25 Aug 2023
Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback
Neural Information Processing Systems (NeurIPS), 2023
Taeho Yoon
Kibeom Myoung
Keon Lee
Jaewoong Cho
Albert No
Ernest K. Ryu
221
11
0
06 Jul 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Neural Information Processing Systems (NeurIPS), 2023
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
300
6
0
29 Jun 2023
Learning non-Markovian Decision-Making from State-only Sequences
Neural Information Processing Systems (NeurIPS), 2023
Aoyang Qin
Feng Gao
Qing Li
Song-Chun Zhu
Sirui Xie
225
11
0
27 Jun 2023
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
Conference on Robot Learning (CoRL), 2023
H.J. Terry Suh
Glen Chou
Hongkai Dai
Lujie Yang
Abhishek Gupta
Russ Tedrake
DiffM
OffRL
212
14
0
24 Jun 2023
1
2
3
4
5
6
7
Next