ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.10592
  4. Cited By
Model-Ensemble Trust-Region Policy Optimization
v1v2 (latest)

Model-Ensemble Trust-Region Policy Optimization

International Conference on Learning Representations (ICLR), 2018
28 February 2018
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Model-Ensemble Trust-Region Policy Optimization"

50 / 305 papers shown
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via
  Diffusion Score Matching
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score MatchingConference on Robot Learning (CoRL), 2023
H.J. Terry Suh
Glen Chou
Hongkai Dai
Lujie Yang
Abhishek Gupta
Russ Tedrake
DiffMOffRL
268
15
0
24 Jun 2023
Deep Generative Models for Decision-Making and Control
Deep Generative Models for Decision-Making and Control
Michael Janner
291
3
0
15 Jun 2023
How to Learn and Generalize From Three Minutes of Data:
  Physics-Constrained and Uncertainty-Aware Neural Stochastic Differential
  Equations
How to Learn and Generalize From Three Minutes of Data: Physics-Constrained and Uncertainty-Aware Neural Stochastic Differential EquationsConference on Robot Learning (CoRL), 2023
Franck Djeumou
Cyrus Neary
Ufuk Topcu
DiffM
275
14
0
10 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy
  Actor-Critic
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-CriticInternational Conference on Machine Learning (ICML), 2023
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRLOnRL
406
21
0
05 Jun 2023
Maximize to Explore: One Objective Function Fusing Estimation, Planning,
  and Exploration
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and ExplorationNeural Information Processing Systems (NeurIPS), 2023
Zhihan Liu
Miao Lu
Wei Xiong
Han Zhong
Haotian Hu
Shenao Zhang
Sirui Zheng
Zhuoran Yang
Zhaoran Wang
OffRL
352
24
0
29 May 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Cross-Domain Policy Adaptation via Value-Guided Data FilteringNeural Information Processing Systems (NeurIPS), 2023
Kang Xu
Chenjia Bai
Xiaoteng Ma
Dong Wang
Bingyan Zhao
Zhen Wang
Xuelong Li
Wei Li
308
26
0
28 May 2023
Sample Efficient Reinforcement Learning in Mixed Systems through
  Augmented Samples and Its Applications to Queueing Networks
Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing NetworksNeural Information Processing Systems (NeurIPS), 2023
Honghao Wei
Xin Liu
Weina Wang
Lei Ying
214
10
0
25 May 2023
Co-Learning Empirical Games and World Models
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
253
2
0
23 May 2023
ChemGymRL: An Interactive Framework for Reinforcement Learning for
  Digital Chemistry
ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry
Chris Beeler
Sriram Ganapathi Subramanian
Kyle Sprague
Nouha Chatti
C. Bellinger
...
Amanuel Dawit
Zihan Yang
Xinkai Li
Mark Crowley
Isaac Tamblyn
OffRL
202
7
0
23 May 2023
Robust nonlinear set-point control with reinforcement learning
Robust nonlinear set-point control with reinforcement learningAmerican Control Conference (ACC), 2023
Ruoqing Zhang
Per Mattsson
T. Wigren
OOD
126
2
0
20 Apr 2023
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local
  Models in Model-Based Multi-Agent Reinforcement Learning
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Zifan Wu
Chao Yu
Chong Chen
Jianye Hao
H. Zhuo
149
15
0
31 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
391
1
0
23 Mar 2023
Delay-SDE-net: A deep learning approach for time series modelling with
  memory and uncertainty estimates
Delay-SDE-net: A deep learning approach for time series modelling with memory and uncertainty estimates
M. Eggen
A. Midtfjord
160
3
0
14 Mar 2023
Learning a model is paramount for sample efficiency in reinforcement
  learning control of PDEs
Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs
Stefan Werner
Sebastian Peitz
282
10
0
14 Feb 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
430
52
0
10 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Predictable MDP Abstraction for Unsupervised Model-Based RLInternational Conference on Machine Learning (ICML), 2023
Seohong Park
Sergey Levine
214
10
0
08 Feb 2023
Multipath agents for modular multitask ML systems
Multipath agents for modular multitask ML systems
Andrea Gesmundo
242
1
0
06 Feb 2023
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty
  Modeling
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty ModelingAAAI Conference on Artificial Intelligence (AAAI), 2023
Lucas Berry
David Meger
288
13
0
02 Feb 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with
  Lipschitz Regularized Value Function
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value FunctionInternational Conference on Learning Representations (ICLR), 2023
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
248
17
0
02 Feb 2023
Learning Control from Raw Position Measurements
Learning Control from Raw Position MeasurementsAmerican Control Conference (ACC), 2023
Fabio Amadio
Alberto Dalla Libera
D. Nikovski
R. Carli
Diego Romeres
160
10
0
30 Jan 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
James Queeney
M. Benosman
OODOffRL
284
13
0
30 Jan 2023
Plan To Predict: Learning an Uncertainty-Foreseeing Model for
  Model-Based Reinforcement Learning
Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Zifan Wu
Chao Yu
Chong Chen
Jianye Hao
H. Zhuo
146
21
0
20 Jan 2023
On Multi-Agent Deep Deterministic Policy Gradients and their
  Explainability for SMARTS Environment
On Multi-Agent Deep Deterministic Policy Gradients and their Explainability for SMARTS Environment
Ansh Mittal
Aditya Malte
214
1
0
20 Jan 2023
Latent Variable Representation for Reinforcement Learning
Latent Variable Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Zhaolin Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
227
12
0
17 Dec 2022
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based
  Offline Reinforcement Learning
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
416
10
0
30 Nov 2022
Domain Generalization for Robust Model-Based Offline Reinforcement
  Learning
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OODOffRL
185
0
0
27 Nov 2022
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch
  Size
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
263
19
0
20 Nov 2022
On Many-Actions Policy Gradient
On Many-Actions Policy GradientInternational Conference on Machine Learning (ICML), 2022
Michal Nauman
Marek Cygan
321
0
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free DeploymentsNeural Information Processing Systems (NeurIPS), 2022
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
258
11
0
23 Oct 2022
Deep Reinforcement Learning for Stabilization of Large-scale
  Probabilistic Boolean Networks
Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean NetworksbioRxiv (bioRxiv), 2022
S. Moschoyiannis
Evangelos Chatzaroulas
Vytenis Sliogeris
Yuhu Wu
BDLOffRLAI4CE
90
16
0
21 Oct 2022
When to Update Your Model: Constrained Model-based Reinforcement
  Learning
When to Update Your Model: Constrained Model-based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Tianying Ji
Yu-Juan Luo
Gang Hua
Mingxuan Jing
Fengxiang He
Wen-bing Huang
358
22
0
15 Oct 2022
Model-based Safe Deep Reinforcement Learning via a Constrained Proximal
  Policy Optimization Algorithm
Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization AlgorithmNeural Information Processing Systems (NeurIPS), 2022
Ashish Kumar Jayant
S. Bhatnagar
OffRL
159
61
0
14 Oct 2022
A Unified Framework for Alternating Offline Model Training and Policy
  Learning
A Unified Framework for Alternating Offline Model Training and Policy LearningNeural Information Processing Systems (NeurIPS), 2022
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
232
17
0
12 Oct 2022
CostNet: An End-to-End Framework for Goal-Directed Reinforcement
  Learning
CostNet: An End-to-End Framework for Goal-Directed Reinforcement LearningSGAI Conferences (SGAI), 2022
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
3DVOffRL
89
0
0
03 Oct 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline
  Reinforcement Learning
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Daesol Cho
D. Shim
H. J. Kim
OffRL
186
11
0
30 Sep 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical
  Multi-Step Approach for Policy Training
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy TrainingInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Gang Chen
Victoria Huang
OffRL
291
1
0
29 Sep 2022
Training neural network ensembles via trajectory sampling
Training neural network ensembles via trajectory sampling
Jamie F. Mair
Dominic C. Rose
J. P. Garrahan
284
2
0
22 Sep 2022
Simplifying Model-based RL: Learning Representations, Latent-space
  Models, and Policies with One Objective
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
334
28
0
18 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based
  Reinforcement Learning
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Shen Zhang
161
6
0
16 Sep 2022
Variational Inference for Model-Free and Model-Based Reinforcement
  Learning
Variational Inference for Model-Free and Model-Based Reinforcement Learning
Felix Leibfried
OffRL
218
1
0
04 Sep 2022
Distributed Ensembles of Reinforcement Learning Agents for Electricity
  Control
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control
Pierrick Pochelu
S. Petiton
B. Conche
AI4CE
135
2
0
30 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Spectral Decomposition Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Zhaolin Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
218
35
0
19 Aug 2022
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
Live in the Moment: Learning Dynamics Model Adapted to Evolving PolicyInternational Conference on Machine Learning (ICML), 2022
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
358
21
0
25 Jul 2022
Making Linear MDPs Practical via Contrastive Representation Learning
Making Linear MDPs Practical via Contrastive Representation LearningInternational Conference on Machine Learning (ICML), 2022
Tianjun Zhang
Zhaolin Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
224
53
0
14 Jul 2022
Masked World Models for Visual Control
Masked World Models for Visual ControlConference on Robot Learning (CoRL), 2022
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
403
183
0
28 Jun 2022
Generalized Policy Improvement Algorithms with Theoretically Supported
  Sample Reuse
Generalized Policy Improvement Algorithms with Theoretically Supported Sample ReuseIEEE Transactions on Automatic Control (TAC), 2022
James Queeney
I. Paschalidis
Christos G. Cassandras
OffRL
297
3
0
28 Jun 2022
Causal Dynamics Learning for Task-Independent State Abstraction
Causal Dynamics Learning for Task-Independent State AbstractionInternational Conference on Machine Learning (ICML), 2022
Zizhao Wang
Xuesu Xiao
Zifan Xu
Yuke Zhu
Peter Stone
CML
202
70
0
27 Jun 2022
Certifiably Robust Policy Learning against Adversarial Communication in
  Multi-agent Systems
Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems
Yanchao Sun
Ruijie Zheng
Parisa Hassanzadeh
Yongyuan Liang
Soheil Feizi
Sumitra Ganesh
Furong Huang
AAML
221
12
0
21 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement LearningScience China Information Sciences (Sci. China Inf. Sci.), 2022
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRLLRM
343
152
0
19 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize
  Offline Reinforcement Learning
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
212
14
0
14 Jun 2022
Previous
1234567
Next