v1v2 (latest)

Model-Ensemble Trust-Region Policy Optimization

International Conference on Learning Representations (ICLR), 2018

28 February 2018

Pieter Abbeel

Papers citing "Model-Ensemble Trust-Region Policy Optimization"

50 / 305 papers shown

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score MatchingConference on Robot Learning (CoRL), 2023

Hongkai Dai

Abhishek Gupta

268

24 Jun 2023

Deep Generative Models for Decision-Making and Control

Michael Janner

291

15 Jun 2023

How to Learn and Generalize From Three Minutes of Data: Physics-Constrained and Uncertainty-Aware Neural Stochastic Differential EquationsConference on Robot Learning (CoRL), 2023

Franck Djeumou

Cyrus Neary

Ufuk Topcu

DiffM

275

10 Jun 2023

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-CriticInternational Conference on Machine Learning (ICML), 2023

406

05 Jun 2023

Maximize to Explore: One Objective Function Fusing Estimation, Planning, and ExplorationNeural Information Processing Systems (NeurIPS), 2023

Wei Xiong

352

29 May 2023

Cross-Domain Policy Adaptation via Value-Guided Data FilteringNeural Information Processing Systems (NeurIPS), 2023

Zhen Wang

Xuelong Li

Wei Li

308

28 May 2023

Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing NetworksNeural Information Processing Systems (NeurIPS), 2023

214

25 May 2023

Co-Learning Empirical Games and World Models

Max O. Smith

Michael P. Wellman

253

23 May 2023

ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry

Chris Beeler

Sriram Ganapathi Subramanian

...

202

23 May 2023

Robust nonlinear set-point control with reinforcement learningAmerican Control Conference (ACC), 2023

126

20 Apr 2023

Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023

Zifan Wu

Chao Yu

Chong Chen

Jianye Hao

H. Zhuo

149

31 Mar 2023

Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Tongzhou Mu

H. Su

OffRL

391

23 Mar 2023

Delay-SDE-net: A deep learning approach for time series modelling with memory and uncertainty estimates

M. Eggen

A. Midtfjord

160

14 Mar 2023

Learning a model is paramount for sample efficiency in reinforcement learning control of PDEs

Stefan Werner

Sebastian Peitz

282

14 Feb 2023

A Survey on Causal Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

430

10 Feb 2023

Predictable MDP Abstraction for Unsupervised Model-Based RLInternational Conference on Machine Learning (ICML), 2023

Seohong Park

Sergey Levine

214

08 Feb 2023

Multipath agents for modular multitask ML systems

Andrea Gesmundo

242

06 Feb 2023

Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty ModelingAAAI Conference on Artificial Intelligence (AAAI), 2023

Lucas Berry

David Meger

288

02 Feb 2023

Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value FunctionInternational Conference on Learning Representations (ICLR), 2023

Ruijie Zheng

Xiyao Wang

Huazhe Xu

Furong Huang

248

02 Feb 2023

Learning Control from Raw Position MeasurementsAmerican Control Conference (ACC), 2023

160

30 Jan 2023

Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

James Queeney

M. Benosman

OOD OffRL

284

30 Jan 2023

Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

Zifan Wu

Chao Yu

Chong Chen

Jianye Hao

H. Zhuo

146

20 Jan 2023

On Multi-Agent Deep Deterministic Policy Gradients and their Explainability for SMARTS Environment

Ansh Mittal

Aditya Malte

214

20 Jan 2023

Latent Variable Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022

Zhaolin Ren

Chenjun Xiao

Tianjun Zhang

Na Li

Sujay Sanghavi

227

17 Dec 2022

One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

416

30 Nov 2022

Domain Generalization for Robust Model-Based Offline Reinforcement Learning

Alan Clark

Shoaib Ahmed Siddiqui

185

27 Nov 2022

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

263

20 Nov 2022

On Many-Actions Policy GradientInternational Conference on Machine Learning (ICML), 2022

Michal Nauman

Marek Cygan

321

24 Oct 2022

Learning General World Models in a Handful of Reward-Free DeploymentsNeural Information Processing Systems (NeurIPS), 2022

258

23 Oct 2022

Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean NetworksbioRxiv (bioRxiv), 2022

S. Moschoyiannis

Evangelos Chatzaroulas

Vytenis Sliogeris

Yuhu Wu

BDL OffRL AI4CE

21 Oct 2022

When to Update Your Model: Constrained Model-based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

358

15 Oct 2022

Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization AlgorithmNeural Information Processing Systems (NeurIPS), 2022

Ashish Kumar Jayant

S. Bhatnagar

OffRL

159

14 Oct 2022

A Unified Framework for Alternating Offline Model Training and Policy LearningNeural Information Processing Systems (NeurIPS), 2022

232

12 Oct 2022

CostNet: An End-to-End Framework for Goal-Directed Reinforcement LearningSGAI Conferences (SGAI), 2022

Per-Arne Andersen

M. G. Olsen

Ole-Christoffer Granmo

3DV OffRL

03 Oct 2022

S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

186

30 Sep 2022

Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy TrainingInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

Gang Chen

Victoria Huang

OffRL

291

29 Sep 2022

Training neural network ensembles via trajectory sampling

Jamie F. Mair

Dominic C. Rose

J. P. Garrahan

284

22 Sep 2022

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022

Homanga Bharadhwaj

334

18 Sep 2022

Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

Shen Zhang

161

16 Sep 2022

Variational Inference for Model-Free and Model-Based Reinforcement Learning

Felix Leibfried

OffRL

218

04 Sep 2022

Distributed Ensembles of Reinforcement Learning Agents for Electricity Control

135

30 Aug 2022

Spectral Decomposition Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022

Tianjun Zhang

218

19 Aug 2022

Live in the Moment: Learning Dynamics Model Adapted to Evolving PolicyInternational Conference on Machine Learning (ICML), 2022

Xiyao Wang

Wichayaporn Wongkamjan

Furong Huang

358

25 Jul 2022

Making Linear MDPs Practical via Contrastive Representation LearningInternational Conference on Machine Learning (ICML), 2022

Tianjun Zhang

224

14 Jul 2022

Masked World Models for Visual ControlConference on Robot Learning (CoRL), 2022

Pieter Abbeel

403

183

28 Jun 2022

Generalized Policy Improvement Algorithms with Theoretically Supported Sample ReuseIEEE Transactions on Automatic Control (TAC), 2022

James Queeney

I. Paschalidis

Christos G. Cassandras

OffRL

297

28 Jun 2022

Causal Dynamics Learning for Task-Independent State AbstractionInternational Conference on Machine Learning (ICML), 2022

Xuesu Xiao

202

27 Jun 2022

Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems

Furong Huang

221

21 Jun 2022

A Survey on Model-based Reinforcement LearningScience China Information Sciences (Sci. China Inf. Sci.), 2022

343

152

19 Jun 2022

Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022

212

14 Jun 2022