v1v2 (latest)

Model-Ensemble Trust-Region Policy Optimization

International Conference on Learning Representations (ICLR), 2018

28 February 2018

Pieter Abbeel

Papers citing "Model-Ensemble Trust-Region Policy Optimization"

50 / 305 papers shown

Title
Controllable Flow Matching for Online Reinforcement Learning Bin Wang Boxiang Tao Haifeng Jing Hongbo Dou Zijian Wang 82 0 0 10 Nov 2025
Cavity Duplexer Tuning with 1d Resnet-like Neural Networks Anton Raskovalov 40 0 0 17 Oct 2025
First Order Model-Based RL through Decoupled Backpropagation Joseph Amigo Rooholla Khorrambakht Elliot Chane-Sane Nicolas Mansard Ludovic Righetti 114 0 0 29 Aug 2025
Meta-reinforcement learning with minimum attention Pilhwa Lee Shashank Gupta OffRL 184 0 0 22 May 2025
Sample-Efficient Reinforcement Learning of Koopman eNMPCComputers and Chemical Engineering (CCE), 2025 Daniel Mayfrank M. Velioglu Alexander Mitsos Manuel Dahmen OffRL 291 0 0 24 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications Siyuan Mu Sen Lin MoE 969 35 0 10 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic Stefano Viel Luca Viano Volkan Cevher 570 1 0 27 Feb 2025
Scalable Model Merging with Progressive Layer-wise Distillation Jing Xu Jiazheng Li J.N. Zhang MoMe FedML 561 6 0 18 Feb 2025
Digital Twin Calibration with Model-Based Reinforcement Learning Hua Zheng Wei Xie I. Ryzhov Keilung Choy 338 0 0 04 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory Yangchun Zhang Wang Zhou Yirui Zhou 205 0 0 31 Dec 2024
SimuDICE: Offline Policy Optimization Through World Model Updates and DICE Estimation Catalin E. Brita Stephan Bongers F. Oliehoek OffRL 217 0 0 09 Dec 2024
Understanding World or Predicting Future? A Comprehensive Survey of World ModelsACM Computing Surveys (ACM CSUR), 2024 Jingtao Ding Yunke Zhang Yu Shang Yuheng Zhang Zefang Zong ... Fengli Xu Yong Li Chen Gao Fengli Xu Yong Li VGen SyDa 405 70 0 21 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A SurveyACM Computing Surveys (ACM CSUR), 2024 Zhihong Liu Xin Xu Peng Qiao Dongsheng Li OffRL 258 13 0 08 Nov 2024
Learning World Models for Unconstrained Goal NavigationNeural Information Processing Systems (NeurIPS), 2024 Yuanlin Duan Wensen Mao He Zhu 215 5 0 03 Nov 2024
Guiding Reinforcement Learning with Incomplete System DynamicsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024 Shuyuan Wang Jingliang Duan Nathan P. Lawrence Philip D. Loewen M. Forbes R. Bhushan Gopaluni Lixian Zhang 206 3 0 22 Oct 2024
Dual Action Policy for Robust Sim-to-Real Reinforcement LearningInternational Conference on Artificial Neural Networks (ICANN), 2024 Ng Wen Zheng Terence Chen Jianda 145 0 0 16 Oct 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter Yansong Li Zeyu Dong Ertai Luo Yu Wu Shuo Wu Shuo Han 121 3 0 16 Oct 2024
COSBO: Conservative Offline Simulation-Based Policy Optimization E. Kargar Ville Kyrki OffRL 128 0 0 22 Sep 2024
Offline Model-Based Reinforcement Learning with Anti-ExplorationEuropean Conference on Artificial Intelligence (ECAI), 2024 Padmanaba Srinivasan William J. Knottenbelt OffRL 207 0 0 20 Aug 2024
Mixture of Experts in a Mixture of RL settings Timon Willi J. Obando-Ceron Jakob Foerster Karolina Dziugaite Pablo Samuel Castro MoE 319 15 0 26 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning Dmitry Shribak Chen-Xiao Gao Yitong Li Chenjun Xiao Bo Dai DiffM 288 8 0 23 Jun 2024
Learning to Play Atari in a World of Tokens Pranav Agarwal Sheldon Andrews Samira Ebrahimi Kahou OffRL 217 5 0 03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption Bernd Frauenknecht Artur Eisele Devdutt Subhasish Friedrich Solowjow Sebastian Trimpe 315 5 0 29 May 2024
Efficient Multi-agent Reinforcement Learning by Planning Qihan Liu Jianing Ye Xiaoteng Ma Jun Yang Bin Liang Chongjie Zhang 175 14 0 20 May 2024
The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin P. DÓro Evgenii Nikishin Rameswar Panda 220 6 0 07 May 2024
Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey Maeva Guerrier Hassan Fouad Giovanni Beltrame OffRL 201 6 0 22 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning Yi Shen Hanyan Huang Shan Xie 186 0 0 03 Apr 2024
$Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control$ Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control Minjun Sung Sambhu H. Karumanchi Aditya Gahlawat N. Hovakimyan 191 1 0 21 Mar 2024
An Efficient Model-Based Approach on Learning Agile Motor Skills without Reinforcement Hao-bin Shi Tingguang Li Qing Zhu Jiapeng Sheng Lei Han Max Q.-H. Meng 182 3 0 04 Mar 2024
Model-based deep reinforcement learning for accelerated learning from flow simulations Andre Weiner Janis Geise AI4CE 225 6 0 26 Feb 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks Talha Bozkus Urbashi Mitra 164 6 0 12 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization Talha Bozkus Urbashi Mitra OffRL 236 8 0 08 Feb 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2024 Zizhao Wang Caroline Wang Xuesu Xiao Yuke Zhu Peter Stone OffRL 105 8 0 23 Jan 2024
Episodic Reinforcement Learning with Expanded State-reward Space Dayang Liang Yaru Zhang Yunlong Liu OffRL 130 4 0 19 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning Rafael Rafailov Kyle Hatch Victor Kolev John D. Martin Mariano Phielipp Chelsea Finn OffRL OnRL 307 17 0 06 Jan 2024
Efficient Reinforcement Learning via Decoupling Exploration and Utilization Jingpu Yang Helin Wang Qirui Zhao Zhecheng Shi Zirui Song Miao Fang 329 3 0 26 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey Dom Huh Prasant Mohapatra AI4CE 286 33 0 15 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control Bernd Frauenknecht Tobias Ehlgen Sebastian Trimpe 201 4 0 30 Nov 2023
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible PlansNeural Information Processing Systems (NeurIPS), 2023 Kyowoon Lee Seongun Kim Jaesik Choi DiffM 239 19 0 30 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL Yiqin Tan Ling Pan Longbo Huang OffRL 257 0 0 21 Oct 2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RLInternational Conference on Learning Representations (ICLR), 2023 Xiyao Wang Ruijie Zheng Yanchao Sun Ruonan Jia Wichayaporn Wongkamjan Huazhe Xu Furong Huang OffRL 259 16 0 11 Oct 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based RolloutIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023 Haoran Wang Zeshen Tang Leya Yang Yaoru Sun Fang Wang Siyu Zhang Ye-Ting Chen 261 2 0 24 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy OptimizationNeural Information Processing Systems (NeurIPS), 2023 Hai Zhang Hang Yu Siyue Tao Di Zhang Chang Huang Hongtu Zhou Xiao Zhang Chen Ye 245 12 0 22 Sep 2023
Introspective Deep Metric LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Cheng-Hao Wang Wenzhao Zheng Zheng Hua Zhu Jie Zhou Jiwen Lu UQCV 225 18 0 11 Sep 2023
Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning Marin Vlastelica Sebastian Blaes Cristina Pinneri Georg Martius 118 2 0 11 Sep 2023
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement LearningEuropean Symposium on Research in Computer Security (ESORICS), 2023 M. Rigaki Sebastian Garcia AAML 117 6 0 31 Aug 2023
Efficient Epistemic Uncertainty Estimation in Regression Ensemble Models Using Pairwise-Distance Estimators Lucas Berry David Meger UD 378 3 0 25 Aug 2023
Censored Sampling of Diffusion Models Using 3 Minutes of Human FeedbackNeural Information Processing Systems (NeurIPS), 2023 Taeho Yoon Kibeom Myoung Keon Lee Jaewoong Cho Albert No Ernest K. Ryu 229 11 0 06 Jul 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysisNeural Information Processing Systems (NeurIPS), 2023 Alexander Meulemans Simon Schug Seijin Kobayashi Nathaniel D. Daw Gregory Wayne 304 6 0 29 Jun 2023
Learning non-Markovian Decision-Making from State-only SequencesNeural Information Processing Systems (NeurIPS), 2023 Aoyang Qin Feng Gao Qing Li Song-Chun Zhu Sirui Xie 237 11 0 27 Jun 2023