Model-Ensemble Trust-Region Policy Optimization

International Conference on Learning Representations (ICLR), 2018

28 February 2018

Pieter Abbeel

Papers citing "Model-Ensemble Trust-Region Policy Optimization"

50 / 304 papers shown

Title
Controllable Flow Matching for Online Reinforcement Learning Bin Wang Boxiang Tao Haifeng Jing Hongbo Dou Zijian Wang 52 0 0 10 Nov 2025
Cavity Duplexer Tuning with 1d Resnet-like Neural Networks Anton Raskovalov 36 0 0 17 Oct 2025
First Order Model-Based RL through Decoupled Backpropagation Joseph Amigo Rooholla Khorrambakht Elliot Chane-Sane Nicolas Mansard Ludovic Righetti 102 0 0 29 Aug 2025
Meta-reinforcement learning with minimum attention Pilhwa Lee Shashank Gupta OffRL 160 0 0 22 May 2025
Sample-Efficient Reinforcement Learning of Koopman eNMPC Computers and Chemical Engineering (CCE), 2025 Daniel Mayfrank M. Velioglu Alexander Mitsos Manuel Dahmen OffRL 259 0 0 24 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications Siyuan Mu Sen Lin MoE 909 31 0 10 Mar 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic Stefano Viel Luca Viano Volkan Cevher 538 1 0 27 Feb 2025
Scalable Model Merging with Progressive Layer-wise Distillation Jing Xu Jiazheng Li J.N. Zhang MoMe FedML 557 6 0 18 Feb 2025
Digital Twin Calibration with Model-Based Reinforcement Learning Hua Zheng Wei Xie I. Ryzhov Keilung Choy 290 0 0 04 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory Yangchun Zhang Wang Zhou Yirui Zhou 181 0 0 31 Dec 2024
SimuDICE: Offline Policy Optimization Through World Model Updates and DICE Estimation Catalin E. Brita Stephan Bongers F. Oliehoek OffRL 201 0 0 09 Dec 2024
Understanding World or Predicting Future? A Comprehensive Survey of World Models ACM Computing Surveys (ACM CSUR), 2024 Jingtao Ding Yunke Zhang Yu Shang Yuheng Zhang Zefang Zong ... Fengli Xu Yong Li Chen Gao Fengli Xu Yong Li VGen SyDa 397 17 0 21 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey ACM Computing Surveys (ACM CSUR), 2024 Zhihong Liu Xin Xu Peng Qiao Dongsheng Li OffRL 233 13 0 08 Nov 2024
Learning World Models for Unconstrained Goal Navigation Neural Information Processing Systems (NeurIPS), 2024 Yuanlin Duan Wensen Mao He Zhu 211 5 0 03 Nov 2024
Guiding Reinforcement Learning with Incomplete System Dynamics IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024 Shuyuan Wang Jingliang Duan Nathan P. Lawrence Philip D. Loewen M. Forbes R. Bhushan Gopaluni Lixian Zhang 198 3 0 22 Oct 2024
Dual Action Policy for Robust Sim-to-Real Reinforcement Learning International Conference on Artificial Neural Networks (ICANN), 2024 Ng Wen Zheng Terence Chen Jianda 129 0 0 16 Oct 2024
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter Yansong Li Zeyu Dong Ertai Luo Yu Wu Shuo Wu Shuo Han 109 3 0 16 Oct 2024
COSBO: Conservative Offline Simulation-Based Policy Optimization E. Kargar Ville Kyrki OffRL 112 0 0 22 Sep 2024
Offline Model-Based Reinforcement Learning with Anti-Exploration European Conference on Artificial Intelligence (ECAI), 2024 Padmanaba Srinivasan William J. Knottenbelt OffRL 203 0 0 20 Aug 2024
Mixture of Experts in a Mixture of RL settings Timon Willi J. Obando-Ceron Jakob Foerster Karolina Dziugaite Pablo Samuel Castro MoE 295 14 0 26 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning Dmitry Shribak Chen-Xiao Gao Yitong Li Chenjun Xiao Bo Dai DiffM 280 7 0 23 Jun 2024
Learning to Play Atari in a World of Tokens Pranav Agarwal Sheldon Andrews Samira Ebrahimi Kahou OffRL 209 5 0 03 Jun 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption Bernd Frauenknecht Artur Eisele Devdutt Subhasish Friedrich Solowjow Sebastian Trimpe 299 5 0 29 May 2024
Efficient Multi-agent Reinforcement Learning by Planning Qihan Liu Jianing Ye Xiaoteng Ma Jun Yang Bin Liang Chongjie Zhang 167 14 0 20 May 2024
The Curse of Diversity in Ensemble-Based Exploration Zhixuan Lin P. D'Oro Evgenii Nikishin Rameswar Panda 216 6 0 07 May 2024
Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey Maeva Guerrier Hassan Fouad Giovanni Beltrame OffRL 197 6 0 22 Apr 2024
Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control Minjun Sung Sambhu H. Karumanchi Aditya Gahlawat N. Hovakimyan 171 1 0 21 Mar 2024
An Efficient Model-Based Approach on Learning Agile Motor Skills without Reinforcement Hao-bin Shi Tingguang Li Qing Zhu Jiapeng Sheng Lei Han Max Q.-H. Meng 174 3 0 04 Mar 2024
Model-based deep reinforcement learning for accelerated learning from flow simulations Andre Weiner Janis Geise AI4CE 209 6 0 26 Feb 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks Talha Bozkus Urbashi Mitra 164 6 0 12 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization Talha Bozkus Urbashi Mitra OffRL 224 8 0 08 Feb 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning AAAI Conference on Artificial Intelligence (AAAI), 2024 Zizhao Wang Caroline Wang Xuesu Xiao Yuke Zhu Peter Stone OffRL 105 8 0 23 Jan 2024
Episodic Reinforcement Learning with Expanded State-reward Space Dayang Liang Yaru Zhang Yunlong Liu OffRL 126 4 0 19 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning Rafael Rafailov Kyle Hatch Victor Kolev John D. Martin Mariano Phielipp Chelsea Finn OffRL OnRL 299 17 0 06 Jan 2024
Efficient Reinforcement Learning via Decoupling Exploration and Utilization Jingpu Yang Helin Wang Qirui Zhao Zhecheng Shi Zirui Song Miao Fang 281 3 0 26 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey Dom Huh Prasant Mohapatra AI4CE 278 30 0 15 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control Bernd Frauenknecht Tobias Ehlgen Sebastian Trimpe 189 4 0 30 Nov 2023
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans Neural Information Processing Systems (NeurIPS), 2023 Kyowoon Lee Seongun Kim Jaesik Choi DiffM 215 19 0 30 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL Yiqin Tan Ling Pan Longbo Huang OffRL 249 0 0 21 Oct 2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL International Conference on Learning Representations (ICLR), 2023 Xiyao Wang Ruijie Zheng Yanchao Sun Ruonan Jia Wichayaporn Wongkamjan Huazhe Xu Furong Huang OffRL 243 16 0 11 Oct 2023
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023 Haoran Wang Zeshen Tang Leya Yang Yaoru Sun Fang Wang Siyu Zhang Ye-Ting Chen 233 2 0 24 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization Neural Information Processing Systems (NeurIPS), 2023 Hai Zhang Hang Yu Siyue Tao Di Zhang Chang Huang Hongtu Zhou Xiao Zhang Chen Ye 221 12 0 22 Sep 2023
Introspective Deep Metric Learning IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Cheng-Hao Wang Wenzhao Zheng Zheng Hua Zhu Jie Zhou Jiwen Lu UQCV 201 18 0 11 Sep 2023
Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning Marin Vlastelica Sebastian Blaes Cristina Pinneri Georg Martius 110 2 0 11 Sep 2023
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning European Symposium on Research in Computer Security (ESORICS), 2023 M. Rigaki Sebastian Garcia AAML 113 6 0 31 Aug 2023
Efficient Epistemic Uncertainty Estimation in Regression Ensemble Models Using Pairwise-Distance Estimators Lucas Berry David Meger UD 323 3 0 25 Aug 2023
Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback Neural Information Processing Systems (NeurIPS), 2023 Taeho Yoon Kibeom Myoung Keon Lee Jaewoong Cho Albert No Ernest K. Ryu 221 11 0 06 Jul 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis Neural Information Processing Systems (NeurIPS), 2023 Alexander Meulemans Simon Schug Seijin Kobayashi Nathaniel D. Daw Gregory Wayne 300 6 0 29 Jun 2023
Learning non-Markovian Decision-Making from State-only Sequences Neural Information Processing Systems (NeurIPS), 2023 Aoyang Qin Feng Gao Qing Li Song-Chun Zhu Sirui Xie 225 11 0 27 Jun 2023
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching Conference on Robot Learning (CoRL), 2023 H.J. Terry Suh Glen Chou Hongkai Dai Lujie Yang Abhishek Gupta Russ Tedrake DiffM OffRL 212 14 0 24 Jun 2023
