ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.13239
  4. Cited By
MOPO: Model-based Offline Policy Optimization
v1v2v3v4v5v6 (latest)

MOPO: Model-based Offline Policy Optimization

Neural Information Processing Systems (NeurIPS), 2020
27 May 2020
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
    OffRL
ArXiv (abs)PDFHTMLGithub (179★)

Papers citing "MOPO: Model-based Offline Policy Optimization"

50 / 538 papers shown
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
Zhongjian Qiao
Rui Yang
Jiafei Lyu
Chenjia Bai
Xiu Li
Zhuoran Yang
Siyang Gao
207
0
0
02 Dec 2025
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
Zhongjian Qiao
Rui Yang
Jiafei Lyu
Xiu Li
Zhongxiang Dai
Zhuoran Yang
Siyang Gao
Shuang Qiu
OffRL
227
2
0
02 Dec 2025
Efficient Diffusion Planning with Temporal Diffusion
Efficient Diffusion Planning with Temporal Diffusion
Jiaming Guo
Rui Zhang
Z. Li
Yunkai Gao
Shaohui Peng
Siming Lan
Xing Hu
Zidong Du
Xishan Zhang
Ling Li
DiffM
219
0
0
26 Nov 2025
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
OffRL
194
1
0
04 Nov 2025
Social World Model-Augmented Mechanism Design Policy Learning
Social World Model-Augmented Mechanism Design Policy Learning
Xiaoyuan Zhang
Y. Huang
Chengdong Ma
Zhixun Chen
Long Ma
Yali Du
Song-Chun Zhu
Yaodong Yang
Xue Feng
174
0
0
22 Oct 2025
Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning
Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning
Kevin Huang
Rosario Scalise
Cleah Winston
Ayush Agrawal
Yunchu Zhang
...
Byron Boots
Benjamin Burchfiel
Hongkai Dai
Masha Itkina
Paarth Shah
OffRL
345
0
0
22 Oct 2025
Internalizing World Models via Self-Play Finetuning for Agentic RL
Internalizing World Models via Self-Play Finetuning for Agentic RL
S. Chen
Tongyao Zhu
Z. Wang
Jinghan Zhang
Kangrui Wang
Siyang Gao
Teng Xiao
Yee Whye Teh
Junxian He
Manling Li
OffRLLRM
156
10
0
16 Oct 2025
Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
Shangzhe Li
Dongruo Zhou
Weitong Zhang
OffRL
261
1
0
10 Oct 2025
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation
Xiaofeng Cao
Mingwei Xu
Xin Yu
Jiangchao Yao
Wei Ye
...
Minling Zhang
Ivor Tsang
Yew-Soon Ong
James T. Kwok
Heng Tao Shen
221
15
0
10 Oct 2025
Expressive Value Learning for Scalable Offline Reinforcement Learning
Expressive Value Learning for Scalable Offline Reinforcement Learning
Nicolas Espinosa-Dice
Kianté Brantley
Wen Sun
OffRL
308
1
0
09 Oct 2025
Offline Reinforcement Learning in Large State Spaces: Algorithms and Guarantees
Offline Reinforcement Learning in Large State Spaces: Algorithms and Guarantees
Nan Jiang
Tengyang Xie
OffRL
243
16
0
05 Oct 2025
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
Kai Fukazawa
Kunal Mundada
Iman Soltani
OffRL
219
0
0
03 Oct 2025
PASTA: A Unified Framework for Offline Assortment Learning
PASTA: A Unified Framework for Offline Assortment Learning
Juncheng Dong
Weibin Mo
Zhengling Qi
C. Shi
Ethan X. Fang
Vahid Tarokh
OffRL
217
0
0
02 Oct 2025
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As
Chengrui Qu
Benjamin Unger
Dongho Kang
Max van der Hart
Laixi Shi
Stelian Coros
Adam Wierman
Andreas Krause
OffRL
426
2
0
23 Sep 2025
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search
Zhiyu Mou
Yiqin Lv
Miao Xu
Cheems Wang
Yixiu Mao
...
Rongquan Bai
Chuan Yu
Jian Xu
Bo Zheng
Bo Zheng
OffRL
292
2
0
19 Sep 2025
Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
Jiaqi Chen
Ji Shi
Cansu Sancaktar
Jonas Frey
Georg Martius
OffRL
155
0
0
06 Sep 2025
Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI
Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI
Dilruk Perera
Gousia Habib
Qianyi Xu
Daniel J. Tan
Kai He
Erik Cambria
Mengling Feng
OffRLAI4TS
341
0
0
28 Aug 2025
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
Tan Jing
Xiaorui Li
Chao Yao
Xiaojuan Ban
Yuetong Fang
Zhanchen Zhu
Zhaolin Yuan
OffRL
171
0
0
27 Aug 2025
Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Dream to Chat: Model-based Reinforcement Learning on Dialogues with User Belief Modeling
Yue Zhao
Xiaoyu Wang
Dan Wang
Zhonglin Jiang
Qingqing Gu
Teng Chen
Ningyuan Xi
Jinxian Qu
Yong Chen
Luo Ji
297
0
0
23 Aug 2025
Central Limit Theorems for Transition Probabilities of Controlled Markov Chains
Central Limit Theorems for Transition Probabilities of Controlled Markov Chains
Ziwei Su
Imon Banerjee
Diego Klabjan
OffRL
233
0
0
02 Aug 2025
Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction
Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction
Alex Durkin
Jasper Stolte
Matthew Jones
Raghuraman Pitchumani
Bei Li
Christian Michler
Mehmet Mercangöz
OffRLOnRL
285
1
0
30 Jul 2025
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Lu Guo
Yixiang Shan
Zhengbang Zhu
Qifan Liang
Lichang Song
Ting Long
Weinan Zhang
Yi-Ju Chang
OffRL
257
0
0
21 Jul 2025
Latent Policy Steering with Embodiment-Agnostic Pretrained World Models
Latent Policy Steering with Embodiment-Agnostic Pretrained World Models
Yiqi Wang
Mrinal Verghese
Jeff Schneider
341
8
0
17 Jul 2025
Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior
Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior
Shizhe Cai
Zeya Yin
Jayadeep Jacob
Fabio Ramos
BDL
232
0
0
09 Jul 2025
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
329
0
0
18 Jun 2025
MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning
Yihong Guo
Yu Yang
Pan Xu
Anqi Liu
OffRL
312
5
0
10 Jun 2025
Accelerating Diffusion Planners in Offline RL via Reward-Aware Consistency Trajectory Distillation
Accelerating Diffusion Planners in Offline RL via Reward-Aware Consistency Trajectory Distillation
Xintong Duan
Yutong He
Fahim Tajwar
Ruslan Salakhutdinov
J. Zico Kolter
J. Schneider
OffRL
385
1
0
09 Jun 2025
Horizon Reduction Makes RL Scalable
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
731
24
0
04 Jun 2025
Hybrid Cross-domain Robust Reinforcement Learning
Hybrid Cross-domain Robust Reinforcement Learning
Linh Le Pham Van
Minh Hoang Nguyen
Hung Le
H. Tran
Sunil R. Gupta
OffRL
275
3
0
29 May 2025
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning
Mattie Fellows
Clarisse Wibault
Uljad Berdica
Johannes Forkel
Jakob Foerster
Michael A. Osborne
OffRLOnRL
362
0
0
28 May 2025
Scaling Offline RL via Efficient and Expressive Shortcut Models
Scaling Offline RL via Efficient and Expressive Shortcut Models
Nicolas Espinosa-Dice
Yiyi Zhang
Yiding Chen
Bradley Guo
Owen Oertell
Gokul Swamy
Kianté Brantley
Wen Sun
OffRLLRM
290
8
0
28 May 2025
Decision Flow Policy Optimization
Decision Flow Policy Optimization
Jifeng Hu
Sili Huang
Siyuan Guo
Zhaogeng Liu
Li Shen
Lichao Sun
Hechang Chen
Yi-Ju Chang
Dacheng Tao
389
0
0
26 May 2025
medDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support
medDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support
Qianyi Xu
Gousia Habib
Dilruk Perera
Mengling Feng
Mengling Feng
OffRL
429
1
0
26 May 2025
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
Marvin Alles
Nutan Chen
Patrick van der Smagt
Botond Cseke
464
3
0
20 May 2025
Imagination-Limited Q-Learning for Offline Reinforcement Learning
Imagination-Limited Q-Learning for Offline Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Wenhui Liu
Zhijian Wu
Jingchao Wang
Dingjiang Huang
Shuigeng Zhou
OffRL
375
1
0
18 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Longji Xu
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRLLLMAGLM&RoLRM
287
1
0
15 May 2025
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
Beyond the Known: Decision Making with Counterfactual Reasoning Decision TransformerInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Minh Hoang Nguyen
Linh Le Pham Van
Thommen George Karimpanal
Sunil Gupta
Hung Le
OffRLLRM
335
2
0
14 May 2025
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic RewardAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
312
5
0
12 May 2025
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
Xuyang Chen
Keyu Yan
Wenhan Cao
Tianyuan Chen
OffRL
582
2
0
08 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
473
1
0
04 May 2025
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation
Wenxuan Li
Hang Zhao
Zhiyuan Yu
Yu Du
Qin Zou
Ruizhen Hu
K. Xu
SSL
514
11
0
23 Apr 2025
Improving Sequential Recommenders through Counterfactual Augmentation of System Exposure
Improving Sequential Recommenders through Counterfactual Augmentation of System ExposureAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Ziqi Zhao
Zhaochun Ren
Jiyuan Yang
Zuming Yan
Zihan Wang
Liu Yang
Sudipta Singha Roy
Zhumin Chen
Maarten de Rijke
Xin Xin
CML
366
3
0
18 Apr 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
478
3
0
17 Apr 2025
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
Xuyang Chen
Guojian Wang
Keyu Yan
Tianyuan Chen
OffRL
613
1
0
16 Apr 2025
A Clean Slate for Offline Reinforcement Learning
A Clean Slate for Offline Reinforcement Learning
Matthew Jackson
Uljad Berdica
Jarek Liesen
Shimon Whiteson
Jakob Foerster
OffRLOnRL
497
3
0
15 Apr 2025
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
Vinal Asodia
Zhenhua Feng
Saber Fallah
Zhenhua Feng
Saber Fallah
OffRL
351
2
0
11 Apr 2025
Learning with Imperfect Models: When Multi-step Prediction Mitigates Compounding Error
Learning with Imperfect Models: When Multi-step Prediction Mitigates Compounding Error
Anne Somalwar
Bruce D. Lee
George J. Pappas
Nikolai Matni
238
4
0
02 Apr 2025
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Ke Jiang
Wen Jiang
You Li
Xiaoyang Tan
OffRL
425
1
0
02 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Zhiwen Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
Ilya Kolmanovsky
Dimitar Filev
285
4
0
31 Mar 2025
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Hongye Cao
Fan Feng
Jing Huo
Shangdong Yang
Meng Fang
Zhenxing Ge
Yang Gao
AAMLOffRL
286
2
0
26 Mar 2025
1234...91011
Next
Page 1 of 11
Pageof 11