ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.07989
  4. Cited By
Double Check Your State Before Trusting It: Confidence-Aware
  Bidirectional Offline Model-Based Imagination
v1v2 (latest)

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination

Neural Information Processing Systems (NeurIPS), 2022
16 June 2022
Jiafei Lyu
Xiu Li
Zongqing Lu
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination"

18 / 18 papers shown
Title
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
Shengjie Sun
Jiafei Lyu
Runze Liu
Mengbei Yan
Bo Liu
Deheng Ye
Xiu Li
OffRL
214
0
0
14 Nov 2025
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Lu Guo
Yixiang Shan
Zhengbang Zhu
Qifan Liang
Lichang Song
Ting Long
Weinan Zhang
Yi-Ju Chang
OffRL
154
0
0
21 Jul 2025
Exploration by Random Distribution Distillation
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
270
1
0
16 May 2025
Extendable Planning via Multiscale Diffusion
Extendable Planning via Multiscale Diffusion
Chang Chen
Hany Hamed
Doojin Baek
Taegu Kang
Samyeul Noh
Yoshua Bengio
Sungjin Ahn
329
4
0
25 Mar 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
1.0K
0
0
10 Feb 2025
Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu
Long Qian
Zeyang Liu
Lipeng Wan
Xingyu Chen
Xuguang Lan
OffRL
318
3
0
18 Nov 2024
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline
  Reinforcement Learning
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2024
Zhongjian Qiao
Jiafei Lyu
Kechen Jiao
Qi Liu
Xiu Li
OffRL
171
6
0
23 Aug 2024
CDSA: Conservative Denoising Score-based Algorithm for Offline
  Reinforcement Learning
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Zeyuan Liu
Kai Yang
Xiu Li
OffRL
275
0
0
11 Jun 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Fuchun Sun
Jingwen Yang
Zongqing Lu
Xiu Li
250
21
0
24 May 2024
Improving Offline Reinforcement Learning with Inaccurate Simulators
Improving Offline Reinforcement Learning with Inaccurate Simulators
Yiwen Hou
Haoyuan Sun
Jinming Ma
Feng Wu
OffRL
127
8
0
07 May 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
SEABO: A Simple Search-Based Method for Offline Imitation LearningInternational Conference on Learning Representations (ICLR), 2024
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
278
14
0
06 Feb 2024
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based
  Trajectory Stitching
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li
Yixiang Shan
Zhengbang Zhu
Ting Long
Weinan Zhang
OffRL
277
33
0
04 Feb 2024
Exploration and Anti-Exploration with Distributional Random Network
  Distillation
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
363
27
0
18 Jan 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Optimistic Model Rollouts for Pessimistic Offline Policy OptimizationAAAI Conference on Artificial Intelligence (AAAI), 2024
Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
OffRL
126
3
0
11 Jan 2024
CROP: Conservative Reward for Model-based Offline Policy Optimization
CROP: Conservative Reward for Model-based Offline Policy Optimization
Hao Li
Xiaohu Zhou
Mei-Jiang Gui
Shiqi Liu
Zhen-Qiu Feng
...
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Bo-Xian Yao
Zeng-Guang Hou
OffRL
149
4
0
26 Oct 2023
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality
  Synthetic Data from a Policy-Decoupled Approach
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach
Shixi Lian
Yi-An Ma
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
152
2
0
10 Jun 2023
Look Beneath the Surface: Exploiting Fundamental Symmetry for
  Sample-Efficient Offline RL
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RLNeural Information Processing Systems (NeurIPS), 2023
Peng Cheng
Xianyuan Zhan
Zhihao Wu
Wenjia Zhang
Shoucheng Song
Han Wang
Youfang Lin
Li Jiang
OffRL
570
15
0
07 Jun 2023
Uncertainty-driven Trajectory Truncation for Data Augmentation in
  Offline Reinforcement Learning
Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement LearningEuropean Conference on Artificial Intelligence (ECAI), 2023
Junjie Zhang
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Jun Yang
Le Wan
Xiu Li
OffRL
153
10
0
10 Apr 2023
1