ResearchTrend.AI
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

23 February 2022
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
    OffRL

Papers citing "Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning"

50 / 100 papers shown
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
Xuyang Chen
Keyu Yan
Lin Zhao
OffRL
42
0
0
08 May 2025
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Z. Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
44
0
0
03 May 2025
Uncertainty-aware Latent Safety Filters for Avoiding Out-of-Distribution Failures
Junwon Seo
Kensuke Nakamura
Andrea V. Bajcsy
51
0
0
01 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
22
0
0
17 Apr 2025
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
Xuyang Chen
Guojian Wang
Keyu Yan
Lin Zhao
OffRL
20
1
0
16 Apr 2025
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Ke Jiang
Wen Jiang
Y. Li
Xiaoyang Tan
OffRL
35
0
0
02 Apr 2025
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Hongye Cao
Fan Feng
Jing Huo
Shangdong Yang
Meng Fang
Tianpei Yang
Yang Gao
AAML
OffRL
55
0
0
26 Mar 2025
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
Yunkai Gao
Jiaming Guo
Fan Wu
Rui Zhang
OffRL
47
0
0
07 Mar 2025
Improving Plasticity in Non-stationary Reinforcement Learning with Evidential Proximal Policy Optimization
Abdullah Akgul
Gulcin Baykal
Manuel Haußmann
M. Kandemir
30
0
0
03 Mar 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Xianyuan Zhan
Xiangyu Zhu
Peng Cheng
Xiao Hu
Ziteng He
...
Chenhui Liu
Tianshun Hong
Yan Liang
Yunxin Liu
Feng Zhao
AI4CE
57
0
0
17 Feb 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
61
1
0
17 Jan 2025
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
Alex Beeson
David Ireland
Giovanni Montana
OffRL
32
0
0
17 Nov 2024
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions
Rui Yang
Jie Wang
Guoping Wu
B. Li
AAML
OffRL
29
1
0
01 Nov 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
27
0
0
27 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
32
3
0
25 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
22
2
0
03 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
16
0
0
02 Oct 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
33
1
0
23 Aug 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
22
2
0
04 Aug 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
33
0
0
26 Jul 2024
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Zeyuan Liu
Kai Yang
Xiu Li
OffRL
35
0
0
11 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
27
0
0
05 Jun 2024
Combining Experimental and Historical Data for Policy Evaluation
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
39
0
0
01 Jun 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
25
0
0
25 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
38
2
0
23 May 2024
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses
Thanh Nguyen
T. Luu
Tri Ton
Chang D. Yoo
OffRL
AAML
27
0
0
18 May 2024
Reinformer: Max-Return Sequence Modeling for Offline RL
Zifeng Zhuang
Dengyun Peng
Jinxin Liu
Ziqi Zhang
Donglin Wang
OffRL
AI4TS
33
11
0
14 May 2024
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
38
1
0
12 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
32
2
0
10 May 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
14
9
0
30 Apr 2024
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
Xudong Yu
Chenjia Bai
Hongyi Guo
Changhong Wang
Zhen Wang
OffRL
19
0
0
09 Apr 2024
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning
Yeda Song
Dongwook Lee
Gunhee Kim
OffRL
25
0
0
06 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
25
0
0
03 Apr 2024
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqing Zhang
Ziwei Luo
Jens Sjölund
Thomas B. Schon
Per Mattsson
8
7
0
06 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
11
9
0
06 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
14
10
0
01 Feb 2024
Off-Policy Primal-Dual Safe Reinforcement Learning
Zifan Wu
Bo Tang
Qian Lin
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
11
1
0
26 Jan 2024
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
UD
16
1
0
05 Jan 2024
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles
Yuanzhao Zhai
Han Zhang
Yu Lei
Yue Yu
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
AI4CE
55
31
0
30 Dec 2023
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?
Gunshi Gupta
Tim G. J. Rudner
R. McAllister
Adrien Gaidon
Y. Gal
OffRL
32
3
0
28 Dec 2023
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang
Jie Liu
Chuming Li
Yazhe Niu
Yaodong Yang
Yu Liu
Wanli Ouyang
OffRL
OnRL
31
3
0
12 Dec 2023
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
OffRL
21
3
0
07 Dec 2023
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
19
3
0
07 Dec 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
15
9
0
30 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
30
15
0
19 Oct 2023
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Rui Zheng
Wei Shen
Yuan Hua
Wenbin Lai
Shihan Dou
...
Xiao Wang
Haoran Huang
Tao Gui
Qi Zhang
Xuanjing Huang
52
14
0
18 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
14
7
0
09 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Donglin Wang
OffRL
OnRL
19
0
0
07 Oct 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Xiaoyu Wen
Xudong Yu
Rui Yang
Chenjia Bai
Zhen Wang
OffRL
OnRL
11
10
0
29 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
27
0
0
26 Sep 2023