ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.03607
  4. Cited By
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
v1v2v3 (latest)

Bootstrapping Fitted Q-Evaluation for Off-Policy Inference

6 February 2021
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Bootstrapping Fitted Q-Evaluation for Off-Policy Inference"

28 / 28 papers shown
Title
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
56
0
0
28 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
445
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
272
2
0
22 Feb 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRLOnRL
164
0
0
31 Dec 2024
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement
  Learning
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CMLOffRL
105
4
0
08 Dec 2024
Off-Policy Selection for Initiating Human-Centric Experimental Design
Off-Policy Selection for Initiating Human-Centric Experimental Design
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
76
0
0
26 Oct 2024
Causal Deepsets for Off-policy Evaluation under Spatial or
  Spatio-temporal Interferences
Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Runpeng Dai
Jianing Wang
Fan Zhou
Shuang Luo
Zhiwei Qin
Chengchun Shi
Hongtu Zhu
CMLOffRL
64
2
0
25 Jul 2024
Why long model-based rollouts are no reason for bad Q-value estimates
Why long model-based rollouts are no reason for bad Q-value estimates
Philipp Wissmann
Daniel Hein
Steffen Udluft
Volker Tresp
OffRLLRM
50
1
0
16 Jul 2024
Combining Experimental and Historical Data for Policy Evaluation
Combining Experimental and Historical Data for Policy Evaluation
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
106
1
0
01 Jun 2024
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
Jin Zhu
Runzhe Wan
Zhengling Qi
Shuang Luo
C. Shi
OffRL
74
1
0
28 Oct 2023
Off-Policy Evaluation for Human Feedback
Off-Policy Evaluation for Human Feedback
Qitong Gao
Ge Gao
Juncheng Dong
Vahid Tarokh
Min Chi
Miroslav Pajic
OffRL
86
5
0
11 Oct 2023
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
121
4
0
04 Oct 2023
Estimation and Inference in Distributional Reinforcement Learning
Estimation and Inference in Distributional Reinforcement Learning
Liangyu Zhang
Yang Peng
Jiadong Liang
Wenhao Yang
Zhihua Zhang
OffRL
61
1
0
29 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
  Error Quantification Framework
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
73
5
0
23 Sep 2023
Off-policy Evaluation in Doubly Inhomogeneous Environments
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
107
7
0
14 Jun 2023
$K$-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic
  Control
KKK-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control
Michael Giegrich
Roel Oomen
C. Reisinger
OffRL
60
2
0
07 Jun 2023
Did we personalize? Assessing personalization by an online reinforcement
  learning algorithm using resampling
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasjna
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
72
9
0
11 Apr 2023
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement
  Learning with Dependent Samples
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples
Mustafa O. Karabag
Ufuk Topcu
OffRL
97
5
0
07 Mar 2023
A Reinforcement Learning Framework for Dynamic Mediation Analysis
A Reinforcement Learning Framework for Dynamic Mediation Analysis
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
62
6
0
31 Jan 2023
Variational Latent Branching Model for Off-Policy Evaluation
Variational Latent Branching Model for Off-Policy Evaluation
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
84
6
0
28 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
102
18
0
29 Dec 2022
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Yang Xu
C. Shi
Shuang Luo
Lan Wang
R. Song
OffRL
69
5
0
29 Dec 2022
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Takuma Udagawa
Haruka Kiyohara
Yusuke Narita
Yuta Saito
Keisuke Tateno
OffRL
60
25
0
25 Nov 2022
Policy Optimization with Sparse Global Contrastive Explanations
Policy Optimization with Sparse Global Contrastive Explanations
Jiayu Yao
S. Parbhoo
Weiwei Pan
Finale Doshi-Velez
OffRL
48
2
0
13 Jul 2022
Conformal Off-policy Prediction
Conformal Off-policy Prediction
Yingying Zhang
C. Shi
Shuang Luo
OffRL
79
11
0
14 Jun 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
110
9
0
03 Mar 2022
Optimal Estimation of Off-Policy Policy Gradient via Double Fitted
  Iteration
Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Chengzhuo Ni
Ruiqi Zhang
Xiang Ji
Xuezhou Zhang
Mengdi Wang
OffRL
82
1
0
31 Jan 2022
Statistical Testing under Distributional Shifts
Statistical Testing under Distributional Shifts
Nikolaj Thams
Sorawit Saengkyongam
Niklas Pfister
J. Peters
OOD
124
10
0
22 May 2021
1