Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
International Conference on Machine Learning (ICML), 2021
6 February 2021
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
    OffRL

Papers citing "Bootstrapping Fitted Q-Evaluation for Off-Policy Inference"

30 / 30 papers shown
A Tutorial: An Intuitive Explanation of Offline Reinforcement Learning Theory
Fengdi Che
OffRL
165
0
0
11 Aug 2025
Central Limit Theorems for Transition Probabilities of Controlled Markov Chains
Ziwei Su
Imon Banerjee
Diego Klabjan
OffRL
207
0
0
02 Aug 2025
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
207
4
0
28 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
958
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
630
4
0
22 Feb 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CML, OffRL
328
7
0
08 Dec 2024
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL, OnRL
452
1
0
05 Dec 2024
Off-Policy Selection for Initiating Human-Centric Experimental Design
Neural Information Processing Systems (NeurIPS), 2024
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
332
0
0
26 Oct 2024
Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Runpeng Dai
Jianing Wang
Fan Zhou
Shuang Luo
Zhiwei Qin
Chengchun Shi
Hongtu Zhu
CML, OffRL
289
3
0
25 Jul 2024
Why long model-based rollouts are no reason for bad Q-value estimates
Philipp Wissmann
Daniel Hein
Steffen Udluft
Volker Tresp
OffRL, LRM
190
2
0
16 Jul 2024
Combining Experimental and Historical Data for Policy Evaluation
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
455
4
0
01 Jun 2024
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jin Zhu
Runzhe Wan
Zhengling Qi
Shuang Luo
C. Shi
OffRL
364
5
0
28 Oct 2023
Off-Policy Evaluation for Human Feedback
Neural Information Processing Systems (NeurIPS), 2023
Qitong Gao
Ge Gao
Juncheng Dong
Vahid Tarokh
Min Chi
Miroslav Pajic
OffRL
354
8
0
11 Oct 2023
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
403
5
0
04 Oct 2023
Estimation and Inference in Distributional Reinforcement Learning
Liangyu Zhang
Yang Peng
Jiadong Liang
Wenhao Yang
Zhihua Zhang
OffRL
189
4
0
29 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
299
7
0
23 Sep 2023
Off-policy Evaluation in Doubly Inhomogeneous Environments
Journal of the American Statistical Association (JASA), 2023
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
302
12
0
14 Jun 2023
$K$-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control
Michael Giegrich
Roel Oomen
C. Reisinger
OffRL
233
2
0
07 Jun 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Machine-mediated learning (ML), 2023
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasjna
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
468
13
0
11 Apr 2023
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples
AAAI Conference on Artificial Intelligence (AAAI), 2023
Mustafa O. Karabag
Ufuk Topcu
OffRL
279
6
0
07 Mar 2023
A Reinforcement Learning Framework for Dynamic Mediation Analysis
International Conference on Machine Learning (ICML), 2023
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
358
6
0
31 Jan 2023
Variational Latent Branching Model for Off-Policy Evaluation
International Conference on Learning Representations (ICLR), 2023
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
378
7
0
28 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
International Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
339
24
0
29 Dec 2022
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Yang Xu
C. Shi
Shuang Luo
Lan Wang
R. Song
OffRL
272
6
0
29 Dec 2022
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Takuma Udagawa
Haruka Kiyohara
Yusuke Narita
Yuta Saito
Keisuke Tateno
OffRL
244
27
0
25 Nov 2022
Policy Optimization with Sparse Global Contrastive Explanations
Jiayu Yao
S. Parbhoo
Weiwei Pan
Finale Doshi-Velez
OffRL
193
3
0
13 Jul 2022
Conformal Off-policy Prediction
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yingying Zhang
C. Shi
Shuang Luo
OffRL
304
14
0
14 Jun 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Annals of Statistics (Ann. Stat.), 2022
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
538
14
0
03 Mar 2022
Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Chengzhuo Ni
Ruiqi Zhang
Xiang Ji
Xuezhou Zhang
Mengdi Wang
OffRL
358
1
0
31 Jan 2022
Statistical Testing under Distributional Shifts
Nikolaj Thams
Sorawit Saengkyongam
Niklas Pfister
J. Peters
OOD
426
11
0
22 May 2021