2102.03607
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
International Conference on Machine Learning (ICML), 2021
6 February 2021
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
OffRL
Papers citing "Bootstrapping Fitted Q-Evaluation for Off-Policy Inference"
30 / 30 papers shown
A Tutorial: An Intuitive Explanation of Offline Reinforcement Learning Theory
Fengdi Che
OffRL
165
0
0
11 Aug 2025
Central Limit Theorems for Transition Probabilities of Controlled Markov Chains
Ziwei Su
Imon Banerjee
Diego Klabjan
OffRL
207
0
0
02 Aug 2025
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
207
4
0
28 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
958
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
630
4
0
22 Feb 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CML
OffRL
328
7
0
08 Dec 2024
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
452
1
0
05 Dec 2024
Off-Policy Selection for Initiating Human-Centric Experimental Design
Neural Information Processing Systems (NeurIPS), 2024
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
332
0
0
26 Oct 2024
Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Runpeng Dai
Jianing Wang
Fan Zhou
Shuang Luo
Zhiwei Qin
Chengchun Shi
Hongtu Zhu
CML
OffRL
289
3
0
25 Jul 2024
Why long model-based rollouts are no reason for bad Q-value estimates
Philipp Wissmann
Daniel Hein
Steffen Udluft
Volker Tresp
OffRL
LRM
190
2
0
16 Jul 2024
Combining Experimental and Historical Data for Policy Evaluation
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
455
4
0
01 Jun 2024
Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jin Zhu
Runzhe Wan
Zhengling Qi
Shuang Luo
C. Shi
OffRL
364
5
0
28 Oct 2023
Off-Policy Evaluation for Human Feedback
Neural Information Processing Systems (NeurIPS), 2023
Qitong Gao
Ge Gao
Juncheng Dong
Vahid Tarokh
Min Chi
Miroslav Pajic
OffRL
354
8
0
11 Oct 2023
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
403
5
0
04 Oct 2023
Estimation and Inference in Distributional Reinforcement Learning
Liangyu Zhang
Yang Peng
Jiadong Liang
Wenhao Yang
Zhihua Zhang
OffRL
189
4
0
29 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
299
7
0
23 Sep 2023
Off-policy Evaluation in Doubly Inhomogeneous Environments
Journal of the American Statistical Association (JASA), 2023
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
302
12
0
14 Jun 2023
K-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control
Michael Giegrich
Roel Oomen
C. Reisinger
OffRL
233
2
0
07 Jun 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Machine-mediated learning (ML), 2023
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasnja
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
468
13
0
11 Apr 2023
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples
AAAI Conference on Artificial Intelligence (AAAI), 2023
Mustafa O. Karabag
Ufuk Topcu
OffRL
279
6
0
07 Mar 2023
A Reinforcement Learning Framework for Dynamic Mediation Analysis
International Conference on Machine Learning (ICML), 2023
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
358
6
0
31 Jan 2023
Variational Latent Branching Model for Off-Policy Evaluation
International Conference on Learning Representations (ICLR), 2023
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
378
7
0
28 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
International Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
339
24
0
29 Dec 2022
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Yang Xu
C. Shi
Shuang Luo
Lan Wang
R. Song
OffRL
272
6
0
29 Dec 2022
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Takuma Udagawa
Haruka Kiyohara
Yusuke Narita
Yuta Saito
Keisuke Tateno
OffRL
244
27
0
25 Nov 2022
Policy Optimization with Sparse Global Contrastive Explanations
Jiayu Yao
S. Parbhoo
Weiwei Pan
Finale Doshi-Velez
OffRL
193
3
0
13 Jul 2022
Conformal Off-policy Prediction
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yingying Zhang
C. Shi
Shuang Luo
OffRL
304
14
0
14 Jun 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Annals of Statistics (Ann. Stat.), 2022
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
538
14
0
03 Mar 2022
Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Chengzhuo Ni
Ruiqi Zhang
Xiang Ji
Xuezhou Zhang
Mengdi Wang
OffRL
358
1
0
31 Jan 2022
Statistical Testing under Distributional Shifts
Nikolaj Thams
Sorawit Saengkyongam
Niklas Pfister
J. Peters
OOD
426
11
0
22 May 2021