Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.22492
Cited By
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
28 May 2025
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation"
2 / 2 papers shown
Title
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
196
3
0
03 Oct 2024
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
121
4
0
04 Oct 2023
1