2103.05741
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds
9 March 2021
Yihao Feng
Ziyang Tang
Na Zhang
Qiang Liu
OffRL
Papers citing
"Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds"
12 / 12 papers shown
Title
Conformal Prediction with Upper and Lower Bound Models
Miao Li
Michael Klamkin
Mathieu Tanneau
Reza Zandehshahvar
Pascal Van Hentenryck
92
0
0
06 Mar 2025
Multiple-policy Evaluation via Density Estimation
Yilei Chen
Aldo Pacchiano
I. Paschalidis
OffRL
62
1
0
29 Mar 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
55
0
0
24 Dec 2023
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities
Yuhan Li
Hongtao Zhang
Keaven M Anderson
Songzi Li
Ruoqing Zhu
57
0
0
30 Nov 2023
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
123
4
0
04 Oct 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
83
5
0
23 Sep 2023
Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm
Huiming Zhang
Haoyu Wei
Guang Cheng
77
1
0
13 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
83
10
0
02 Mar 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskill
Philip S. Thomas
OffRL
92
6
0
24 Jan 2023
A Unified Framework for Alternating Offline Model Training and Policy Learning
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
118
17
0
12 Oct 2022
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
106
53
0
26 Apr 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
OffRL
87
40
0
06 Feb 2021