2110.15501
Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
29 October 2021
Ye Shen
Hengrui Cai
Rui Song
OffRL
Papers citing "Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning"
1 / 1 papers shown
Title
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
119
30
0
19 Oct 2022