ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.15501
  4. Cited By
Doubly Robust Interval Estimation for Optimal Policy Evaluation in
  Online Learning
v1v2v3 (latest)

Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning

29 October 2021
Ye Shen
Hengrui Cai
Rui Song
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning"

1 / 1 papers shown
Title
Anytime-valid off-policy inference for contextual bandits
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
119
30
0
19 Oct 2022
1