Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds

9 March 2021
Yihao Feng
Ziyang Tang
Na Zhang
Qiang Liu
    OffRL

Papers citing "Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds"

12 / 12 papers shown
Conformal Prediction with Upper and Lower Bound Models
Miao Li
Michael Klamkin
Mathieu Tanneau
Reza Zandehshahvar
Pascal Van Hentenryck
92
0
0
06 Mar 2025
Multiple-policy Evaluation via Density Estimation
Yilei Chen
Aldo Pacchiano
I. Paschalidis
OffRL
62
1
0
29 Mar 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
55
0
0
24 Dec 2023
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities
Yuhan Li
Hongtao Zhang
Keaven M Anderson
Songzi Li
Ruoqing Zhu
57
0
0
30 Nov 2023
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
123
4
0
04 Oct 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
83
5
0
23 Sep 2023
Tight Non-asymptotic Inference via Sub-Gaussian Intrinsic Moment Norm
Huiming Zhang
Haoyu Wei
Guang Cheng
77
1
0
13 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
83
10
0
02 Mar 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskill
Philip S. Thomas
OffRL
92
6
0
24 Jan 2023
A Unified Framework for Alternating Offline Model Training and Policy Learning
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
118
17
0
12 Oct 2022
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
106
53
0
26 Apr 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
OffRL
87
40
0
06 Feb 2021