ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.13609
  4. Cited By
Statistical Bootstrapping for Uncertainty Estimation in Off-Policy
  Evaluation

Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation

27 July 2020
Ilya Kostrikov
Ofir Nachum
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation"

25 / 25 papers shown
Title
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli
Michael Gimelfarb
Nathan Samuel de Lara
Haruki Nishimura
Masha Itkina
Florian Shkurti
OffRL
50
0
0
27 May 2025
Primal-Dual Spectral Representation for Off-policy Evaluation
Primal-Dual Spectral Representation for Off-policy Evaluation
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
85
0
0
23 Oct 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy
  Evaluation
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
55
0
0
24 Dec 2023
Probabilistic Offline Policy Ranking with Approximate Bayesian
  Computation
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation
Longchao Da
Porter Jenkins
Trevor Schwantes
Jeffrey Dotson
Hua Wei
OffRL
54
2
0
17 Dec 2023
Off-Policy Evaluation for Human Feedback
Off-Policy Evaluation for Human Feedback
Qitong Gao
Ge Gao
Juncheng Dong
Vahid Tarokh
Min Chi
Miroslav Pajic
OffRL
86
5
0
11 Oct 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
83
10
0
02 Mar 2023
Revisiting Bellman Errors for Offline Model Selection
Revisiting Bellman Errors for Offline Model Selection
Joshua P. Zitovsky
Daniel de Marchi
Rishabh Agarwal
Michael R. Kosorok University of North Carolina at Chapel Hill
OffRL
82
5
0
31 Jan 2023
Variational Latent Branching Model for Off-Policy Evaluation
Variational Latent Branching Model for Off-Policy Evaluation
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
84
6
0
28 Jan 2023
Learning Bellman Complete Representations for Offline Policy Evaluation
Learning Bellman Complete Representations for Offline Policy Evaluation
Jonathan D. Chang
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
69
16
0
12 Jul 2022
Hybrid Value Estimation for Off-policy Evaluation and Offline
  Reinforcement Learning
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning
Xuefeng Jin
Xu-Hui Liu
Shengyi Jiang
Yang Yu
OffRL
86
4
0
04 Jun 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through
  Ensembles, and Why Their Independence Matters
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
OffRL
90
72
0
27 May 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
72
0
0
22 May 2022
Off-Policy Fitted Q-Evaluation with Differentiable Function
  Approximators: Z-Estimation and Inference Theory
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang
Xuezhou Zhang
Chengzhuo Ni
Mengdi Wang
OffRL
90
16
0
10 Feb 2022
A Workflow for Offline Model-Free Robotic Reinforcement Learning
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
215
87
0
22 Sep 2021
Debiasing Samples from Online Learning Using Bootstrap
Debiasing Samples from Online Learning Using Bootstrap
Ningyuan Chen
Xuefeng Gao
Yi Xiong
OffRLOnRL
52
4
0
31 Jul 2021
Supervised Off-Policy Ranking
Supervised Off-Policy Ranking
Yue Jin
Yue Zhang
Tao Qin
Xudong Zhang
Jian Yuan
Houqiang Li
Tie-Yan Liu
OffRL
70
6
0
03 Jul 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and
  Optimization
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
91
49
0
28 Apr 2021
Universal Off-Policy Evaluation
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRLELM
106
53
0
26 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELMOffRL
97
104
0
30 Mar 2021
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and
  Dual Bounds
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds
Yihao Feng
Ziyang Tang
Na Zhang
Qiang Liu
OffRL
73
14
0
09 Mar 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
OffRL
87
40
0
06 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
196
83
0
01 Feb 2021
High-Confidence Off-Policy (or Counterfactual) Variance Estimation
High-Confidence Off-Policy (or Counterfactual) Variance Estimation
Yash Chandak
Shiv Shankar
Philip S. Thomas
OffRL
31
8
0
25 Jan 2021
Offline Policy Selection under Uncertainty
Offline Policy Selection under Uncertainty
Mengjiao Yang
Bo Dai
Ofir Nachum
George Tucker
Dale Schuurmans
OffRL
57
35
0
12 Dec 2020
Reliable Off-policy Evaluation for Reinforcement Learning
Reliable Off-policy Evaluation for Reinforcement Learning
Jie Wang
Rui Gao
H. Zha
OffRL
88
11
0
08 Nov 2020
1