2202.10589
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
22 February 2022
C. Shi
Jin Zhu
Ye Shen
S. Luo
Hong Zhu
R. Song
OffRL
Papers citing
"Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process"
23 / 23 papers shown
Title
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
87
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
60
0
0
22 Feb 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CML
OffRL
72
1
0
08 Dec 2024
A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric Models
Jiayi Wang
Zhengling Qi
Raymond K. W. Wong
22
0
0
14 Jun 2024
A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning
Pan Zhao
Yifan Cui
CML
21
1
0
14 Oct 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
13
4
0
23 Sep 2023
Contextual Dynamic Pricing with Strategic Buyers
Pang-Tung Liu
Zhuoran Yang
Zhaoran Wang
W. Sun
19
4
0
08 Jul 2023
Statistical Inference on Multi-armed Bandits with Delayed Feedback
Lei Shi
Jingshen Wang
Tianhao Wu
14
4
0
03 Jul 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
6
6
0
01 Jun 2023
Statistical Inference with Stochastic Gradient Methods under φ-mixing Data
Ruiqi Liu
X. Chen
Zuofeng Shang
FedML
17
6
0
24 Feb 2023
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Fuchun Sun
Libo Huang
Z. Hao
CML
26
27
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
13
9
0
01 Feb 2023
A Reinforcement Learning Framework for Dynamic Mediation Analysis
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
19
5
0
31 Jan 2023
Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent
X. Chen
Zehua Lai
He Li
Yichen Zhang
16
4
0
30 Dec 2022
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
Yang Xu
Jin Zhu
C. Shi
S. Luo
R. Song
OffRL
16
12
0
29 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
18
0
0
23 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
17
47
0
13 Dec 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
19
10
0
21 Sep 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
18
16
0
26 Jul 2022
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes
C. Shi
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
9
22
0
12 Nov 2021
Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable
Shuxiao Chen
B. Zhang
15
19
0
15 Apr 2021
Who Make Drivers Stop? Towards Driver-centric Risk Assessment: Risk Object Identification via Causal Inference
Chengxi Li
Stanley H. Chan
Yi-Ting Chen
CML
76
51
0
05 Mar 2020
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
198
1,325
0
05 Jun 2016