ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.10589
  4. Cited By
Off-Policy Confidence Interval Estimation with Confounded Markov
  Decision Process

Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process

22 February 2022
C. Shi
Jin Zhu
Ye Shen
S. Luo
Hong Zhu
R. Song
    OffRL
ArXivPDFHTML

Papers citing "Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process"

23 / 23 papers shown
Title
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
87
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
60
0
0
22 Feb 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement
  Learning
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CML
OffRL
72
1
0
08 Dec 2024
A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric Models
A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric Models
Jiayi Wang
Zhengling Qi
Raymond K. W. Wong
22
0
0
14 Jun 2024
A Semiparametric Instrumented Difference-in-Differences Approach to
  Policy Learning
A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning
Pan Zhao
Yifan Cui
CML
21
1
0
14 Oct 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
  Error Quantification Framework
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
13
4
0
23 Sep 2023
Contextual Dynamic Pricing with Strategic Buyers
Contextual Dynamic Pricing with Strategic Buyers
Pang-Tung Liu
Zhuoran Yang
Zhaoran Wang
W. Sun
19
4
0
08 Jul 2023
Statistical Inference on Multi-armed Bandits with Delayed Feedback
Statistical Inference on Multi-armed Bandits with Delayed Feedback
Lei Shi
Jingshen Wang
Tianhao Wu
14
4
0
03 Jul 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden
  Confounding
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
6
6
0
01 Jun 2023
Statistical Inference with Stochastic Gradient Methods under
  $φ$-mixing Data
Statistical Inference with Stochastic Gradient Methods under φφφ-mixing Data
Ruiqi Liu
X. Chen
Zuofeng Shang
FedML
17
6
0
24 Feb 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Fuchun Sun
Libo Huang
Z. Hao
CML
26
27
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous
  Unobserved Confounders
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
13
9
0
01 Feb 2023
A Reinforcement Learning Framework for Dynamic Mediation Analysis
A Reinforcement Learning Framework for Dynamic Mediation Analysis
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
19
5
0
31 Jan 2023
Online Statistical Inference for Contextual Bandits via Stochastic
  Gradient Descent
Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent
X. Chen
Zehua Lai
He Li
Yichen Zhang
16
4
0
30 Dec 2022
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
Yang Xu
Jin Zhu
C. Shi
S. Luo
R. Song
OffRL
16
12
0
29 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine
  Interaction with Private Information
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
18
0
0
23 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
17
47
0
13 Dec 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision
  Processes under Non-Parametric Models
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
19
10
0
21 Sep 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
18
16
0
26 Jul 2022
A Minimax Learning Approach to Off-Policy Evaluation in Confounded
  Partially Observable Markov Decision Processes
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes
C. Shi
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
9
22
0
12 Nov 2021
Estimating and Improving Dynamic Treatment Regimes With a Time-Varying
  Instrumental Variable
Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable
Shuxiao Chen
B. Zhang
15
19
0
15 Apr 2021
Who Make Drivers Stop? Towards Driver-centric Risk Assessment: Risk
  Object Identification via Causal Inference
Who Make Drivers Stop? Towards Driver-centric Risk Assessment: Risk Object Identification via Causal Inference
Chengxi Li
Stanley H. Chan
Yi-Ting Chen
CML
76
51
0
05 Mar 2020
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
198
1,325
0
05 Jun 2016
1