1912.13088
Cited By
Off-Policy Estimation of Long-Term Average Outcomes with Applications to Mobile Health
30 December 2019
Peng Liao
P. Klasnja
S. Murphy
OffRL
Papers citing
"Off-Policy Estimation of Long-Term Average Outcomes with Applications to Mobile Health"
15 / 15 papers shown
Title
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
37
0
0
02 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
114
0
0
01 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
62
0
0
22 Feb 2025
Spatially Randomized Designs Can Enhance Policy Evaluation
Ying Yang
Chengchun Shi
Fang Yao
Shouyang Wang
Hongtu Zhu
OffRL
33
0
0
18 Mar 2024
A Reinforcement Learning Framework for Dynamic Mediation Analysis
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
27
5
0
31 Jan 2023
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Yang Xu
C. Shi
S. Luo
Lan Wang
R. Song
OffRL
27
4
0
29 Dec 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
37
9
0
03 Mar 2022
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
C. Shi
Jin Zhu
Ye Shen
S. Luo
Hong Zhu
R. Song
OffRL
21
30
0
22 Feb 2022
Statistical Learning for Individualized Asset Allocation
Yi Ding
Yingying Li
Rui Song
15
0
0
20 Jan 2022
SOPE: Spectrum of Off-Policy Estimators
C. J. Yuan
Yash Chandak
S. Giguere
Philip S. Thomas
S. Niekum
OffRL
42
5
0
06 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Yupeng Tang
Seung-seob Lee
OffRL
52
22
0
24 Oct 2021
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
A. Qu
27
22
0
20 Oct 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
30
52
0
26 Apr 2021
Batch Policy Learning in Average Reward Markov Decision Processes
Peng Liao
Zhengling Qi
Runzhe Wan
P. Klasnja
S. Murphy
OffRL
12
81
0
23 Jul 2020
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward
S. Murphy
Yanzhen Deng
Eric B. Laber
H. Maei
R. Sutton
K. Witkiewitz
OffRL
25
22
0
18 Jul 2016