Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.15332
Cited By
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes
28 October 2021
Andrew Bennett
Nathan Kallus
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes"
11 / 11 papers shown
Title
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
66
0
0
01 May 2025
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
Jitao Wang
C. Shi
John D. Piette
Joshua R. Loftus
Donglin Zeng
Zhenke Wu
OffRL
51
0
0
10 Jan 2025
Spectral Representation Learning for Conditional Moment Models
Ziyu Wang
Yucen Luo
Yueru Li
Jun Zhu
Bernhard Schölkopf
CML
23
7
0
29 Oct 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
19
10
0
21 Sep 2022
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
OffRL
47
31
0
24 Jun 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
38
22
0
26 May 2022
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
C. Shi
Jin Zhu
Ye Shen
S. Luo
Hong Zhu
R. Song
OffRL
13
30
0
22 Feb 2022
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
10
4
0
29 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Yupeng Tang
Seung-seob Lee
OffRL
44
22
0
24 Oct 2021
Proximal Causal Inference for Complex Longitudinal Studies
Andrew Ying
Wang Miao
Xu Shi
E. T. Tchetgen
24
38
0
15 Sep 2021
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
31
180
0
22 Aug 2019
1