Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.03739
Cited By
v1
v2
v3 (latest)
Off-Policy Evaluation in Partially Observable Environments
AAAI Conference on Artificial Intelligence (AAAI), 2019
9 September 2019
Guy Tennenholtz
Shie Mannor
Uri Shalit
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Off-Policy Evaluation in Partially Observable Environments"
50 / 68 papers shown
Offline Reinforcement Learning in Large State Spaces: Algorithms and Guarantees
Nan Jiang
Tengyang Xie
OffRL
242
16
0
05 Oct 2025
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu
Rui Ai
Han Zhong
Xiaoyu Chen
L. Wang
Zhaoran Wang
Zhuoran Yang
249
0
0
11 Jun 2025
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
276
4
0
28 May 2025
Automatic Reward Shaping from Confounded Offline Data
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
OnRL
580
4
0
16 May 2025
Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding
Nishanth Venkatesh S.
Heeseung Bang
Andreas A. Malikopoulos
OffRL
253
2
0
01 Apr 2025
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
International Conference on Learning Representations (ICLR), 2025
Yoav Wald
M. Goldstein
Yonathan Efroni
Wouter A. C. van Amsterdam
Rajesh Ranganath
CML
404
0
0
20 Mar 2025
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPs
International Conference on Learning Representations (ICLR), 2025
Yuheng Zhang
Nan Jiang
OffRL
305
5
0
03 Mar 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CML
OffRL
386
8
0
08 Dec 2024
Data-Centric Approach to Constrained Machine Learning: A Case Study on Conway's Game of Life
A. Bibin
Anton Dereventsov
168
2
0
23 Aug 2024
Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Runpeng Dai
Jianing Wang
Fan Zhou
Shuang Luo
Zhiwei Qin
Chengchun Shi
Hongtu Zhu
CML
OffRL
318
3
0
25 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
296
1
0
30 Jun 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
Constantine Caramanis
Yonathan Efroni
OffRL
450
6
0
03 Jun 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
298
4
0
27 May 2024
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
Vanshaj Khattar
Yuhao Ding
Bilgehan Sel
Javad Lavaei
Ming Jin
OffRL
309
24
0
26 May 2024
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang
Nan Jiang
OffRL
328
7
0
22 Feb 2024
Source Condition Double Robust Inference on Functionals of Inverse Problems
Andrew Bennett
Nathan Kallus
Xiaojie Mao
Whitney Newey
Vasilis Syrgkanis
Masatoshi Uehara
254
10
0
25 Jul 2023
Comparing Causal Frameworks: Potential Outcomes, Structural Models, Graphs, and Abstractions
Neural Information Processing Systems (NeurIPS), 2023
D. Ibeling
Thomas Icard
CML
291
22
0
25 Jun 2023
Reinforcement Learning with Temporal-Logic-Based Causal Diagrams
International Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2023
Yashi Paliwal
Rajarshi Roy
Jean-Raphael Gaglione
Nasim Baharisangari
Daniel Neider
Xiaoming Duan
Ufuk Topcu
Zhe Xu
208
5
0
23 Jun 2023
Finding Counterfactually Optimal Action Sequences in Continuous State Spaces
Neural Information Processing Systems (NeurIPS), 2023
Stratis Tsirtsis
Manuel Gomez Rodriguez
CML
OffRL
438
14
0
06 Jun 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
255
8
0
18 Feb 2023
A Survey on Causal Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
542
70
0
10 Feb 2023
Reinforcement Learning with History-Dependent Dynamic Contexts
International Conference on Machine Learning (ICML), 2023
Guy Tennenholtz
Nadav Merlis
Lior Shani
Martin Mladenov
Craig Boutilier
AI4CE
297
13
0
04 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
701
14
0
01 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Neural Information Processing Systems (NeurIPS), 2023
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
272
6
0
24 Jan 2023
Safe Policy Improvement for POMDPs via Finite-State Controllers
AAAI Conference on Artificial Intelligence (AAAI), 2023
T. D. Simão
Marnix Suilen
N. Jansen
OffRL
296
12
0
12 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
International Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
365
24
0
29 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
303
114
0
13 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
430
15
0
29 Nov 2022
Causal Deep Reinforcement Learning Using Observational Data
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CML
OffRL
253
9
0
28 Nov 2022
A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Ziyang Tang
Yiheng Duan
Stephanie S. Zhang
Lihong Li
OffRL
229
6
0
14 Oct 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Neural Information Processing Systems (NeurIPS), 2022
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
348
15
0
21 Sep 2022
A Survey of Deep Causal Models and Their Industrial Applications
Artificial Intelligence Review (Artif Intell Rev), 2022
Zongyu Li
Xiaoning Guo
Siwei Qiang
CML
AI4CE
807
19
0
19 Sep 2022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
248
1
0
12 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
363
9
0
23 Aug 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Neural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
509
25
0
26 Jul 2022
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Neural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
OffRL
322
44
0
24 Jun 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
International Conference on Learning Representations (ICLR), 2022
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
444
26
0
26 May 2022
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
International Conference on Machine Learning (ICML), 2022
David Bruns-Smith
CML
ELM
OffRL
216
14
0
02 Apr 2022
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
Journal of the American Statistical Association (JASA), 2022
C. Shi
Jin Zhu
Ye Shen
Shuang Luo
Hong Zhu
R. Song
OffRL
471
44
0
22 Feb 2022
Long-term Causal Inference Under Persistent Confounding via Data Combination
Guido Imbens
Nathan Kallus
Xiaojie Mao
Yuhao Wang
CML
596
60
0
15 Feb 2022
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making
S. Parbhoo
Shalmali Joshi
Finale Doshi-Velez
ELM
CML
OffRL
296
5
0
20 Jan 2022
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
S. Saghafian
CML
439
22
0
08 Dec 2021
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes
International Conference on Machine Learning (ICML), 2021
C. Shi
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
441
31
0
12 Nov 2021
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes
Operational Research (OR), 2021
Andrew Bennett
Nathan Kallus
OffRL
262
59
0
28 Oct 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Annals of Statistics (Ann. Stat.), 2021
Yupeng Tang
Seung-seob Lee
OffRL
414
30
0
24 Oct 2021
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Guy Tennenholtz
Assaf Hallak
Gal Dalal
Shie Mannor
Gal Chechik
Uri Shalit
OOD
OffRL
384
16
0
13 Oct 2021
A Spectral Approach to Off-Policy Evaluation for POMDPs
Yash Nair
Nan Jiang
OffRL
244
19
0
22 Sep 2021
Learning-to-defer for sequential medical decision-making under uncertainty
Shalmali Joshi
S. Parbhoo
Finale Doshi-Velez
OffRL
272
13
0
13 Sep 2021
Direct Advantage Estimation
Hsiao-Ru Pan
Nico Gürtler
Alexander Neitz
Bernhard Schölkopf
OffRL
CML
200
14
0
13 Sep 2021
Causal Reinforcement Learning using Observational and Interventional Data
Maxime Gasse
Damien Grasset
Guillaume Gaudron
Pierre-Yves Oudeyer
CML
OffRL
258
59
0
28 Jun 2021
1
2
Next
Page 1 of 2