ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.05623
  4. Cited By
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved
  Confounding

Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding

Neural Information Processing Systems (NeurIPS), 2020
12 March 2020
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding"

50 / 52 papers shown
Title
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRLCML
168
0
0
24 Oct 2025
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
164
3
0
28 May 2025
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to doInternational Conference on Learning Representations (ICLR), 2025
Yoav Wald
M. Goldstein
Yonathan Efroni
Wouter A. C. van Amsterdam
Rajesh Ranganath
CML
335
0
0
20 Mar 2025
Statistical Tractability of Off-policy Evaluation of History-dependent Policies in POMDPsInternational Conference on Learning Representations (ICLR), 2025
Yuheng Zhang
Nan Jiang
OffRL
219
2
0
03 Mar 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement
  Learning
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CMLOffRL
264
5
0
08 Dec 2024
Off-Policy Selection for Initiating Human-Centric Experimental Design
Off-Policy Selection for Initiating Human-Centric Experimental DesignNeural Information Processing Systems (NeurIPS), 2024
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
287
0
0
26 Oct 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy
  Evaluation
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
Constantine Caramanis
Yonathan Efroni
OffRL
377
4
0
03 Jun 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates
  of Multiple Estimators
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
240
4
0
27 May 2024
Learning Decision Policies with Instrumental Variables through Double
  Machine Learning
Learning Decision Policies with Instrumental Variables through Double Machine LearningInternational Conference on Machine Learning (ICML), 2024
Daqian Shao
Ashkan Soleymani
Francesco Quinzan
Marta Z. Kwiatkowska
460
2
0
14 May 2024
Predictive Performance Comparison of Decision Policies Under Confounding
Predictive Performance Comparison of Decision Policies Under Confounding
Luke M. Guerdan
Amanda Coston
Kenneth Holstein
Zhiwei Steven Wu
OffRL
399
0
0
01 Apr 2024
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision
  Processes
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett
Nathan Kallus
Miruna Oprescu
Wen Sun
Kaiwen Wang
AAMLOffRL
238
2
0
29 Mar 2024
On the Curses of Future and History in Future-dependent Value Functions
  for Off-policy Evaluation
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang
Nan Jiang
OffRL
227
5
0
22 Feb 2024
Distributionally Robust Policy Evaluation under General Covariate Shift
  in Contextual Bandits
Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits
Yi Guo
Hao Liu
Yisong Yue
Anqi Liu
OffRL
247
3
0
21 Jan 2024
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
Off-Policy Evaluation for Large Action Spaces via Policy ConvolutionThe Web Conference (WWW), 2023
Noveen Sachdeva
Lequn Wang
Dawen Liang
Nathan Kallus
Julian McAuley
OffRL
245
16
0
24 Oct 2023
Confounding-Robust Policy Improvement with Human-AI Teams
Confounding-Robust Policy Improvement with Human-AI Teams
Ruijiang Gao
Mingzhang Yin
596
5
0
13 Oct 2023
Off-Policy Evaluation for Human Feedback
Off-Policy Evaluation for Human FeedbackNeural Information Processing Systems (NeurIPS), 2023
Qitong Gao
Ge Gao
Juncheng Dong
Vahid Tarokh
Min Chi
Miroslav Pajic
OffRL
290
7
0
11 Oct 2023
Offline Recommender System Evaluation under Unobserved Confounding
Offline Recommender System Evaluation under Unobserved Confounding
Olivier Jeunen
Ben London
OffRL
162
6
0
08 Sep 2023
Causal Reinforcement Learning: A Survey
Causal Reinforcement Learning: A Survey
Zhi-Hong Deng
Jing Jiang
Guodong Long
Chen Zhang
CMLLRM
314
32
0
04 Jul 2023
Comparing Causal Frameworks: Potential Outcomes, Structural Models,
  Graphs, and Abstractions
Comparing Causal Frameworks: Potential Outcomes, Structural Models, Graphs, and AbstractionsNeural Information Processing Systems (NeurIPS), 2023
D. Ibeling
Thomas Icard
CML
184
17
0
25 Jun 2023
Finding Counterfactually Optimal Action Sequences in Continuous State
  Spaces
Finding Counterfactually Optimal Action Sequences in Continuous State SpacesNeural Information Processing Systems (NeurIPS), 2023
Stratis Tsirtsis
Manuel Gomez Rodriguez
CMLOffRL
304
13
0
06 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden
  Confounding
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden ConfoundingInternational Conference on Learning Representations (ICLR), 2023
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
173
8
0
01 Jun 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and HealthcareAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
183
8
0
18 Feb 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
398
51
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
544
13
0
01 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Off-Policy Evaluation for Action-Dependent Non-Stationary EnvironmentsNeural Information Processing Systems (NeurIPS), 2023
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
199
6
0
24 Jan 2023
Off-Policy Evaluation with Out-of-Sample Guarantees
Off-Policy Evaluation with Out-of-Sample Guarantees
Sofia Ek
Dave Zachariah
Fredrik D. Johansson
Petre Stoica
CMLOffRL
224
4
0
20 Jan 2023
Causal Falsification of Digital Twins
Causal Falsification of Digital Twins
R. Cornish
Muhammad Faaiz Taufiq
Arnaud Doucet
Chris Holmes
SyDaCML
214
1
0
17 Jan 2023
Safe Policy Improvement for POMDPs via Finite-State Controllers
Safe Policy Improvement for POMDPs via Finite-State ControllersAAAI Conference on Artificial Intelligence (AAAI), 2023
T. D. Simão
Marnix Suilen
N. Jansen
OffRL
160
10
0
12 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
An Instrumental Variable Approach to Confounded Off-Policy EvaluationInternational Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
275
23
0
29 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine
  Interaction with Private Information
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private InformationManagement Sciences (MS), 2022
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
171
1
0
23 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
234
100
0
13 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Offline Policy Evaluation and Optimization under ConfoundingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
308
14
0
29 Nov 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision
  Processes under Non-Parametric Models
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric ModelsNeural Information Processing Systems (NeurIPS), 2022
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
307
11
0
21 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry:
  Provably Efficient RL with Algorithmic Instruments
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
312
9
0
23 Aug 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Future-Dependent Value-Based Off-Policy Evaluation in POMDPsNeural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
385
23
0
26 Jul 2022
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
Model-Free and Model-Based Policy Evaluation when Causality is UncertainInternational Conference on Machine Learning (ICML), 2022
David Bruns-Smith
CMLELMOffRL
138
14
0
02 Apr 2022
Off-Policy Confidence Interval Estimation with Confounded Markov
  Decision Process
Off-Policy Confidence Interval Estimation with Confounded Markov Decision ProcessJournal of the American Statistical Association (JASA), 2022
C. Shi
Jin Zhu
Ye Shen
Shuang Luo
Hong Zhu
R. Song
OffRL
317
38
0
22 Feb 2022
Generalizing Off-Policy Evaluation From a Causal Perspective For
  Sequential Decision-Making
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making
S. Parbhoo
Shalmali Joshi
Finale Doshi-Velez
ELMCMLOffRL
198
5
0
20 Jan 2022
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
S. Saghafian
CML
251
21
0
08 Dec 2021
Case-based off-policy policy evaluation using prototype learning
Case-based off-policy policy evaluation using prototype learning
Anton Matsson
Fredrik D. Johansson
OffRL
144
1
0
22 Nov 2021
A Minimax Learning Approach to Off-Policy Evaluation in Confounded
  Partially Observable Markov Decision Processes
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2021
C. Shi
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
281
28
0
12 Nov 2021
Causal Multi-Agent Reinforcement Learning: Review and Open Problems
Causal Multi-Agent Reinforcement Learning: Review and Open Problems
St John Grimbly
Jonathan P. Shock
Arnu Pretorius
215
23
0
12 Nov 2021
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in
  Partially Observed Markov Decision Processes
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision ProcessesOperational Research (OR), 2021
Andrew Bennett
Nathan Kallus
OffRL
215
54
0
28 Oct 2021
On Covariate Shift of Latent Confounders in Imitation and Reinforcement
  Learning
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Guy Tennenholtz
Assaf Hallak
Gal Dalal
Shie Mannor
Gal Chechik
Uri Shalit
OODOffRL
324
16
0
13 Oct 2021
Universal Off-Policy Evaluation
Universal Off-Policy EvaluationNeural Information Processing Systems (NeurIPS), 2021
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRLELM
244
57
0
26 Apr 2021
Learning Under Adversarial and Interventional Shifts
Learning Under Adversarial and Interventional Shifts
Harvineet Singh
Shalmali Joshi
Finale Doshi-Velez
Himabindu Lakkaraju
OOD
170
4
0
29 Mar 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement
  Learning
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
OffRL
258
39
0
19 Feb 2021
Causal Markov Decision Processes: Learning Good Interventions
  Efficiently
Causal Markov Decision Processes: Learning Good Interventions Efficiently
Yangyi Lu
A. Meisami
Ambuj Tewari
137
12
0
15 Feb 2021
Learning Deep Features in Instrumental Variable Regression
Learning Deep Features in Instrumental Variable Regression
Liyuan Xu
Yutian Chen
Siddarth Srinivasan
Nando de Freitas
Arnaud Doucet
Arthur Gretton
CMLOOD
386
81
0
14 Oct 2020
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with
  Latent Confounders
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent ConfoundersInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
167
46
0
27 Jul 2020
12
Next