ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.04518
  4. Cited By
Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement
  Learning
v1v2 (latest)

Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2020
11 February 2020
Nathan Kallus
Angela Zhou
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning"

47 / 47 papers shown
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRLCML
237
2
0
24 Oct 2025
Offline Reinforcement Learning in Large State Spaces: Algorithms and Guarantees
Offline Reinforcement Learning in Large State Spaces: Algorithms and Guarantees
Nan Jiang
Tengyang Xie
OffRL
239
16
0
05 Oct 2025
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu
Rui Ai
Han Zhong
Xiaoyu Chen
L. Wang
Zhaoran Wang
Zhuoran Yang
246
0
0
11 Jun 2025
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
Hongyi Zhou
Josiah P. Hanna
Jin Zhu
Ying Yang
Chengchun Shi
OffRL
273
4
0
28 May 2025
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
1.0K
2
0
01 May 2025
Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding
Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding
Nishanth Venkatesh S.
Heeseung Bang
Andreas A. Malikopoulos
OffRL
239
2
0
01 Apr 2025
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement
  Learning
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Shuguang Yu
Shuxing Fang
Ruixin Peng
Zhengling Qi
Fan Zhou
C. Shi
CMLOffRL
380
7
0
08 Dec 2024
Causal Deepsets for Off-policy Evaluation under Spatial or
  Spatio-temporal Interferences
Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences
Runpeng Dai
Jianing Wang
Fan Zhou
Shuang Luo
Zhiwei Qin
Chengchun Shi
Hongtu Zhu
CMLOffRL
318
3
0
25 Jul 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates
  of Multiple Estimators
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
291
4
0
27 May 2024
Bounding Causal Effects with Leaky Instruments
Bounding Causal Effects with Leaky Instruments
David S. Watson
Jordan Penn
L. Gunderson
Gecia Bravo Hermsdorff
Afsaneh Mastouri
Ricardo M. A. Silva
CML
285
2
0
05 Apr 2024
Predictive Performance Comparison of Decision Policies Under Confounding
Predictive Performance Comparison of Decision Policies Under Confounding
Luke M. Guerdan
Amanda Coston
Kenneth Holstein
Zhiwei Steven Wu
OffRL
502
1
0
01 Apr 2024
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision
  Processes
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett
Nathan Kallus
Miruna Oprescu
Wen Sun
Kaiwen Wang
AAMLOffRL
310
4
0
29 Mar 2024
Partial Counterfactual Identification of Continuous Outcomes with a
  Curvature Sensitivity Model
Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity ModelNeural Information Processing Systems (NeurIPS), 2023
Valentyn Melnychuk
Dennis Frauen
Stefan Feuerriegel
722
14
0
02 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden
  Confounding
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden ConfoundingInternational Conference on Learning Representations (ICLR), 2023
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
289
9
0
01 Jun 2023
Personalized Pricing with Invalid Instrumental Variables:
  Identification, Estimation, and Policy Learning
Personalized Pricing with Invalid Instrumental Variables: Identification, Estimation, and Policy Learning
Rui Miao
Zhengling Qi
Cong Shi
Lin Lin
214
2
0
24 Feb 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
534
68
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
696
14
0
01 Feb 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
An Instrumental Variable Approach to Confounded Off-Policy EvaluationInternational Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
361
24
0
29 Dec 2022
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Quantile Off-Policy Evaluation via Deep Conditional Generative Learning
Yang Xu
C. Shi
Shuang Luo
Lan Wang
R. Song
OffRL
296
6
0
29 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine
  Interaction with Private Information
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private InformationManagement Sciences (MS), 2022
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
222
1
0
23 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
302
114
0
13 Dec 2022
Instrumental Variables in Causal Inference and Machine Learning: A
  Survey
Instrumental Variables in Causal Inference and Machine Learning: A SurveyACM Computing Surveys (ACM CSUR), 2022
Anpeng Wu
Kun Kuang
Ruoxuan Xiong
Leilei Gan
SyDaCML
306
19
0
12 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Offline Policy Evaluation and Optimization under ConfoundingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
426
15
0
29 Nov 2022
Causal Deep Reinforcement Learning Using Observational Data
Causal Deep Reinforcement Learning Using Observational DataInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CMLOffRL
247
8
0
28 Nov 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision
  Processes under Non-Parametric Models
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric ModelsNeural Information Processing Systems (NeurIPS), 2022
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
346
14
0
21 Sep 2022
Data-Driven Influence Functions for Optimization-Based Causal Inference
Data-Driven Influence Functions for Optimization-Based Causal Inference
Michael I. Jordan
Yixin Wang
Angela Zhou
TDICML
386
3
0
29 Aug 2022
Strategic Decision-Making in the Presence of Information Asymmetry:
  Provably Efficient RL with Algorithmic Instruments
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
359
9
0
23 Aug 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Future-Dependent Value-Based Off-Policy Evaluation in POMDPsNeural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
504
25
0
26 Jul 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize
  Offline Reinforcement Learning
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
285
14
0
14 Jun 2022
Pessimism in the Face of Confounders: Provably Efficient Offline
  Reinforcement Learning in Partially Observable Markov Decision Processes
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision ProcessesInternational Conference on Learning Representations (ICLR), 2022
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
432
26
0
26 May 2022
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
Model-Free and Model-Based Policy Evaluation when Causality is UncertainInternational Conference on Machine Learning (ICML), 2022
David Bruns-Smith
CMLELMOffRL
209
14
0
02 Apr 2022
Stochastic Causal Programming for Bounding Treatment Effects
Stochastic Causal Programming for Bounding Treatment EffectsCLEaR (CLEaR), 2022
Kirtan Padh
Jakob Zeitler
David S. Watson
Matt J. Kusner
Ricardo M. A. Silva
Niki Kilbertus
CML
519
29
0
22 Feb 2022
Off-Policy Confidence Interval Estimation with Confounded Markov
  Decision Process
Off-Policy Confidence Interval Estimation with Confounded Markov Decision ProcessJournal of the American Statistical Association (JASA), 2022
C. Shi
Jin Zhu
Ye Shen
Shuang Luo
Hong Zhu
R. Song
OffRL
462
44
0
22 Feb 2022
A Behavior Regularized Implicit Policy for Offline Reinforcement
  Learning
A Behavior Regularized Implicit Policy for Offline Reinforcement Learning
Shentao Yang
Zhendong Wang
Huangjie Zheng
Yihao Feng
Mingyuan Zhou
OffRL
209
10
0
19 Feb 2022
Generalizing Off-Policy Evaluation From a Causal Perspective For
  Sequential Decision-Making
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making
S. Parbhoo
Shalmali Joshi
Finale Doshi-Velez
ELMCMLOffRL
296
5
0
20 Jan 2022
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
S. Saghafian
CML
427
22
0
08 Dec 2021
A Minimax Learning Approach to Off-Policy Evaluation in Confounded
  Partially Observable Markov Decision Processes
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2021
C. Shi
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
428
31
0
12 Nov 2021
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in
  Partially Observed Markov Decision Processes
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision ProcessesOperational Research (OR), 2021
Andrew Bennett
Nathan Kallus
OffRL
258
58
0
28 Oct 2021
On Covariate Shift of Latent Confounders in Imitation and Reinforcement
  Learning
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Guy Tennenholtz
Assaf Hallak
Gal Dalal
Shie Mannor
Gal Chechik
Uri Shalit
OODOffRL
380
16
0
13 Oct 2021
Partial Counterfactual Identification from Observational and
  Experimental Data
Partial Counterfactual Identification from Observational and Experimental DataInternational Conference on Machine Learning (ICML), 2021
Junzhe Zhang
Jin Tian
Elias Bareinboim
238
77
0
12 Oct 2021
Invariant Policy Learning: A Causal Perspective
Invariant Policy Learning: A Causal PerspectiveIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Sorawit Saengkyongam
Nikolaj Thams
J. Peters
Niklas Pfister
CMLOffRL
634
22
0
01 Jun 2021
Universal Off-Policy Evaluation
Universal Off-Policy EvaluationNeural Information Processing Systems (NeurIPS), 2021
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRLELM
332
58
0
26 Apr 2021
Estimating and Improving Dynamic Treatment Regimes With a Time-Varying
  Instrumental Variable
Estimating and Improving Dynamic Treatment Regimes With a Time-Varying Instrumental Variable
Shuxiao Chen
B. Zhang
331
26
0
15 Apr 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement
  Learning
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
OffRL
340
39
0
19 Feb 2021
Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile
  Balancing
Sharp Sensitivity Analysis for Inverse Propensity Weighting via Quantile BalancingJournal of the American Statistical Association (JASA), 2021
Jacob Dorn
Kevin Guo
433
74
0
08 Feb 2021
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with
  Latent Confounders
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent ConfoundersInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
212
48
0
27 Jul 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved
  Confounding
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved ConfoundingNeural Information Processing Systems (NeurIPS), 2020
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
OffRL
399
72
0
12 Mar 2020
1
Page 1 of 1