Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders

27 July 2020
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
    OffRL

Papers citing "Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders"

37 / 37 papers shown
Title
Learning Decision Policies with Instrumental Variables through Double Machine Learning
Daqian Shao
Ashkan Soleymani
Francesco Quinzan
Marta Z. Kwiatkowska
36
1
0
14 May 2024
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Rahul G. Krishnan
Vasilis Syrgkanis
44
0
0
10 Apr 2024
Advancing Investment Frontiers: Industry-grade Deep Reinforcement Learning for Portfolio Optimization
Philip Ndikum
Serge Ndikum
49
1
0
27 Feb 2024
Off-Policy Evaluation in Markov Decision Processes under Weak Distributional Overlap
Mohammad Mehrabi
Stefan Wager
OffRL
24
14
0
13 Feb 2024
Offline Recommender System Evaluation under Unobserved Confounding
Olivier Jeunen
Ben London
OffRL
19
4
0
08 Sep 2023
Causal Reinforcement Learning: A Survey
Zhi-Hong Deng
Jing Jiang
Guodong Long
Chen Zhang
CML
LRM
47
13
0
04 Jul 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
16
6
0
01 Jun 2023
Estimation Beyond Data Reweighting: Kernel Method of Moments
Heiner Kremer
Yassine Nemmour
Bernhard Schölkopf
Jia-Jie Zhu
31
7
0
18 May 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
24
8
0
18 Feb 2023
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Fuchun Sun
Libo Huang
Z. Hao
CML
26
27
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
18
9
0
01 Feb 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskill
Philip S. Thomas
OffRL
39
6
0
24 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
Yang Xu
Jin Zhu
C. Shi
S. Luo
R. Song
OffRL
21
12
0
29 Dec 2022
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Zuyue Fu
Zhengling Qi
Zhuoran Yang
Zhaoran Wang
Lan Wang
OffRL
18
0
0
23 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
33
67
0
13 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
18
8
0
29 Nov 2022
A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Ziyang Tang
Yiheng Duan
Stephanie S. Zhang
Lihong Li
OffRL
24
4
0
14 Oct 2022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
21
0
0
12 Sep 2022
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments
Mengxin Yu
Zhuoran Yang
Jianqing Fan
OffRL
13
8
0
23 Aug 2022
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
Masatoshi Uehara
Haruka Kiyohara
Andrew Bennett
Victor Chernozhukov
Nan Jiang
Nathan Kallus
C. Shi
Wen Sun
OffRL
26
16
0
26 Jul 2022
Functional Generalized Empirical Likelihood Estimation for Conditional Moment Restrictions
Heiner Kremer
Jia-Jie Zhu
Krikamol Muandet
Bernhard Schölkopf
35
8
0
11 Jul 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
51
22
0
26 May 2022
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process
C. Shi
Jin Zhu
Ye Shen
S. Luo
Hong Zhu
R. Song
OffRL
23
30
0
22 Feb 2022
Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
S. Saghafian
CML
24
14
0
08 Dec 2021
Causal Forecasting: Generalization Bounds for Autoregressive Models
L. C. Vankadara
P. M. Faller
Michaela Hardt
Lenon Minorics
D. Ghoshdastidar
Dominik Janzing
OOD
17
6
0
18 Nov 2021
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes
C. Shi
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
11
23
0
12 Nov 2021
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes
Andrew Bennett
Nathan Kallus
OffRL
24
41
0
28 Oct 2021
A Spectral Approach to Off-Policy Evaluation for POMDPs
Yash Nair
Nan Jiang
OffRL
23
17
0
22 Sep 2021
Causal Reinforcement Learning using Observational and Interventional Data
Maxime Gasse
Damien Grasset
Guillaume Gaudron
Pierre-Yves Oudeyer
CML
OffRL
24
50
0
28 Jun 2021
On Instrumental Variable Regression for Deep Offline Policy Evaluation
Yutian Chen
Liyuan Xu
Çağlar Gülçehre
T. Paine
A. Gretton
Nando de Freitas
Arnaud Doucet
OffRL
31
17
0
21 May 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
30
52
0
26 Apr 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
OffRL
18
33
0
19 Feb 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
18
14
0
18 Feb 2021
The Variational Method of Moments
Andrew Bennett
Nathan Kallus
22
30
0
17 Dec 2020
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
Lingxiao Wang
Zhuoran Yang
Zhaoran Wang
OffRL
16
45
0
22 Jun 2020
Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework
C. Shi
Xiaoyu Wang
S. Luo
Hongtu Zhu
Jieping Ye
R. Song
CML
OffRL
25
33
0
05 Feb 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
33
181
0
22 Aug 2019