Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

International Conference on Machine Learning (ICML), 2020
10 February 2020
Omer Gottesman
Joseph D. Futoma
Yao Liu
Sonali Parbhoo
Leo Anthony Celi
Emma Brunskill
Finale Doshi-Velez
    OffRL

Papers citing "Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions"

35 / 35 papers shown
Limits of Generative Pre-Training in Structured EMR Trajectories with Irregular Sampling
N. Kuo
B. Gallego
Louisa R Jorm
100
0
0
27 Oct 2025
Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
Shreyas Chaudhari
Renhao Zhang
Philip S. Thomas
Bruno Castro da Silva
OffRL
250
1
0
30 Sep 2025
PERRY: Policy Evaluation with Confidence Intervals using Auxiliary Data
Aishwarya Mandyam
Jason Meng
Ge Gao
Jiankai Sun
Mac Schwager
Barbara E. Engelhardt
Emma Brunskill
OffRL
217
2
0
26 Jul 2025
Translate Policy to Language: Flow Matching Generated Rewards for LLM Explanations
Xinyi Yang
Liang Zeng
Heng Dong
Chao Yu
Xiaojun Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
412
7
0
18 Feb 2025
Concept-driven Off Policy Evaluation
Ritam Majumdar
Jack Teversham
Sonali Parbhoo
OffRL
337
0
0
28 Nov 2024
Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment
A. Rahman
Pranav Agarwal
R. Noumeir
P. Jouvet
Vincent Michalski
Samira Ebrahimi Kahou
OffRL
296
4
0
28 Jul 2024
Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
Zhiyao Luo
Yangchen Pan
Peter Watkinson
Tingting Zhu
OffRL
253
3
0
28 May 2024
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences
Takuya Hiraoka
Guanquan Wang
Takashi Onishi
Yoshimasa Tsuruoka
321
0
0
23 May 2024
How Consistent are Clinicians? Evaluating the Predictability of Sepsis Disease Progression with Dynamics Models
Unnseo Park
Venkatesh Sivaraman
Adam Perer
145
2
0
10 Apr 2024
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Elita Lobo
Harvineet Singh
Marek Petrik
Cynthia Rudin
Himabindu Lakkaraju
256
3
0
06 Apr 2024
Closed-loop Teaching via Demonstrations to Improve Policy Transparency
Michael S. Lee
Reid G. Simmons
H. Admoni
230
0
0
01 Apr 2024
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Neural Information Processing Systems (NeurIPS), 2023
Hao Sun
Alihan Huyuk
Daniel Jarrett
M. Schaar
OffRL
399
10
0
11 Oct 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
179
11
0
04 Jul 2023
Inference for relative sparsity
Samuel J. Weisenthal
Sally W. Thurston
Ashkan Ertefaie
CML
294
0
0
25 Jun 2023
The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Sarah Rathnam
S. Parbhoo
Weiwei Pan
Susan A. Murphy
Finale Doshi-Velez
OffRL
189
6
0
20 Jun 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
239
8
0
18 Feb 2023
ASQ-IT: Interactive Explanations for Reinforcement-Learning Agents
Yotam Amitai
Guy Avni
Ofra Amir
325
4
0
24 Jan 2023
Explainable Deep Reinforcement Learning: State of the Art and Challenges
ACM Computing Surveys (ACM CSUR), 2022
G. Vouros
XAI
570
132
0
24 Jan 2023
Decisions that Explain Themselves: A User-Centric Deep Reinforcement Learning Explanation System
Xiaoran Wu
Zihan Yan
Chongjie Zhang
Tongshuang Wu
217
2
0
01 Dec 2022
Relative Sparsity for Medical Decision Problems
Statistics in Medicine (Stat Med), 2022
Samuel J. Weisenthal
Sally W. Thurston
Ashkan Ertefaie
239
4
0
29 Nov 2022
Mitigating Health Data Poverty: Generative Approaches versus Resampling for Time-series Clinical Data
Raffaele Marchesi
Nicolo Micheletti
Giuseppe Jurman
V. Osmani
AI4TS
201
5
0
25 Oct 2022
Generating Synthetic Clinical Data that Capture Class Imbalanced Distributions with Generative Adversarial Networks: Example using Antiretroviral Therapy for HIV
Journal of Biomedical Informatics (JBI), 2022
N. Kuo
Federico Garcia
Anders Sönnerborg
Maurizio Zazzi
Michael Böhm
Rolf Kaiser
Mark Polizzotto
Louisa R Jorm
S. Barbieri
GAN
332
39
0
18 Aug 2022
The Health Gym: Synthetic Health-Related Datasets for the Development of Reinforcement Learning Algorithms
Scientific Data (Sci Data), 2022
N. Kuo
Mark Polizzotto
S. Finfer
Federico Garcia
Anders Sönnerborg
Maurizio Zazzi
Michael Böhm
Louisa R Jorm
S. Barbieri
OOD
192
32
0
12 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
306
25
0
23 Feb 2022
A Survey of Explainable Reinforcement Learning
Stephanie Milani
Nicholay Topin
Manuela Veloso
Fei Fang
XAI
LRM
269
60
0
17 Feb 2022
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making
S. Parbhoo
Shalmali Joshi
Finale Doshi-Velez
ELM
CML
OffRL
274
5
0
20 Jan 2022
Synthetic Acute Hypotension and Sepsis Datasets Based on MIMIC-III and Published as Part of the Health Gym Project
N. Kuo
Mark Polizzotto
S. Finfer
Louisa R Jorm
S. Barbieri
105
8
0
07 Dec 2021
Case-based off-policy policy evaluation using prototype learning
Anton Matsson
Fredrik D. Johansson
OffRL
182
1
0
22 Nov 2021
State Relevance for Off-Policy Evaluation
S. Shen
Yecheng Ma
Omer Gottesman
Finale Doshi-Velez
OffRL
171
6
0
13 Sep 2021
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
203
20
0
24 Jun 2021
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
239
15
0
08 Jun 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
IEEE Control Systems Letters (L-CSS), 2021
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
285
32
0
26 May 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
International Conference on Machine Learning (ICML), 2021
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CML
OffRL
331
33
0
18 Feb 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
697
32
0
18 Feb 2021
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
211
59
0
29 Jun 2020