ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.05824
  4. Cited By
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal
  Models
v1v2v3 (latest)

Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models

International Conference on Machine Learning (ICML), 2019
14 May 2019
Michael Oberst
David Sontag
    CMLOffRL
ArXiv (abs)PDFHTML

Papers citing "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models"

50 / 118 papers shown
Title
Narrowing Action Choices with AI Improves Human Sequential Decisions
Narrowing Action Choices with AI Improves Human Sequential Decisions
Eleni Straitouri
Stratis Tsirtsis
Ander Artola Velasco
Manuel Gomez Rodriguez
100
0
0
17 Oct 2025
On the Reasoning Abilities of Masked Diffusion Language Models
On the Reasoning Abilities of Masked Diffusion Language Models
Anej Svete
Ashish Sabharwal
DiffMLRM
88
0
0
15 Oct 2025
Large Language Models as Nondeterministic Causal Models
Large Language Models as Nondeterministic Causal Models
Sander Beckers
LRM
104
1
0
26 Sep 2025
PERRY: Policy Evaluation with Confidence Intervals using Auxiliary Data
PERRY: Policy Evaluation with Confidence Intervals using Auxiliary Data
Aishwarya Mandyam
Jason Meng
Ge Gao
Jiankai Sun
Mac Schwager
Barbara E. Engelhardt
Emma Brunskill
OffRL
97
1
0
26 Jul 2025
Abstract Counterfactuals for Language Model Agents
Abstract Counterfactuals for Language Model Agents
Edoardo Pona
Milad Kazemi
Yali Du
David Watson
Nicola Paoletti
LLMAG
243
1
0
03 Jun 2025
When Counterfactual Reasoning Fails: Chaos and Real-World Complexity
When Counterfactual Reasoning Fails: Chaos and Real-World Complexity
Yahya Aalaila
Gerrit Großmann
Sumantrak Mukherjee
Jonas Wahl
Sebastian Vollmer
CMLLRM
331
0
0
31 Mar 2025
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to doInternational Conference on Learning Representations (ICLR), 2025
Yoav Wald
M. Goldstein
Yonathan Efroni
Wouter A. C. van Amsterdam
Rajesh Ranganath
CML
323
0
0
20 Mar 2025
Towards Optimal Offline Reinforcement Learning
Towards Optimal Offline Reinforcement Learning
Mengmeng Li
Daniel Kuhn
Tobias Sutter
OffRL
269
1
0
15 Mar 2025
Evaluation of Large Language Models via Coupled Token Generation
Evaluation of Large Language Models via Coupled Token Generation
N. C. Benz
Stratis Tsirtsis
Eleni Straitouri
Ivi Chatzi
Ander Artola Velasco
Suhas Thejaswi
Manuel Gomez Rodriguez
315
3
0
03 Feb 2025
To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning
To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2025
Hilmy Baja
Michiel Kallenberg
Ioannis Athanasiadis
OffRL
197
4
0
22 Jan 2025
Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation
Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation
Joo Seung Lee
Malini Mahendra
Anil Aswani
OffRL
296
1
0
10 Jan 2025
Preserving Expert-Level Privacy in Offline Reinforcement Learning
Preserving Expert-Level Privacy in Offline Reinforcement Learning
Navodita Sharma
Vishnu Vinod
Abhradeep Thakurta
Alekh Agarwal
Borja Balle
Christoph Dann
A. Raghuveer
OffRL
245
0
0
18 Nov 2024
Gumbel Counterfactual Generation From Language Models
Gumbel Counterfactual Generation From Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Shauli Ravfogel
Anej Svete
Vésteinn Snæbjarnarson
Robert Bamler
LRMCML
527
1
0
11 Nov 2024
Off-Policy Selection for Initiating Human-Centric Experimental Design
Off-Policy Selection for Initiating Human-Centric Experimental DesignNeural Information Processing Systems (NeurIPS), 2024
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
279
0
0
26 Oct 2024
Episodic Future Thinking Mechanism for Multi-agent Reinforcement
  Learning
Episodic Future Thinking Mechanism for Multi-agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Dongsu Lee
Minhae Kwon
242
4
0
22 Oct 2024
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou
A. Sukovic
Yasaman Zolfimoselo
Goran Radanović
CML
324
0
0
16 Oct 2024
Towards Cost Sensitive Decision Making
Towards Cost Sensitive Decision MakingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Yang Li
Junier Oliva
OffRL
136
1
0
04 Oct 2024
Counterfactual Token Generation in Large Language Models
Counterfactual Token Generation in Large Language ModelsCLEaR (CLEaR), 2024
Ivi Chatzi
N. C. Benz
Eleni Straitouri
Stratis Tsirtsis
Manuel Gomez Rodriguez
LRM
345
9
0
25 Sep 2024
Preference Elicitation for Offline Reinforcement Learning
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
223
1
0
26 Jun 2024
Teleporter Theory: A General and Simple Approach for Modeling
  Cross-World Counterfactual Causality
Teleporter Theory: A General and Simple Approach for Modeling Cross-World Counterfactual Causality
Jiangmeng Li
Bin Qin
Qirui Ji
Yi Li
Jingyao Wang
Jianwen Cao
Jianwei Niu
221
0
0
17 Jun 2024
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
Kartik Choudhary
Dhawal Gupta
Philip S. Thomas
OODVLM
134
4
0
09 Jun 2024
Fine-Grained Causal Dynamics Learning with Quantization for Improving
  Robustness in Reinforcement Learning
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning
Inwoo Hwang
Yunhyeok Kwak
Jaein Kim
Byoung-Tak Zhang
Sanghack Lee
236
6
0
05 Jun 2024
DTR-Bench: An in silico Environment and Benchmark Platform for
  Reinforcement Learning Based Dynamic Treatment Regime
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Zhiyao Luo
Mingcheng Zhu
Fenglin Liu
Jiali Li
Yangchen Pan
Jiandong Zhou
Tingting Zhu
OffRL
168
6
0
28 May 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates
  of Multiple Estimators
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
228
4
0
27 May 2024
Counterfactual Influence in Markov Decision Processes
Counterfactual Influence in Markov Decision Processes
Milad Kazemi
Jessica Lally
Ekaterina Tishchenko
Hana Chockler
Nicola Paoletti
271
2
0
13 Feb 2024
Understanding What Affects Generalization Gap in Visual Reinforcement
  Learning: Theory and Empirical Evidence
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
Jiafei Lyu
Le Wan
Xiu Li
Zongqing Lu
CMLOffRL
261
2
0
05 Feb 2024
Natural Counterfactuals With Necessary Backtracking
Natural Counterfactuals With Necessary Backtracking
Guang-Yuan Hao
Jiji Zhang
Erdun Gao
Hao Wang
Kun Zhang
175
0
0
02 Feb 2024
Distributionally Robust Policy Evaluation under General Covariate Shift
  in Contextual Bandits
Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits
Yi Guo
Hao Liu
Yisong Yue
Anqi Liu
OffRL
239
3
0
21 Jan 2024
Counterfactual-Augmented Importance Sampling for Semi-Offline Policy
  Evaluation
Counterfactual-Augmented Importance Sampling for Semi-Offline Policy EvaluationNeural Information Processing Systems (NeurIPS), 2023
Shengpu Tang
Jenna Wiens
OffRLCML
206
6
0
26 Oct 2023
Agent-Specific Effects: A Causal Effect Propagation Analysis in
  Multi-Agent MDPs
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPsInternational Conference on Machine Learning (ICML), 2023
Stelios Triantafyllou
A. Sukovic
Debmalya Mandal
Goran Radanović
349
0
0
17 Oct 2023
Offline Imitation Learning with Variational Counterfactual Reasoning
Offline Imitation Learning with Variational Counterfactual ReasoningNeural Information Processing Systems (NeurIPS), 2023
Bowei He
Zexu Sun
Jinxin Liu
Shuai Zhang
Xu Chen
Chen Ma
OffRL
185
10
0
07 Oct 2023
Estimation of Counterfactual Interventions under Uncertainties
Estimation of Counterfactual Interventions under UncertaintiesAsian Conference on Machine Learning (ACML), 2023
Juliane Weilbach
S. Gerwinn
M. Kandemir
Martin Fraenzle
153
0
0
15 Sep 2023
Leveraging Factored Action Spaces for Off-Policy Evaluation
Leveraging Factored Action Spaces for Off-Policy Evaluation
Aaman Rebello
Shengpu Tang
Jenna Wiens
Sonali Parbhoo Department of Engineering
CMLOffRL
128
2
0
13 Jul 2023
High Fidelity Image Counterfactuals with Probabilistic Causal Models
High Fidelity Image Counterfactuals with Probabilistic Causal ModelsInternational Conference on Machine Learning (ICML), 2023
Fabio De Sousa Ribeiro
Tian Xia
M. Monteiro
Nick Pawlowski
Ben Glocker
DiffM
191
58
0
27 Jun 2023
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning
  Approach to Critical Care
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical CareIEEE journal of biomedical and health informatics (IEEE JBHI), 2023
Ali Shirali
Alexander Schubert
Ahmed Alaa
OffRL
183
8
0
13 Jun 2023
Finding Counterfactually Optimal Action Sequences in Continuous State
  Spaces
Finding Counterfactually Optimal Action Sequences in Continuous State SpacesNeural Information Processing Systems (NeurIPS), 2023
Stratis Tsirtsis
Manuel Gomez Rodriguez
CMLOffRL
296
13
0
06 Jun 2023
Partial Counterfactual Identification of Continuous Outcomes with a
  Curvature Sensitivity Model
Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity ModelNeural Information Processing Systems (NeurIPS), 2023
Valentyn Melnychuk
Dennis Frauen
Stefan Feuerriegel
457
13
0
02 Jun 2023
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden
  Confounding
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden ConfoundingInternational Conference on Learning Representations (ICLR), 2023
Alizée Pace
Hugo Yèche
Bernhard Schölkopf
Gunnar Rätsch
Guy Tennenholtz
OffRL
169
8
0
01 Jun 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement
  Learning in Healthcare
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in HealthcareNeural Information Processing Systems (NeurIPS), 2023
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
180
46
0
02 May 2023
CREATED: Generating Viable Counterfactual Sequences for Predictive
  Process Analytics
CREATED: Generating Viable Counterfactual Sequences for Predictive Process AnalyticsInternational Conference on Advanced Information Systems Engineering (CAiSE), 2023
Olusanmi A. Hundogan
Xixi Lu
Yupei Du
H. Reijers
AI4TS
127
12
0
28 Mar 2023
Towards Computationally Efficient Responsibility Attribution in
  Decentralized Partially Observable MDPs
Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPsAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Stelios Triantafyllou
Goran Radanović
150
5
0
24 Feb 2023
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare
HOPE: Human-Centric Off-Policy Evaluation for E-Learning and HealthcareAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Ge Gao
Song Ju
Markel Sanz Ausin
Min Chi
OffRL
179
8
0
18 Feb 2023
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zijian Li
CML
366
49
0
10 Feb 2023
Counterfactual Identifiability of Bijective Causal Models
Counterfactual Identifiability of Bijective Causal ModelsInternational Conference on Machine Learning (ICML), 2023
Arash Nasr-Esfahany
MohammadIman Alizadeh
Devavrat Shah
CMLBDL
392
37
0
04 Feb 2023
Counterfactual (Non-)identifiability of Learned Structural Causal Models
Counterfactual (Non-)identifiability of Learned Structural Causal Models
Arash Nasr-Esfahany
Emre Kıcıman
158
16
0
22 Jan 2023
Risk Sensitive Dead-end Identification in Safety-Critical Offline
  Reinforcement Learning
Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Taylor W. Killian
S. Parbhoo
Marzyeh Ghassemi
OffRL
186
8
0
13 Jan 2023
Causal Temporal Reasoning for Markov Decision Processes
Causal Temporal Reasoning for Markov Decision Processes
Milad Kazemi
Nicola Paoletti
LRMAI4CE
224
2
0
16 Dec 2022
Counterfactuals for the Future
Counterfactuals for the FutureAAAI Conference on Artificial Intelligence (AAAI), 2022
Lucius E.J. Bynum
Joshua R. Loftus
Julia Stoyanovich
147
12
0
07 Dec 2022
Offline Policy Evaluation and Optimization under Confounding
Offline Policy Evaluation and Optimization under ConfoundingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
308
14
0
29 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Curiosity in Hindsight: Intrinsic Exploration in Stochastic EnvironmentsInternational Conference on Machine Learning (ICML), 2022
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
205
6
0
18 Nov 2022
123
Next