Counterfactual Strategies for Markov Decision Processes

14 May 2025

Abstract

Counterfactuals are widely used in AI to explain how minimal changes to a model's input can lead to a different output. However, established methods for computing counterfactuals typically focus on one-step decision-making, and are not directly applicable to sequential decision-making tasks. This paper fills this gap by introducing counterfactual strategies for Markov Decision Processes (MDPs). During MDP execution, a strategy decides which of the enabled actions (with known probabilistic effects) to execute next. Given an initial strategy that reaches an undesired outcome with a probability above some limit, we identify minimal changes to the initial strategy to reduce that probability below the limit. We encode such counterfactual strategies as solutions to non-linear optimization problems, and further extend our encoding to synthesize diverse counterfactual strategies. We evaluate our approach on four real-world datasets and demonstrate its practical viability in sophisticated sequential decision-making tasks.

View on arXiv

@article{kobialka2025_2505.09412,
  title={ Counterfactual Strategies for Markov Decision Processes },
  author={ Paul Kobialka and Lina Gerlach and Francesco Leofante and Erika Ábrahám and Silvia Lizeth Tapia Tarifa and Einar Broch Johnsen },
  journal={arXiv preprint arXiv:2505.09412},
  year={ 2025 }
}

Comments on this paper