On Generating Explanations for Reinforcement Learning Policies: An Empirical Study

29 September 2023

Abstract

Understanding a \textit{reinforcement learning} policy, which guides state-to-action mappings to maximize rewards, necessitates an accompanying explanation for human comprehension. In this paper, we introduce a set of \textit{linear temporal logic} formulae designed to provide explanations for policies, and an algorithm for searching through those formulae for the one that best explains a given policy. Our focus is on explanations that elucidate both the ultimate objectives accomplished by the policy and the prerequisite conditions it upholds throughout its execution. The effectiveness of our proposed approach is illustrated through a simulated game of capture-the-flag and a car-parking environment,

View on arXiv

@article{yuasa2025_2309.16960,
  title={ On Generating Explanations for Reinforcement Learning Policies: An Empirical Study },
  author={ Mikihisa Yuasa and Huy T. Tran and Ramavarapu S. Sreenivas },
  journal={arXiv preprint arXiv:2309.16960},
  year={ 2025 }
}

Comments on this paper