A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov GamesInternational Conference on Machine Learning (ICML), 2022 |
Online Learning for Unknown Partially Observable MDPsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021 |