Beyond Average Return in Markov Decision ProcessesNeural Information Processing Systems (NeurIPS), 2023 |
The Statistical Benefits of Quantile Temporal-Difference Learning for
Value EstimationInternational Conference on Machine Learning (ICML), 2023 |
Bridging Distributional and Risk-sensitive Reinforcement Learning with
Provable Regret BoundsJournal of machine learning research (JMLR), 2022 |