Information-Theoretic Regret Bounds for Bandits with Fixed Expert Advice. Information Theory Workshop (ITW), 2023
Reward-Free Policy Space Compression for Reinforcement Learning. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022