VITS : Variational Inference Thompson Sampling for contextual banditsInternational Conference on Machine Learning (ICML), 2023 |
Provable and Practical: Efficient Exploration in Reinforcement Learning
via Langevin Monte CarloInternational Conference on Learning Representations (ICLR), 2023 |
Variational Bayesian Optimistic SamplingNeural Information Processing Systems (NeurIPS), 2021 |
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored
Online Binary Classification James A. Grant David S. Leslie |