v1v2v3 (latest)

Thompson Sampling for Bandits with Clustered Arms

6 September 2021

Emil Carlsson

Devdatt Dubhashi

Papers citing "Thompson Sampling for Bandits with Clustered Arms"

4 / 4 papers shown

Title
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention H. Khosravi Mohammad Reza Shafie Ahmed Shoyeb Raihan Srinjoy Das I. Imtiaz Ahmed 71 0 0 01 Mar 2025
Bilevel Multi-Armed Bandit-Based Hierarchical Reinforcement Learning for Interaction-Aware Self-Driving at Unsignalized Intersections Zengqi Peng Yubin Wang Lei Zheng Jun Ma 83 1 0 06 Feb 2025
Clustered Linear Contextual Bandits with Knapsacks Yichuan Deng M. Mamakos Zhao Song 63 0 0 21 Aug 2023
Optimal Clustering with Bandit Feedback Junwen Yang Zixin Zhong Vincent Y. F. Tan 65 12 0 09 Feb 2022