Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.01656
Cited By
v1
v2
v3 (latest)
Thompson Sampling for Bandits with Clustered Arms
6 September 2021
Emil Carlsson
Devdatt Dubhashi
Fredrik D. Johansson
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Thompson Sampling for Bandits with Clustered Arms"
4 / 4 papers shown
Title
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention
H. Khosravi
Mohammad Reza Shafie
Ahmed Shoyeb Raihan
Srinjoy Das
I. Imtiaz Ahmed
71
0
0
01 Mar 2025
Bilevel Multi-Armed Bandit-Based Hierarchical Reinforcement Learning for Interaction-Aware Self-Driving at Unsignalized Intersections
Zengqi Peng
Yubin Wang
Lei Zheng
Jun Ma
83
1
0
06 Feb 2025
Clustered Linear Contextual Bandits with Knapsacks
Yichuan Deng
M. Mamakos
Zhao Song
63
0
0
21 Aug 2023
Optimal Clustering with Bandit Feedback
Junwen Yang
Zixin Zhong
Vincent Y. F. Tan
65
12
0
09 Feb 2022
1