ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2109.01656
  4. Cited By
Thompson Sampling for Bandits with Clustered Arms
v1v2v3 (latest)

Thompson Sampling for Bandits with Clustered Arms

6 September 2021
Emil Carlsson
Devdatt Dubhashi
Fredrik D. Johansson
ArXiv (abs)PDFHTML

Papers citing "Thompson Sampling for Bandits with Clustered Arms"

4 / 4 papers shown
Title
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention
H. Khosravi
Mohammad Reza Shafie
Ahmed Shoyeb Raihan
Srinjoy Das
I. Imtiaz Ahmed
71
0
0
01 Mar 2025
Bilevel Multi-Armed Bandit-Based Hierarchical Reinforcement Learning for Interaction-Aware Self-Driving at Unsignalized Intersections
Bilevel Multi-Armed Bandit-Based Hierarchical Reinforcement Learning for Interaction-Aware Self-Driving at Unsignalized Intersections
Zengqi Peng
Yubin Wang
Lei Zheng
Jun Ma
83
1
0
06 Feb 2025
Clustered Linear Contextual Bandits with Knapsacks
Clustered Linear Contextual Bandits with Knapsacks
Yichuan Deng
M. Mamakos
Zhao Song
63
0
0
21 Aug 2023
Optimal Clustering with Bandit Feedback
Optimal Clustering with Bandit Feedback
Junwen Yang
Zixin Zhong
Vincent Y. F. Tan
65
12
0
09 Feb 2022
1