Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.06929
Cited By
Hierarchical Bayesian Bandits
12 November 2021
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
FedML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hierarchical Bayesian Bandits"
30 / 30 papers shown
Title
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
18
0
0
06 Apr 2025
Online Posterior Sampling with a Diffusion Prior
B. Kveton
Boris Oreshkin
Youngsuk Park
Aniket Deshmukh
Rui Song
DiffM
25
0
0
04 Oct 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
30
2
0
10 Aug 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
61
2
0
07 Jun 2024
Federated Linear Contextual Bandits with Heterogeneous Clients
Ethan Blaser
Chuanhao Li
Hongning Wang
FedML
19
1
0
29 Feb 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
21
4
0
27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
11
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
20
2
0
08 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
Sharu Theresa Jose
Shana Moothedath
17
2
0
21 Jan 2024
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
17
4
0
11 Oct 2023
Online Clustering of Bandits with Misspecified User Models
Zhiyong Wang
Jize Xie
Xutong Liu
Shuai Li
J. C. Lui
31
10
0
04 Oct 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds
Alexia Atsidakou
B. Kveton
S. Katariya
C. Caramanis
Sujay Sanghavi
16
0
0
15 Jun 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
15
1
0
16 Mar 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
23
10
0
09 Dec 2022
Bayesian Fixed-Budget Best-Arm Identification
Alexia Atsidakou
S. Katariya
Sujay Sanghavi
B. Kveton
14
11
0
15 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
22
3
0
27 Oct 2022
Robust Contextual Linear Bandits
Rong Zhu
B. Kveton
4
3
0
26 Oct 2022
Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables
Mengdi Xu
Peide Huang
Yaru Niu
Visak C. V. Kumar
Jielin Qiu
...
Kuan-Hui Lee
Xuewei Qi
H. Lam
Bo-wen Li
Ding Zhao
OOD
52
9
0
21 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
27
9
0
06 Sep 2022
Thompson Sampling for Robust Transfer in Multi-Task Bandits
Zhi Wang
Chicheng Zhang
Kamalika Chaudhuri
AAML
22
5
0
17 Jun 2022
FedPop: A Bayesian Approach for Personalised Federated Learning
Nikita Kotelevskii
Maxime Vono
Eric Moulines
Alain Durmus
FedML
11
34
0
07 Jun 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
24
11
0
30 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Ahmadreza Moradipari
Mohammad Ghavamzadeh
Taha Rajabzadeh
Christos Thrampoulidis
M. Alizadeh
14
4
0
12 May 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
8
13
0
26 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
10
10
0
25 Feb 2022
Deep Hierarchy in Bandits
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
17
20
0
03 Feb 2022
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
38
33
0
13 Jul 2021
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
29
18
0
12 Jul 2021
Federated Multi-Armed Bandits
Chengshuai Shi
Cong Shen
FedML
47
91
0
28 Jan 2021
Necessary and Sufficient Conditions for Inverse Reinforcement Learning of Bayesian Stopping Time Problems
Kunal Pattanayak
Vikram Krishnamurthy
OffRL
14
4
0
07 Jul 2020
1