arXiv: 2111.06929
Hierarchical Bayesian Bandits


12 November 2021
Joey Hong
B. Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
    FedML

Papers citing "Hierarchical Bayesian Bandits"

30 / 30 papers shown
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
18
0
0
06 Apr 2025
Online Posterior Sampling with a Diffusion Prior
B. Kveton
Boris Oreshkin
Youngsuk Park
Aniket Deshmukh
Rui Song
DiffM
25
0
0
04 Oct 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
30
2
0
10 Aug 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
61
2
0
07 Jun 2024
Federated Linear Contextual Bandits with Heterogeneous Clients
Ethan Blaser
Chuanhao Li
Hongning Wang
FedML
19
1
0
29 Feb 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan Murphy
21
4
0
27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
11
4
0
15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
22
2
0
08 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
Sharu Theresa Jose
Shana Moothedath
17
2
0
21 Jan 2024
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Mirco Mutti
Ric De Santi
Marcello Restelli
Alexander Marx
Giorgia Ramponi
CML
17
4
0
11 Oct 2023
Online Clustering of Bandits with Misspecified User Models
Zhiyong Wang
Jize Xie
Xutong Liu
Shuai Li
J. C. Lui
31
10
0
04 Oct 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds
Alexia Atsidakou
B. Kveton
S. Katariya
C. Caramanis
Sujay Sanghavi
16
0
0
15 Jun 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling
Aadirupa Saha
B. Kveton
15
1
0
16 Mar 2023
Multi-Task Off-Policy Learning from Bandit Feedback
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
OffRL
23
10
0
09 Dec 2022
Bayesian Fixed-Budget Best-Arm Identification
Alexia Atsidakou
S. Katariya
Sujay Sanghavi
B. Kveton
14
11
0
15 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
24
3
0
27 Oct 2022
Robust Contextual Linear Bandits
Rong Zhu
B. Kveton
6
3
0
26 Oct 2022
Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables
Mengdi Xu
Peide Huang
Yaru Niu
Visak C. V. Kumar
Jielin Qiu
...
Kuan-Hui Lee
Xuewei Qi
H. Lam
Bo-wen Li
Ding Zhao
OOD
54
9
0
21 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback
Jinhang Zuo
Songwen Hu
Tong Yu
Shuai Li
Handong Zhao
Carlee Joe-Wong
27
9
0
06 Sep 2022
Thompson Sampling for Robust Transfer in Multi-Task Bandits
Zhi Wang
Chicheng Zhang
Kamalika Chaudhuri
AAML
22
5
0
17 Jun 2022
FedPop: A Bayesian Approach for Personalised Federated Learning
Nikita Kotelevskii
Maxime Vono
Eric Moulines
Alain Durmus
FedML
11
34
0
07 Jun 2022
Mixed-Effect Thompson Sampling
Imad Aouali
B. Kveton
S. Katariya
OffRL
24
11
0
30 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits
Ahmadreza Moradipari
Mohammad Ghavamzadeh
Taha Rajabzadeh
Christos Thrampoulidis
M. Alizadeh
14
4
0
12 May 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework
Runzhe Wan
Linjuan Ge
Rui Song
8
13
0
26 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
10
10
0
25 Feb 2022
Deep Hierarchy in Bandits
Joey Hong
B. Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
17
20
0
03 Feb 2022
No Regrets for Learning the Prior in Bandits
Soumya Basu
B. Kveton
Manzil Zaheer
Csaba Szepesvári
41
33
0
13 Jul 2021
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
29
18
0
12 Jul 2021
Federated Multi-Armed Bandits
Chengshuai Shi
Cong Shen
FedML
50
91
0
28 Jan 2021
Necessary and Sufficient Conditions for Inverse Reinforcement Learning of Bayesian Stopping Time Problems
Kunal Pattanayak
Vikram Krishnamurthy
OffRL
14
4
0
07 Jul 2020