Hierarchical Bayesian Bandits

12 November 2021

Papers citing "Hierarchical Bayesian Bandits"

30 / 30 papers shown

Title
A Classification View on Meta Learning Bandits Mirco Mutti Jeongyeol Kwon Shie Mannor Aviv Tamar 18 0 0 06 Apr 2025
Online Posterior Sampling with a Diffusion Prior B. Kveton Boris Oreshkin Youngsuk Park Aniket Deshmukh Rui Song DiffM 25 0 0 04 Oct 2024
Meta Clustering of Neural Bandits Yikun Ban Yunzhe Qi Tianxin Wei Lihui Liu Jingrui He 30 2 0 10 Aug 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning Subhojyoti Mukherjee Josiah P. Hanna Qiaomin Xie Robert Nowak 61 2 0 07 Jun 2024
Federated Linear Contextual Bandits with Heterogeneous Clients Ethan Blaser Chuanhao Li Hongning Wang FedML 19 1 0 29 Feb 2024
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use Susobhan Ghosh Yongyi Guo Pei-Yao Hung Lara N. Coughlin Erin Bonar Inbal Nahum-Shani Maureen A. Walton Susan Murphy 21 4 0 27 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces Imad Aouali DiffM 11 4 0 15 Feb 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits Nicolas Nguyen Imad Aouali András Gyorgy Claire Vernade 20 2 0 08 Feb 2024
Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis Sharu Theresa Jose Shana Moothedath 17 2 0 21 Jan 2024
Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning Mirco Mutti Ric De Santi Marcello Restelli Alexander Marx Giorgia Ramponi CML 17 4 0 11 Oct 2023
Online Clustering of Bandits with Misspecified User Models Zhiyong Wang Jize Xie Xutong Liu Shuai Li J. C. Lui 31 10 0 04 Oct 2023
Finite-Time Logarithmic Bayes Regret Upper Bounds Alexia Atsidakou B. Kveton S. Katariya C. Caramanis Sujay Sanghavi 16 0 0 15 Jun 2023
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling Aadirupa Saha B. Kveton 15 1 0 16 Mar 2023
Multi-Task Off-Policy Learning from Bandit Feedback Joey Hong B. Kveton S. Katariya Manzil Zaheer Mohammad Ghavamzadeh OffRL 23 10 0 09 Dec 2022
Bayesian Fixed-Budget Best-Arm Identification Alexia Atsidakou S. Katariya Sujay Sanghavi B. Kveton 14 11 0 15 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret Felix Schur Parnian Kassraie Jonas Rothfuss Andreas Krause 22 3 0 27 Oct 2022
Robust Contextual Linear Bandits Rong Zhu B. Kveton 4 3 0 26 Oct 2022
Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables Mengdi Xu Peide Huang Yaru Niu Visak C. V. Kumar Jielin Qiu ... Kuan-Hui Lee Xuewei Qi H. Lam Bo-wen Li Ding Zhao OOD 52 9 0 21 Oct 2022
Hierarchical Conversational Preference Elicitation with Bandit Feedback Jinhang Zuo Songwen Hu Tong Yu Shuai Li Handong Zhao Carlee Joe-Wong 27 9 0 06 Sep 2022
Thompson Sampling for Robust Transfer in Multi-Task Bandits Zhi Wang Chicheng Zhang Kamalika Chaudhuri AAML 22 5 0 17 Jun 2022
FedPop: A Bayesian Approach for Personalised Federated Learning Nikita Kotelevskii Maxime Vono Eric Moulines Alain Durmus FedML 11 34 0 07 Jun 2022
Mixed-Effect Thompson Sampling Imad Aouali B. Kveton S. Katariya OffRL 24 11 0 30 May 2022
Multi-Environment Meta-Learning in Stochastic Linear Bandits Ahmadreza Moradipari Mohammad Ghavamzadeh Taha Rajabzadeh Christos Thrampoulidis M. Alizadeh 14 4 0 12 May 2022
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework Runzhe Wan Linjuan Ge Rui Song 8 13 0 26 Feb 2022
Meta-Learning for Simple Regret Minimization Javad Azizi B. Kveton Mohammad Ghavamzadeh S. Katariya 10 10 0 25 Feb 2022
Deep Hierarchy in Bandits Joey Hong B. Kveton S. Katariya Manzil Zaheer Mohammad Ghavamzadeh 17 20 0 03 Feb 2022
No Regrets for Learning the Prior in Bandits Soumya Basu B. Kveton Manzil Zaheer Csaba Szepesvári 38 33 0 13 Jul 2021
Metalearning Linear Bandits by Prior Update Amit Peleg Naama Pearl Ron Meir 29 18 0 12 Jul 2021
Federated Multi-Armed Bandits Chengshuai Shi Cong Shen FedML 47 91 0 28 Jan 2021
Necessary and Sufficient Conditions for Inverse Reinforcement Learning of Bayesian Stopping Time Problems Kunal Pattanayak Vikram Krishnamurthy OffRL 14 4 0 07 Jul 2020