Double Doubly Robust Thompson Sampling for Generalized Linear Contextual
Bandits

v1v2 (latest)

Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits

AAAI Conference on Artificial Intelligence (AAAI), 2022

15 September 2022

Wonyoung Hedge Kim

ArXiv (abs)PDF HTML

Papers citing "Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits"

12 / 12 papers shown

Title
Neural Logistic Bandits Seoungbin Bae Dabeen Lee 906 1 0 04 May 2025
Linear Bandits with Partially Observable Features Wonyoung Hedge Kim Sungwoo Park G. Iyengar A. Zeevi Min Hwan Oh 597 3 0 10 Feb 2025
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits Junghyun Lee Se-Young Yun Kwang-Sung Jun 525 14 0 19 Jul 2024
Nearly Minimax Optimal Regret for Multinomial Logistic BanditNeural Information Processing Systems (NeurIPS), 2024 Joongkyu Lee Min-hwan Oh 271 11 0 16 May 2024
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Easton K. Huch Jieru Shi Madeline R Abbott J. Golbus Alexander Moreno Walter Dempsey OffRL 152 0 0 11 Dec 2023
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set ConversionInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Junghyun Lee Se-Young Yun Kwang-Sung Jun 344 19 0 28 Oct 2023
A Doubly Robust Approach to Sparse Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Wonyoung Hedge Kim Garud Iyengar A. Zeevi 143 4 0 23 Oct 2023
Learning the Pareto Front Using Bootstrapped Observation SamplesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Wonyoung Hedge Kim G. Iyengar A. Zeevi 223 7 0 31 May 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit FeedbackInternational Conference on Machine Learning (ICML), 2023 Wonyoung Hedge Kim G. Iyengar A. Zeevi 138 4 0 31 Jan 2023
Risk-aware linear bandits with convex lossInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022 Patrick Saux Odalric-Ambrym Maillard 146 3 0 15 Sep 2022
Ranking In Generalized Linear Bandits Amitis Shidani George Deligiannidis Arnaud Doucet 180 1 0 30 Jun 2022
Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022 Wonyoung Hedge Kim M. Paik Min-whan Oh 173 6 0 11 Jun 2022