ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.06983
  4. Cited By
Double Doubly Robust Thompson Sampling for Generalized Linear Contextual
  Bandits
v1v2 (latest)

Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits

AAAI Conference on Artificial Intelligence (AAAI), 2022
15 September 2022
Wonyoung Hedge Kim
Kyungbok Lee
M. Paik
ArXiv (abs)PDFHTML

Papers citing "Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits"

12 / 12 papers shown
Title
Neural Logistic Bandits
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
906
1
0
04 May 2025
Linear Bandits with Partially Observable Features
Linear Bandits with Partially Observable Features
Wonyoung Hedge Kim
Sungwoo Park
G. Iyengar
A. Zeevi
Min Hwan Oh
597
3
0
10 Feb 2025
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
525
14
0
19 Jul 2024
Nearly Minimax Optimal Regret for Multinomial Logistic Bandit
Nearly Minimax Optimal Regret for Multinomial Logistic BanditNeural Information Processing Systems (NeurIPS), 2024
Joongkyu Lee
Min-hwan Oh
271
11
0
16 May 2024
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions
Easton K. Huch
Jieru Shi
Madeline R Abbott
J. Golbus
Alexander Moreno
Walter Dempsey
OffRL
152
0
0
11 Dec 2023
Improved Regret Bounds of (Multinomial) Logistic Bandits via
  Regret-to-Confidence-Set Conversion
Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set ConversionInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
344
19
0
28 Oct 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
A Doubly Robust Approach to Sparse Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
143
4
0
23 Oct 2023
Learning the Pareto Front Using Bootstrapped Observation Samples
Learning the Pareto Front Using Bootstrapped Observation SamplesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Wonyoung Hedge Kim
G. Iyengar
A. Zeevi
223
7
0
31 May 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with
  Bandit Feedback
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit FeedbackInternational Conference on Machine Learning (ICML), 2023
Wonyoung Hedge Kim
G. Iyengar
A. Zeevi
138
4
0
31 Jan 2023
Risk-aware linear bandits with convex loss
Risk-aware linear bandits with convex lossInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Patrick Saux
Odalric-Ambrym Maillard
146
3
0
15 Sep 2022
Ranking In Generalized Linear Bandits
Ranking In Generalized Linear Bandits
Amitis Shidani
George Deligiannidis
Arnaud Doucet
180
1
0
30 Jun 2022
Squeeze All: Novel Estimator and Self-Normalized Bound for Linear
  Contextual Bandits
Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Wonyoung Hedge Kim
M. Paik
Min-whan Oh
173
6
0
11 Jun 2022
1