Zero-Inflated Bandits

25 December 2023
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
Abstract

Many real-world bandit applications are characterized by sparse rewards, which can significantly hinder learning efficiency. Leveraging problem-specific structures for careful distribution modeling is recognized as essential for improving estimation efficiency in statistics. However, this approach remains under-explored in the context of bandits. To address this gap, we initiate the study of zero-inflated bandits, where the reward is modeled using a classic semi-parametric distribution known as the zero-inflated distribution. We develop algorithms based on the Upper Confidence Bound and Thompson Sampling frameworks for this specific structure. The superior empirical performance of these methods is demonstrated through extensive numerical studies.
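
To make the zero-inflated structure concrete, the following is a minimal sketch, not the authors' algorithm. It assumes each reward is the product of a Bernoulli indicator (non-zero with probability p) and a non-zero part bounded in (0, 1], so that an observed zero always comes from the indicator. Under that assumption, a UCB-style index can bound p and the non-zero mean separately and multiply the two bounds. The names `draw_zero_inflated`, `ProductUCBArm`, and `run`, and the Hoeffding-style bonus `sqrt(2 log t / n)`, are illustrative choices and are not taken from the paper.

```python
import math
import random


def draw_zero_inflated(p, nonzero_sampler, rng):
    """Return 0.0 with probability 1 - p, else a draw from the non-zero part."""
    return nonzero_sampler(rng) if rng.random() < p else 0.0


class ProductUCBArm:
    """Per-arm statistics for a product-of-upper-bounds index (illustrative)."""

    def __init__(self):
        self.pulls = 0          # total number of pulls
        self.nonzero = 0        # number of non-zero rewards observed
        self.nonzero_sum = 0.0  # sum of the non-zero rewards

    def update(self, reward):
        self.pulls += 1
        if reward != 0.0:
            self.nonzero += 1
            self.nonzero_sum += reward

    def index(self, t):
        # Unpulled arms get an infinite index so each arm is tried once.
        if self.pulls == 0:
            return float("inf")
        # Upper bound on the non-zero probability p, clipped to 1.
        p_hat = self.nonzero / self.pulls
        p_ucb = min(1.0, p_hat + math.sqrt(2 * math.log(t) / self.pulls))
        # Upper bound on the non-zero mean; with no non-zero data yet,
        # fall back to the assumed upper limit of 1.
        if self.nonzero == 0:
            mu_ucb = 1.0
        else:
            mu_hat = self.nonzero_sum / self.nonzero
            mu_ucb = min(1.0, mu_hat + math.sqrt(2 * math.log(t) / self.nonzero))
        return p_ucb * mu_ucb


def run(arms, horizon, seed=0):
    """Play `horizon` rounds; `arms` is a list of (p, nonzero_sampler) pairs."""
    rng = random.Random(seed)
    stats = [ProductUCBArm() for _ in arms]
    for t in range(1, horizon + 1):
        k = max(range(len(arms)), key=lambda i: stats[i].index(t))
        p, sampler = arms[k]
        stats[k].update(draw_zero_inflated(p, sampler, rng))
    return [s.pulls for s in stats]
```

For example, `run([(0.1, lambda r: r.uniform(0.5, 1.0)), (0.3, lambda r: r.uniform(0.5, 1.0))], 1000)` returns the pull count per arm for two hypothetical sparse-reward arms. Tracking the non-zero rewards separately means the estimate of the non-zero mean is not diluted by the many zeros, which is the intuition behind modeling the two components explicitly.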

@article{wei2025_2312.15595,
  title={Zero-Inflated Bandits},
  author={Haoyu Wei and Runzhe Wan and Lei Shi and Rui Song},
  journal={arXiv preprint arXiv:2312.15595},
  year={2025}
}