ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.10682
81
1

Stochastic kkk-Submodular Bandits with Full Bandit Feedback

14 December 2024
G. Nie
Vaneet Aggarwal
Christopher J. Quinn
    OffRL
ArXivPDFHTML
Abstract

In this paper, we present the first sublinear α\alphaα-regret bounds for online kkk-submodular optimization problems with full-bandit feedback, where α\alphaα is a corresponding offline approximation ratio. Specifically, we propose online algorithms for multiple kkk-submodular stochastic combinatorial multi-armed bandit problems, including (i) monotone functions and individual size constraints, (ii) monotone functions with matroid constraints, (iii) non-monotone functions with matroid constraints, (iv) non-monotone functions without constraints, and (v) monotone functions without constraints. We transform approximation algorithms for offline kkk-submodular maximization problems into online algorithms through the offline-to-online framework proposed by Nie et al. (2023a). A key contribution of our work is analyzing the robustness of the offline algorithms.

View on arXiv
Comments on this paper