ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.09312
117
2
v1v2v3v4v5v6v7v8 (latest)

Near Optimal Adversarial Attack on UCB Bandits

21 August 2020
Shiliang Zuo
    AAML
ArXiv (abs)PDFHTML
Abstract

We consider a stochastic multi-arm bandit problem where rewards are subject to adversarial corruption. We propose a novel attack strategy that manipulates a UCB principle into pulling some non-optimal target arm T−o(T)T - o(T)T−o(T) times with a cumulative cost that scales as log⁡T\sqrt{\log T}logT​, where TTT is the number of rounds. We also prove the first lower bound on the cumulative attack cost. Our lower bound matches our upper bound up to log⁡log⁡T\log \log TloglogT factors, showing our attack to be near optimal.

View on arXiv
Comments on this paper