ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.06506
35
5

Online Newton Method for Bandit Convex Optimisation

10 June 2024
Hidde Fokkema
Dirk van der Hoeven
Tor Lattimore
Jack J. Mayo
ArXivPDFHTML
Abstract

We introduce a computationally efficient algorithm for zeroth-order bandit convex optimisation and prove that in the adversarial setting its regret is at most d3.5npolylog(n,d)d^{3.5} \sqrt{n} \mathrm{polylog}(n, d)d3.5n​polylog(n,d) with high probability where ddd is the dimension and nnn is the time horizon. In the stochastic setting the bound improves to Md2npolylog(n,d)M d^{2} \sqrt{n} \mathrm{polylog}(n, d)Md2n​polylog(n,d) where M∈[d−1/2,d−1/4]M \in [d^{-1/2}, d^{-1 / 4}]M∈[d−1/2,d−1/4] is a constant that depends on the geometry of the constraint set and the desired computational properties.

View on arXiv
Comments on this paper