ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2202.05193
46
1
v1v2v3 (latest)

Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

10 February 2022
Junpei Komiyama
ArXiv (abs)PDFHTML
Abstract

We consider the fixed-budget best-arm identification problem with Normal reward distributions. In this problem, the forecaster is given KKK arms (or treatments) and TTT time steps. The forecaster attempts to find the best arm, defined by the largest mean, via an adaptive experiment conducted using an algorithm. The algorithm's performance is measured by the simple regret, that is, the quality of the estimated best arm. The frequentist simple regret can be exponentially small to TTT, whereas the Bayesian simple regret is polynomially small to TTT. This paper demonstrates that Bayes optimal algorithm, which minimizes the Bayesian simple regret, does not produce an exponential simple regret for some parameters, a finding that contrasts with the many results indicating the asymptotic equivalence of Bayesian and frequentist algorithms in the context of fixed sampling regimes. While the Bayes optimal algorithm is described in terms of a recursive equation that is virtually impossible to compute exactly, we establish the foundations for further analysis by introducing a key quantity that we call the expected Bellman improvement.

View on arXiv
Comments on this paper