27
0

On the Problem of Best Arm Retention

Abstract

This paper presents a comprehensive study on the problem of Best Arm Retention (BAR), which has recently found applications in streaming algorithms for multi-armed bandits. In the BAR problem, the goal is to retain mm arms with the best arm included from nn after some trials, in stochastic multi-armed bandit settings. We first investigate pure exploration for the BAR problem under different criteria, and then minimize the regret with specific constraints, in the context of further exploration in streaming algorithms.- We begin by revisiting the lower bound for the (ε,δ)(\varepsilon,\delta)-PAC algorithm for Best Arm Identification (BAI) and adapt the classical KL-divergence argument to derive optimal bounds for (ε,δ)(\varepsilon,\delta)-PAC algorithms for BAR.- We further study another variant of the problem, called rr-BAR, which requires the expected gap between the best arm and the optimal arm retained is less than rr. We prove tight sample complexity for the problem.- We explore the regret minimization problem for rr-BAR and develop algorithm beyond pure exploration. We conclude with a conjecture on the optimal regret in this setting.

View on arXiv
@article{chen2025_2504.11866,
  title={ On the Problem of Best Arm Retention },
  author={ Houshuang Chen and Yuchen He and Chihao Zhang },
  journal={arXiv preprint arXiv:2504.11866},
  year={ 2025 }
}
Comments on this paper