Faster Rates for Private Adversarial Bandits
- PICVFedML

We design new differentially private algorithms for the problems of adversarial bandits and bandits with expert advice. For adversarial bandits, we give a simple and efficient conversion of any non-private bandit algorithm to a private bandit algorithm. Instantiating our conversion with existing non-private bandit algorithms gives a regret upper bound of , improving upon the existing upper bound for all . In particular, our algorithms allow for sublinear expected regret even when , establishing the first known separation between central and local differential privacy for this problem. For bandits with expert advice, we give the first differentially private algorithms, with expected regret , and , where and are the number of actions and experts respectively. These rates allow us to get sublinear regret for different combinations of small and large and
View on arXiv@article{asi2025_2505.21790, title={ Faster Rates for Private Adversarial Bandits }, author={ Hilal Asi and Vinod Raman and Kunal Talwar }, journal={arXiv preprint arXiv:2505.21790}, year={ 2025 } }