154

Near-Optimal Private Learning in Linear Contextual Bandits

Main:14 Pages
Bibliography:3 Pages
1 Tables
Appendix:28 Pages
Abstract

We analyze the problem of private learning in generalized linear contextual bandits. Our approach is based on a novel method of re-weighted regression, yielding an efficient algorithm with regret of order T+1α\sqrt{T}+\frac{1}{\alpha} and T/α\sqrt{T}/\alpha in the joint and local model of α\alpha-privacy, respectively. Further, we provide near-optimal private procedures that achieve dimension-independent rates in private linear models and linear contextual bandits. In particular, our results imply that joint privacy is almost "for free" in all the settings we consider, partially addressing the open problem posed by Azize and Basu (2024).

View on arXiv
Comments on this paper