Lower Bounds for -Regret via the Decision-Estimation Coefficient
- OffRL

Abstract
In this note, we give a new lower bound for the -regret in bandit problems, the regret which arises when comparing against a benchmark that is times the optimal solution, i.e., . The -regret arises in structured bandit problems where finding an exact optimum of is intractable. Our lower bound is given in terms of a modification of the constrained Decision-Estimation Coefficient (DEC) of~\citet{foster2023tight} (and closely related to the original offset DEC of \citet{foster2021statistical}), which we term the -DEC. When restricted to the traditional regret setting where , our result removes the logarithmic factors in the lower bound of \citet{foster2023tight}.
View on arXivComments on this paper