16
4

Improved Space Bounds for Learning with Experts

Abstract

We give improved tradeoffs between space and regret for the online learning with expert advice problem over TT days with nn experts. Given a space budget of nδn^{\delta} for δ(0,1)\delta \in (0,1), we provide an algorithm achieving regret O~(n2T1/(1+δ))\tilde{O}(n^2 T^{1/(1+\delta)}), improving upon the regret bound O~(n2T2/(2+δ))\tilde{O}(n^2 T^{2/(2+\delta)}) in the recent work of [PZ23]. The improvement is particularly salient in the regime δ1\delta \rightarrow 1 where the regret of our algorithm approaches O~n(T)\tilde{O}_n(\sqrt{T}), matching the TT dependence in the standard online setting without space restrictions.

View on arXiv
Comments on this paper