22
1

Fully Unconstrained Online Learning

Abstract

We provide an online learning algorithm that obtains regret GwTlog(wGT)+w2+G2G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2 on GG-Lipschitz convex losses for any comparison point ww_\star without knowing either GG or w\|w_\star\|. Importantly, this matches the optimal bound GwTG\|w_\star\|\sqrt{T} available with such knowledge (up to logarithmic factors), unless either w\|w_\star\| or GG is so large that even GwTG\|w_\star\|\sqrt{T} is roughly linear in TT. Thus, it matches the optimal bound in all cases in which one can achieve sublinear regret, which arguably most "interesting" scenarios.

View on arXiv
Comments on this paper