A second order regret bound for NormalHedge
Yoav Freund
Nicholas J. A. Harvey
Victor S. Portella
Yabing Qi
Yu-Xiang Wang
Main:11 Pages
2 Figures
Bibliography:4 Pages
1 Tables
Appendix:39 Pages
Abstract
We consider the problem of prediction with expert advice for ``easy'' sequences. We show that a variant of NormalHedge enjoys a second-order -quantile regret bound of $O\big(\sqrt{V_T \log(V_T/\epsilon)}\big) $ when , where is the cumulative second moment of instantaneous per-expert regret averaged with respect to a natural distribution determined by the algorithm. The algorithm is motivated by a continuous time limit using Stochastic Differential Equations. The discrete time analysis uses self-concordance techniques.
View on arXivComments on this paper
