Informational Confidence Bounds for Self-Normalized Averages and Applications

13 September 2013

Papers citing "Informational Confidence Bounds for Self-Normalized Averages and Applications"


Title | Authors | Counts | Date
Conservative Bandits | Yifan Wu, R. Shariff, Tor Lattimore, Csaba Szepesvári | 155 98 0 | 13 Feb 2016
Kullback-Leibler upper confidence bounds for optimal sequential allocation | Olivier Cappé, Aurélien Garivier, Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz | 96 394 0 | 03 Oct 2012
Optimal discovery with probabilistic expert advice: finite time analysis and macroscopic optimality | Sébastien Bubeck, D. Ernst, Aurélien Garivier | 48 30 0 | 22 Jul 2012
Consistency of maximum-likelihood and variational estimators in the Stochastic Block Model | Alain Celisse, J. Daudin, L. Pierre | 91 197 0 | 17 May 2011
Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems | Yasin Abbasi-Yadkori, D. Pál, Csaba Szepesvári | 77 70 0 | 14 Feb 2011
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond | Aurélien Garivier, Olivier Cappé | 135 613 0 | 12 Feb 2011
Optimism in Reinforcement Learning and Kullback-Leibler Divergence | Sarah Filippi, Olivier Cappé, Aurélien Garivier | 110 105 0 | 29 Apr 2010
Context tree selection and linguistic rhythm retrieval from written texts | A. Galves, Charlotte Galves, Jesús E. García, N. Garcia, Florencia Leonardi | 50 63 0 | 20 Feb 2009
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems | Aurélien Garivier, Eric Moulines | 84 294 0 | 22 May 2008