Informational Confidence Bounds for Self-Normalized Averages and Applications

13 September 2013
Aurélien Garivier

Papers citing "Informational Confidence Bounds for Self-Normalized Averages and Applications"

9 / 9 papers shown
Conservative Bandits
Yifan Wu
R. Shariff
Tor Lattimore
Csaba Szepesvári
155
98
0
13 Feb 2016
Kullback-Leibler upper confidence bounds for optimal sequential allocation
Olivier Cappé
Aurélien Garivier
Odalric-Ambrym Maillard
Rémi Munos
Gilles Stoltz
96
394
0
03 Oct 2012
Optimal discovery with probabilistic expert advice: finite time analysis and macroscopic optimality
Sébastien Bubeck
D. Ernst
Aurélien Garivier
48
30
0
22 Jul 2012
Consistency of maximum-likelihood and variational estimators in the Stochastic Block Model
Alain Celisse
J. Daudin
L. Pierre
91
197
0
17 May 2011
Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems
Yasin Abbasi-Yadkori
D. Pál
Csaba Szepesvári
77
70
0
14 Feb 2011
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
Aurélien Garivier
Olivier Cappé
135
613
0
12 Feb 2011
Optimism in Reinforcement Learning and Kullback-Leibler Divergence
Sarah Filippi
Olivier Cappé
Aurélien Garivier
110
105
0
29 Apr 2010
Context tree selection and linguistic rhythm retrieval from written texts
A. Galves
Charlotte Galves
Jesús E. García
N. Garcia
Florencia Leonardi
50
63
0
20 Feb 2009
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems
Aurélien Garivier
Eric Moulines
84
294
0
22 May 2008