ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1311.0466
  4. Cited By
Thompson Sampling for Complex Bandit Problems

Thompson Sampling for Complex Bandit Problems

3 November 2013
Aditya Gopalan
Shie Mannor
Yishay Mansour
ArXivPDFHTML

Papers citing "Thompson Sampling for Complex Bandit Problems"

12 / 12 papers shown
Title
Safe Linear Thompson Sampling with Side Information
Safe Linear Thompson Sampling with Side Information
Ahmadreza Moradipari
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
106
43
0
06 Nov 2019
Improving Regret Bounds for Combinatorial Semi-Bandits with
  Probabilistically Triggered Arms and Its Applications
Improving Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms and Its Applications
Qinshi Wang
Wei Chen
47
87
0
05 Mar 2017
Combinatorial Multi-Armed Bandit with General Reward Functions
Combinatorial Multi-Armed Bandit with General Reward Functions
Wei Chen
Wei Hu
Fu Li
Jiacheng Li
Yu Liu
Pinyan Lu
53
132
0
20 Oct 2016
Thompson Sampling for 1-Dimensional Exponential Family Bandits
Thompson Sampling for 1-Dimensional Exponential Family Bandits
N. Korda
E. Kaufmann
Rémi Munos
62
155
0
12 Jul 2013
(More) Efficient Reinforcement Learning via Posterior Sampling
(More) Efficient Reinforcement Learning via Posterior Sampling
Ian Osband
Daniel Russo
Benjamin Van Roy
105
529
0
04 Jun 2013
Learning to Optimize Via Posterior Sampling
Learning to Optimize Via Posterior Sampling
Daniel Russo
Benjamin Van Roy
163
699
0
11 Jan 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
164
993
0
15 Sep 2012
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis
E. Kaufmann
N. Korda
Rémi Munos
128
585
0
18 May 2012
Minimax Policies for Combinatorial Prediction Games
Minimax Policies for Combinatorial Prediction Games
Jean-Yves Audibert
Sébastien Bubeck
Gabor Lugosi
OffRL
138
81
0
24 May 2011
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
Aurélien Garivier
Olivier Cappé
133
613
0
12 Feb 2011
X-Armed Bandits
X-Armed Bandits
Sébastien Bubeck
Rémi Munos
Gilles Stoltz
Csaba Szepesvari
126
383
0
25 Jan 2010
A Minimum Relative Entropy Principle for Learning and Acting
A Minimum Relative Entropy Principle for Learning and Acting
Pedro A. Ortega
Daniel A. Braun
107
125
0
20 Oct 2008
1