ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.8428
  4. Cited By
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback

Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback

30 September 2014
N. Alon
Nicolò Cesa-Bianchi
Claudio Gentile
Shie Mannor
Yishay Mansour
Ohad Shamir
    OffRL
ArXivPDFHTML

Papers citing "Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback"

12 / 12 papers shown
Title
Bandit Regret Scaling with the Effective Loss Range
Bandit Regret Scaling with the Effective Loss Range
Nicolò Cesa-Bianchi
Ohad Shamir
59
8
0
15 May 2017
Online Learning with Feedback Graphs Without the Graphs
Online Learning with Feedback Graphs Without the Graphs
Alon Cohen
Tamir Hazan
Tomer Koren
62
59
0
23 May 2016
Online Learning with Gaussian Payoffs and Side Observations
Online Learning with Gaussian Payoffs and Side Observations
Yifan Wu
András Gyorgy
Csaba Szepesvári
42
45
0
27 Oct 2015
Explore no more: Improved high-probability regret bounds for
  non-stochastic bandits
Explore no more: Improved high-probability regret bounds for non-stochastic bandits
Gergely Neu
217
182
0
10 Jun 2015
Online Learning with Feedback Graphs: Beyond Bandits
Online Learning with Feedback Graphs: Beyond Bandits
N. Alon
Nicolò Cesa-Bianchi
O. Dekel
Tomer Koren
71
158
0
26 Feb 2015
From Bandits to Experts: A Tale of Domination and Independence
From Bandits to Experts: A Tale of Domination and Independence
N. Alon
Nicolò Cesa-Bianchi
Claudio Gentile
Yishay Mansour
122
79
0
17 Jul 2013
Leveraging Side Observations in Stochastic Bandits
Leveraging Side Observations in Stochastic Bandits
S. Caron
Branislav Kveton
Marc Lelarge
Smriti Bhagat
65
111
0
16 Oct 2012
Online Bandit Learning against an Adaptive Adversary: from Regret to
  Policy Regret
Online Bandit Learning against an Adaptive Adversary: from Regret to Policy Regret
R. Arora
O. Dekel
Ambuj Tewari
OffRL
63
194
0
27 Jun 2012
The multi-armed bandit problem with covariates
The multi-armed bandit problem with covariates
Vianney Perchet
Philippe Rigollet
291
173
0
27 Oct 2011
From Bandits to Experts: On the Value of Side-Observations
From Bandits to Experts: On the Value of Side-Observations
Shie Mannor
Ohad Shamir
OffRL
153
220
0
13 Jun 2011
Contextual Bandits with Similarity Information
Contextual Bandits with Similarity Information
Aleksandrs Slivkins
309
452
0
23 Jul 2009
Linearly Parameterized Bandits
Linearly Parameterized Bandits
Paat Rusmevichientong
J. Tsitsiklis
243
558
0
18 Dec 2008
1