ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.06246
  4. Cited By
Corralling a Band of Bandit Algorithms
v1v2v3 (latest)

Corralling a Band of Bandit Algorithms

Annual Conference Computational Learning Theory (COLT), 2016
19 December 2016
Alekh Agarwal
Haipeng Luo
Behnam Neyshabur
Robert Schapire
ArXiv (abs)PDFHTML

Papers citing "Corralling a Band of Bandit Algorithms"

21 / 121 papers shown
Learning The Best Expert Efficiently
Learning The Best Expert Efficiently
Daron Anderson
D. Leith
147
1
0
11 Nov 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2019
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
344
113
0
15 Oct 2019
Accelerated learning from recommender systems using multi-armed bandit
Accelerated learning from recommender systems using multi-armed bandit
Meisam Hejazinia
Kyler M. Eastman
Shu Ye
A. Amirabadi
Ravi Divvela
258
3
0
16 Aug 2019
Bandit Convex Optimization in Non-stationary Environments
Bandit Convex Optimization in Non-stationary EnvironmentsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2019
Peng Zhao
G. Wang
Lijun Zhang
Zhi Zhou
206
54
0
29 Jul 2019
Bandits with Feedback Graphs and Switching Costs
Bandits with Feedback Graphs and Switching CostsNeural Information Processing Systems (NeurIPS), 2019
R. Arora
T. V. Marinov
M. Mohri
197
25
0
29 Jul 2019
Model selection for contextual bandits
Model selection for contextual banditsNeural Information Processing Systems (NeurIPS), 2019
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
522
96
0
03 Jun 2019
Equipping Experts/Bandits with Long-term Memory
Equipping Experts/Bandits with Long-term MemoryNeural Information Processing Systems (NeurIPS), 2019
Kai Zheng
Haipeng Luo
Ilias Diakonikolas
Liwei Wang
OffRL
172
15
0
30 May 2019
OSOM: A simultaneously optimal algorithm for multi-armed and linear
  contextual bandits
OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual banditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2019
Niladri S. Chatterji
Vidya Muthukumar
Peter L. Bartlett
213
48
0
24 May 2019
Introduction to Multi-Armed Bandits
Introduction to Multi-Armed Bandits
Aleksandrs Slivkins
1.3K
1,170
0
15 Apr 2019
Hedging the Drift: Learning to Optimize under Non-Stationarity
Hedging the Drift: Learning to Optimize under Non-StationarityManagement Sciences (MS), 2019
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
287
100
0
04 Mar 2019
Bandit Principal Component Analysis
Bandit Principal Component Analysis
W. Kotłowski
Gergely Neu
173
18
0
08 Feb 2019
Contextual Bandits with Continuous Actions: Smoothing, Zooming, and
  Adapting
Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting
A. Krishnamurthy
John Langford
Aleksandrs Slivkins
Chicheng Zhang
OffRL
379
70
0
05 Feb 2019
Learning to Collaborate in Markov Decision Processes
Learning to Collaborate in Markov Decision Processes
Goran Radanović
R. Devidze
David C. Parkes
Adish Singla
231
34
0
23 Jan 2019
Warm-starting Contextual Bandits: Robustly Combining Supervised and
  Bandit Feedback
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Chicheng Zhang
Alekh Agarwal
Hal Daumé
John Langford
S. Negahban
335
41
0
02 Jan 2019
Learning to Optimize under Non-Stationarity
Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
569
147
0
06 Oct 2018
Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits
Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial BanditsJournal of machine learning research (JMLR), 2018
Julian Zimmert
Yevgeny Seldin
AAML
553
198
0
19 Jul 2018
Best of many worlds: Robust model selection for online supervised
  learning
Best of many worlds: Robust model selection for online supervised learning
Vidya Muthukumar
Mitas Ray
A. Sahai
Peter L. Bartlett
OffRL
170
8
0
22 May 2018
Efficient Online Portfolio with Logarithmic Regret
Efficient Online Portfolio with Logarithmic Regret
Haipeng Luo
Chen-Yu Wei
Kai Zheng
157
49
0
18 May 2018
More Adaptive Algorithms for Adversarial Bandits
More Adaptive Algorithms for Adversarial Bandits
Chen-Yu Wei
Haipeng Luo
529
195
0
10 Jan 2018
Efficient Contextual Bandits in Non-stationary Worlds
Efficient Contextual Bandits in Non-stationary Worlds
Haipeng Luo
Chen-Yu Wei
Alekh Agarwal
John Langford
299
143
0
05 Aug 2017
Learning to Use Learners' Advice
Learning to Use Learners' Advice
Adish Singla
Seyed Hamed Hassani
Andreas Krause
OffRL
145
2
0
16 Feb 2017
Previous
123