Corralling a Band of Bandit Algorithms

Annual Conference Computational Learning Theory (COLT), 2016

19 December 2016

Papers citing "Corralling a Band of Bandit Algorithms"

21 / 121 papers shown

Learning The Best Expert Efficiently

Daron Anderson

D. Leith

147

11 Nov 2019

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes

International Conference on Machine Learning (ICML), 2019

Chen-Yu Wei

Mehdi Jafarnia-Jahromi

Haipeng Luo

Hiteshi Sharma

R. Jain

344

113

15 Oct 2019

Accelerated learning from recommender systems using multi-armed bandit

258

16 Aug 2019

Bandit Convex Optimization in Non-stationary Environments

International Conference on Artificial Intelligence and Statistics (AISTATS), 2019

206

29 Jul 2019

Bandits with Feedback Graphs and Switching Costs

Neural Information Processing Systems (NeurIPS), 2019

R. Arora

T. V. Marinov

M. Mohri

197

29 Jul 2019

Model selection for contextual bandits

Neural Information Processing Systems (NeurIPS), 2019

522

03 Jun 2019

Equipping Experts/Bandits with Long-term Memory

Neural Information Processing Systems (NeurIPS), 2019

172

30 May 2019

OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits

International Conference on Artificial Intelligence and Statistics (AISTATS), 2019

Niladri S. Chatterji

Vidya Muthukumar

Peter L. Bartlett

213

24 May 2019

Introduction to Multi-Armed Bandits

Aleksandrs Slivkins

1.3K

1,170

15 Apr 2019

Hedging the Drift: Learning to Optimize under Non-Stationarity

Management Sciences (MS), 2019

Wang Chi Cheung

D. Simchi-Levi

Ruihao Zhu

287

100

04 Mar 2019

Bandit Principal Component Analysis

W. Kotłowski

Gergely Neu

173

08 Feb 2019

Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting

379

05 Feb 2019

Learning to Collaborate in Markov Decision Processes

231

23 Jan 2019

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

335

02 Jan 2019

Learning to Optimize under Non-Stationarity

Wang Chi Cheung

D. Simchi-Levi

Ruihao Zhu

569

147

06 Oct 2018

Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits

Journal of Machine Learning Research (JMLR), 2018

Julian Zimmert

Yevgeny Seldin

553

198

19 Jul 2018

Best of many worlds: Robust model selection for online supervised learning

170

22 May 2018

Efficient Online Portfolio with Logarithmic Regret

Haipeng Luo

Chen-Yu Wei

Kai Zheng

157

18 May 2018

More Adaptive Algorithms for Adversarial Bandits

Chen-Yu Wei

Haipeng Luo

529

195

10 Jan 2018

Efficient Contextual Bandits in Non-stationary Worlds

299

143

05 Aug 2017

Learning to Use Learners' Advice

145

16 Feb 2017