v1v2v3 (latest)

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

14 July 2010

Cem Tekin

M. Liu

ArXiv (abs)PDF HTML

Papers citing "Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards"

24 / 24 papers shown

Restless Multi-Armed Bandits under Exogenous Global Markov ProcessIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Tomer Gafni

M. Yemini

Kobi Cohen

228

28 Feb 2022

Learning in Restless Bandits under Exogenous Global Markov Process

Tomer Gafni

M. Yemini

Kobi Cohen

265

17 Dec 2021

Bandit problems with fidelity rewards

Gábor Lugosi

Ciara Pike-Burke

Pierre-André Savalle

145

25 Nov 2021

Adaptive KL-UCB based Bandit Algorithms for Markovian and i.i.d. SettingsIEEE Transactions on Automatic Control (TAC), 2020

Member Ieee Arghyadip Roy

Fellow Ieee Sanjay Shakkottai

F. I. R. Srikant

468

14 Sep 2020

LACO: A Latency-Driven Network Slicing Orchestration in Beyond-5G NetworksIEEE Transactions on Wireless Communications (TWC), 2020

Lanfranco Zanzi

Vincenzo Sciancalepore

A. Garcia-Saavedra

Hans D. Schotten

Xavier Costa Pérez

114

07 Sep 2020

Bandit Learning with Delayed Impact of ActionsNeural Information Processing Systems (NeurIPS), 2020

Wei Tang

Chien-Ju Ho

Yang Liu

318

24 Feb 2020

Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards

Vrettos Moulos

262

30 Jan 2020

A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian BanditsInternational Symposium on Information Theory (ISIT), 2020

Vrettos Moulos

408

05 Jan 2020

Online Newton Step Algorithm with Estimated Gradient

Huan Liu

210

25 Nov 2018

Combinatorial Bandits for Incentivizing Agents with Dynamic PreferencesConference on Uncertainty in Artificial Intelligence (UAI), 2018

160

06 Jul 2018

An Asymptotically Optimal Algorithm for Communicating Multiplayer Multi-Armed Bandit Problems

Noyan Evirgen

Alper Köse

Hakan Gokcesu

02 Dec 2017

The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems

Noyan Evirgen

Alper Köse

161

05 Nov 2017

Asymptotic Allocation Rules for a Class of Dynamic Multi-armed Bandit Problems

T. W. U. Madhushani

D. H. S. Maithripala

Naomi Ehrich Leonard

142

02 Oct 2017

The Multi-Armed Bandit Problem: An Efficient Non-Parametric Solution

H. Chan

306

24 Mar 2017

Online Learning in Decentralized Multiuser Resource Sharing Problems

Cem Tekin

M. Liu

183

19 Oct 2012

Decentralized Learning for Multi-player Multi-armed BanditsIEEE Conference on Decision and Control (CDC), 2012

D. Kalathil

Naumaan Nayyar

R. Jain

255

14 Jun 2012

Online Learning for Combinatorial Network Optimization with Restless Markovian Rewards

Yi Gai

Bhaskar Krishnamachari

M. Liu

OffRL

218

08 Sep 2011

Performance and Convergence of Multi-user Online LearningInternational ICST Conference on Game Theory for Networks (GameNets), 2011

Cem Tekin

M. Liu

222

21 Jul 2011

Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit ProblemsIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2011

Sattar Vakili

Keqin Liu

Qing Zhao

425

112

30 Jun 2011

Online Learning of Rested and Restless BanditsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2011

Cem Tekin

M. Liu

275

210

17 Feb 2011

Decentralized Restless Bandit with Multiple Players and Unknown Dynamics

Haoyang Liu

Keqin Liu

Qing Zhao

267

15 Feb 2011

On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards

Yi Gai

Bhaskar Krishnamachari

M. Liu

396

14 Dec 2010

Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics

Haoyang Liu

Keqin Liu

Qing Zhao

381

171

22 Nov 2010

Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach

Cem Tekin

M. Liu

329

106

01 Oct 2010