ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1007.2238
  4. Cited By
Online Algorithms for the Multi-Armed Bandit Problem with Markovian
  Rewards
v1v2v3 (latest)

Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards

14 July 2010
Cem Tekin
M. Liu
ArXiv (abs)PDFHTML

Papers citing "Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards"

24 / 24 papers shown
Restless Multi-Armed Bandits under Exogenous Global Markov Process
Restless Multi-Armed Bandits under Exogenous Global Markov ProcessIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Tomer Gafni
M. Yemini
Kobi Cohen
220
4
0
28 Feb 2022
Learning in Restless Bandits under Exogenous Global Markov Process
Learning in Restless Bandits under Exogenous Global Markov Process
Tomer Gafni
M. Yemini
Kobi Cohen
264
15
0
17 Dec 2021
Bandit problems with fidelity rewards
Bandit problems with fidelity rewards
Gábor Lugosi
Ciara Pike-Burke
Pierre-André Savalle
144
0
0
25 Nov 2021
Adaptive KL-UCB based Bandit Algorithms for Markovian and i.i.d.
  Settings
Adaptive KL-UCB based Bandit Algorithms for Markovian and i.i.d. SettingsIEEE Transactions on Automatic Control (TAC), 2020
Member Ieee Arghyadip Roy
Fellow Ieee Sanjay Shakkottai
F. I. R. Srikant
452
4
0
14 Sep 2020
LACO: A Latency-Driven Network Slicing Orchestration in Beyond-5G
  Networks
LACO: A Latency-Driven Network Slicing Orchestration in Beyond-5G NetworksIEEE Transactions on Wireless Communications (TWC), 2020
Lanfranco Zanzi
Vincenzo Sciancalepore
A. Garcia-Saavedra
Hans D. Schotten
Xavier Costa Pérez
108
47
0
07 Sep 2020
Bandit Learning with Delayed Impact of Actions
Bandit Learning with Delayed Impact of ActionsNeural Information Processing Systems (NeurIPS), 2020
Wei Tang
Chien-Ju Ho
Yang Liu
318
14
0
24 Feb 2020
Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence
  Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian
  Rewards
Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards
Vrettos Moulos
253
3
0
30 Jan 2020
A Hoeffding Inequality for Finite State Markov Chains and its
  Applications to Markovian Bandits
A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian BanditsInternational Symposium on Information Theory (ISIT), 2020
Vrettos Moulos
407
14
0
05 Jan 2020
Online Newton Step Algorithm with Estimated Gradient
Online Newton Step Algorithm with Estimated Gradient
Binbin Liu
Jundong Li
Yunquan Song
Xijun Liang
Ling Jian
Huan Liu
210
4
0
25 Nov 2018
Combinatorial Bandits for Incentivizing Agents with Dynamic Preferences
Combinatorial Bandits for Incentivizing Agents with Dynamic PreferencesConference on Uncertainty in Artificial Intelligence (UAI), 2018
Tanner Fiez
S. Sekar
Liyuan Zheng
Lillian J. Ratliff
160
3
0
06 Jul 2018
An Asymptotically Optimal Algorithm for Communicating Multiplayer
  Multi-Armed Bandit Problems
An Asymptotically Optimal Algorithm for Communicating Multiplayer Multi-Armed Bandit Problems
Noyan Evirgen
Alper Köse
Hakan Gokcesu
98
0
0
02 Dec 2017
The Effect of Communication on Noncooperative Multiplayer Multi-Armed
  Bandit Problems
The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems
Noyan Evirgen
Alper Köse
160
10
0
05 Nov 2017
Asymptotic Allocation Rules for a Class of Dynamic Multi-armed Bandit
  Problems
Asymptotic Allocation Rules for a Class of Dynamic Multi-armed Bandit Problems
T. W. U. Madhushani
D. H. S. Maithripala
Naomi Ehrich Leonard
139
0
0
02 Oct 2017
The Multi-Armed Bandit Problem: An Efficient Non-Parametric Solution
The Multi-Armed Bandit Problem: An Efficient Non-Parametric Solution
H. Chan
306
15
0
24 Mar 2017
Online Learning in Decentralized Multiuser Resource Sharing Problems
Online Learning in Decentralized Multiuser Resource Sharing Problems
Cem Tekin
M. Liu
182
5
0
19 Oct 2012
Decentralized Learning for Multi-player Multi-armed Bandits
Decentralized Learning for Multi-player Multi-armed BanditsIEEE Conference on Decision and Control (CDC), 2012
D. Kalathil
Naumaan Nayyar
R. Jain
255
46
0
14 Jun 2012
Online Learning for Combinatorial Network Optimization with Restless
  Markovian Rewards
Online Learning for Combinatorial Network Optimization with Restless Markovian Rewards
Yi Gai
Bhaskar Krishnamachari
M. Liu
OffRL
217
15
0
08 Sep 2011
Performance and Convergence of Multi-user Online Learning
Performance and Convergence of Multi-user Online LearningInternational ICST Conference on Game Theory for Networks (GameNets), 2011
Cem Tekin
M. Liu
221
29
0
21 Jul 2011
Deterministic Sequencing of Exploration and Exploitation for Multi-Armed
  Bandit Problems
Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit ProblemsIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2011
Sattar Vakili
Keqin Liu
Qing Zhao
420
112
0
30 Jun 2011
Online Learning of Rested and Restless Bandits
Online Learning of Rested and Restless BanditsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2011
Cem Tekin
M. Liu
272
210
0
17 Feb 2011
Decentralized Restless Bandit with Multiple Players and Unknown Dynamics
Decentralized Restless Bandit with Multiple Players and Unknown Dynamics
Haoyang Liu
Keqin Liu
Qing Zhao
260
2
0
15 Feb 2011
On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards
On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards
Yi Gai
Bhaskar Krishnamachari
M. Liu
381
30
0
14 Dec 2010
Learning in A Changing World: Restless Multi-Armed Bandit with Unknown
  Dynamics
Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics
Haoyang Liu
Keqin Liu
Qing Zhao
378
171
0
22 Nov 2010
Online Learning in Opportunistic Spectrum Access: A Restless Bandit
  Approach
Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach
Cem Tekin
M. Liu
328
106
0
01 Oct 2010
1
Page 1 of 1