ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.01199
39
13
v1v2 (latest)

A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits

5 January 2020
Vrettos Moulos
ArXiv (abs)PDFHTML
Abstract

This paper develops a Hoeffding inequality for the partial sums ∑k=1nf(Xk)\sum_{k=1}^n f (X_k)∑k=1n​f(Xk​), where {Xk}k∈Z>0\{X_k\}_{k \in \mathbb{Z}_{> 0}}{Xk​}k∈Z>0​​ is an irreducible Markov chain on a finite state space SSS, and f:S→[a,b]f : S \to [a, b]f:S→[a,b] is a real-valued function. Our bound is simple, general, since it only assumes irreducibility and finiteness of the state space, and powerful. In order to demonstrate its usefulness we provide two applications in multi-armed bandit problems. The first is about identifying an approximately best Markovian arm, while the second is concerned with regret minimization in the context of Markovian bandits.

View on arXiv
Comments on this paper