Hoeffding's lemma for Markov Chains and its applications to statistical learning

We establish the counterpart of Hoeffding's lemma for Markov dependent random variables. Specifically, if a stationary Markov chain with invariant measure admits an -spectral gap , then for any bounded functions , the sum of is sub-Gaussian with variance proxy . The counterpart of Hoeffding's inequality immediately follows. Our results assume none of reversibility, countable state space and time-homogeneity of Markov chains. They are optimal in terms of the multiplicative coefficient , and cover Hoeffding's lemma and inequality for independent random variables as special cases with . We illustrate the utility of these results by applying them to six problems in statistics and machine learning. They are linear regression, lasso regression, sparse covariance matrix estimation with Markov-dependent samples; Markov chain Monte Carlo estimation; respondence driven sampling; and multi-armed bandit problems with Markovian rewards.
View on arXiv