244

Probabilistic Contraction Analysis of Iterated Random Operators

Abhishek Gupta
Abstract

Consider a contraction operator TT over a complete metric space X\mathcal X with the fixed point xx^\star. In many computational applications, it is difficult to compute T(x)T(x); therefore, one replaces the application contraction operator TT at iteration kk by a random operator T^kn\hat T^n_k using nn independent and identically distributed samples of a random variable. Consider the Markov chain (X^kn)kN(\hat X^n_k)_{k\in\mathbb{N}}, which is generated by X^k+1n=T^kn(X^kn)\hat X^n_{k+1} = \hat T^n_k(\hat X^n_k). In this paper, we identify some sufficient conditions under which (i) the distribution of X^kn\hat X^n_k converges to a Dirac mass over xx^\star as kk and nn go to infinity, and (ii) the probability that X^kn\hat X^n_k is far from xx^\star as kk goes to infinity can be made arbitrarily small by an appropriate choice of nn. We also derive an upper bound on the probability that X^kn\hat X^n_k is far from xx^\star as kk\rightarrow \infty. We apply the result to study the convergence in probability of iterates generated by empirical value iteration algorithms for discounted and average cost Markov decision problems.

View on arXiv
Comments on this paper