Probabilistic Contraction Analysis of Iterated Random Operators
Consider a contraction operator over a complete metric space with the fixed point . In many computational applications, it is difficult to compute ; therefore, one replaces the application contraction operator at iteration by a random operator using independent and identically distributed samples of a random variable. Consider the Markov chain , which is generated by . In this paper, we identify some sufficient conditions under which (i) the distribution of converges to a Dirac mass over as and go to infinity, and (ii) the probability that is far from as goes to infinity can be made arbitrarily small by an appropriate choice of . We also derive an upper bound on the probability that is far from as . We apply the result to study the convergence in probability of iterates generated by empirical value iteration algorithms for discounted and average cost Markov decision problems.
View on arXiv