245

Probabilistic Contraction Analysis of Iterated Random Operators

Abhishek Gupta
Abstract

Consider a contraction operator TT over a Polish space X\mathcal X with a fixed point xx^\star. Assume that one can approximate the operator TT by a random operator T^n\hat T^n using nNn\in\mathbb{N} independent and identically distributed samples of a random variable. Consider the sequence (X^kn)kN(\hat X^n_k)_{k\in\mathbb{N}}, which is generated by X^k+1n=T^n(X^kn)\hat X^n_{k+1} = \hat T^n(\hat X^n_k) and is a random sequence. In this paper, we prove that under certain conditions on the random operator, (i) the distribution of X^kn\hat X^n_k converges to a unit mass over xx^\star as kk and nn goes to infinity, and (ii) the probability that X^kn\hat X^n_k is far from xx^\star as kk goes to infinity can be made arbitrarily small by an appropriate choice of nn. We also find a lower bound on the probability that X^kn\hat X^n_k is far from xx^\star as kk\rightarrow \infty. We apply the result to study probabilistic convergence of certain randomized value and Q-value iteration algorithms for discounted and average cost Markov decision processes.

View on arXiv
Comments on this paper