
Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning

Guannan Qu
Adam Wierman
Abstract

We consider a general asynchronous Stochastic Approximation (SA) scheme featuring a weighted infinity-norm contractive operator, and prove a bound on its finite-time convergence rate along a single trajectory. We then specialize the result to asynchronous Q-learning. The resulting bound matches the sharpest available bound for synchronous Q-learning and improves on previously known bounds for asynchronous Q-learning.
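
For readers unfamiliar with the setting, the following is a minimal sketch of tabular asynchronous Q-learning on a single trajectory: only the currently visited state-action pair is updated at each step. It is an illustration, not the paper's algorithm or analysis. The toy MDP, the uniform behaviour policy, the visit-count step size 1/n, and the helper names `async_q_learning` and `value_iteration` are all assumptions made here, and the error is measured in the plain (unweighted) infinity norm.

```python
import random


def async_q_learning(P, R, gamma, steps, seed=0):
    """Tabular asynchronous Q-learning along one trajectory.

    P[s][a] is a list of next-state probabilities and R[s][a] is a
    deterministic reward (both hypothetical inputs for this sketch).
    """
    rng = random.Random(seed)
    n_states, n_actions = len(P), len(P[0])
    Q = [[0.0] * n_actions for _ in range(n_states)]
    visits = [[0] * n_actions for _ in range(n_states)]
    s = 0
    for _ in range(steps):
        # Assumption: actions are drawn uniformly at random.
        a = rng.randrange(n_actions)
        s_next = rng.choices(range(n_states), weights=P[s][a])[0]
        # Assumption: step size 1/n, with n the visit count of (s, a).
        visits[s][a] += 1
        alpha = 1.0 / visits[s][a]
        # Asynchronous update: only the visited entry (s, a) changes.
        target = R[s][a] + gamma * max(Q[s_next])
        Q[s][a] += alpha * (target - Q[s][a])
        s = s_next
    return Q


def value_iteration(P, R, gamma, iters=200):
    """Reference Q* obtained by iterating the Bellman optimality operator."""
    n_states, n_actions = len(P), len(P[0])
    Q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(iters):
        Q = [[R[s][a] + gamma * sum(p * max(Q[s2])
                                    for s2, p in enumerate(P[s][a]))
              for a in range(n_actions)]
             for s in range(n_states)]
    return Q


def sup_error(Q, Q_ref):
    """Unweighted infinity-norm distance between two Q-tables."""
    return max(abs(q - r) for row, ref in zip(Q, Q_ref)
               for q, r in zip(row, ref))


if __name__ == "__main__":
    # Toy 2-state, 2-action MDP (made up for illustration).
    P = [[[0.9, 0.1], [0.2, 0.8]], [[0.5, 0.5], [0.1, 0.9]]]
    R = [[0.0, 1.0], [0.5, 0.2]]
    Q = async_q_learning(P, R, gamma=0.5, steps=100_000, seed=1)
    print(sup_error(Q, value_iteration(P, R, gamma=0.5)))
```

A finite-time bound of the kind described in the abstract controls how quickly an error such as `sup_error` shrinks with the number of steps along the trajectory; the sketch only lets one observe that decay empirically on a toy problem.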
