A Concentration Bound for LSPE(λ)

Abstract

The popular LSPE(λ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high-probability performance guarantees from some time on.
