Stability of Stochastic Approximations with `Controlled Markov' Noise
and Temporal Difference Learning

v1v2 (latest)

Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning

23 April 2015

Arunselvan Ramaswamy

ArXiv (abs)PDF HTML

Papers citing "Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning"

7 / 7 papers shown

Title
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence Liang Xu Daoming Lyu Yangchen Pan Aiwen Jiang Bo Liu 95 0 0 24 Jan 2022
Schedule Based Temporal Difference Algorithms Rohan Deb Meet Gandhi S. Bhatnagar 28 0 0 23 Nov 2021
Gradient Temporal Difference with Momentum: Stability and Convergence Rohan Deb S. Bhatnagar 55 5 0 22 Nov 2021
The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning Vivek Borkar Shuhang Chen Adithya M. Devraj Ioannis Kontoyiannis Sean P. Meyn 79 32 0 27 Oct 2021
Zap Q-Learning With Nonlinear Function Approximation Shuhang Chen Adithya M. Devraj Fan Lu Ana Bušić Sean P. Meyn 67 20 0 11 Oct 2019
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples Tengyu Xu Shaofeng Zou Yingbin Liang 76 73 0 26 Sep 2019
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning Zaiwei Chen Sheng Zhang Thinh T. Doan John-Paul Clarke S. T. Maguluri 172 59 0 27 May 2019