Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima

24 May 2019

Papers citing "Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima"

5 / 5 papers shown

Title
Approximation to Deep Q-Network by Stochastic Delay Differential Equations Jianya Lu Yingjun Mo 33 0 0 01 May 2025
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks Zhifa Ke Zaiwen Wen Junyu Zhang 37 0 0 07 May 2024
Probabilistic Constrained Reinforcement Learning with Formal Interpretability Yanran Wang Qiuchen Qian David E. Boyle 16 4 0 13 Jul 2023
Towards a Better Understanding of Representation Dynamics under TD-learning Yunhao Tang Rémi Munos OffRL 26 1 0 29 May 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation Zhifa Ke Junyu Zhang Zaiwen Wen 24 0 0 25 Feb 2023