Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.10027
Cited By
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima
24 May 2019
Qi Cai
Zhuoran Yang
Jason D. Lee
Zhaoran Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima"
5 / 5 papers shown
Title
Approximation to Deep Q-Network by Stochastic Delay Differential Equations
Jianya Lu
Yingjun Mo
33
0
0
01 May 2025
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
37
0
0
07 May 2024
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
16
4
0
13 Jul 2023
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
26
1
0
29 May 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
24
0
0
25 Feb 2023
1