2007.02786
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
6 July 2020
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon, Joelle Pineau
Papers citing "TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?"
3 / 3 papers shown
Title
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows, Matthew Smith, Shimon Whiteson
OOD
AAML
95
8
0
24 Feb 2023
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau, Doina Precup
64
10
0
07 Jun 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu, Tudor Berariu, Mihaela Rosca, Claudia Clopath, L. Buşoniu, Razvan Pascanu
86
56
0
11 May 2021