TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

6 July 2020
Joshua Romoff
Peter Henderson
David Kanaa
Emmanuel Bengio
Ahmed Touati
Pierre-Luc Bacon
Joelle Pineau
arXiv:2007.02786

Papers citing "TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?"

3 / 3 papers shown
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD, AAML
97
8
0
24 Feb 2023
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio
Joelle Pineau
Doina Precup
67
10
0
07 Jun 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
86
56
0
11 May 2021