ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.03906
  4. Cited By
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement
  Learning

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

9 September 2019
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
ArXivPDFHTML

Papers citing "Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning"

8 / 8 papers shown
Title
Solving Finite-Horizon MDPs via Low-Rank Tensors
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
44
0
0
17 Jan 2025
Tensor Low-rank Approximation of Finite-horizon Value Functions
Tensor Low-rank Approximation of Finite-horizon Value Functions
Sergio Rozada
Antonio G. Marques
41
3
0
27 May 2024
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and
  Global Optimality
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
31
0
0
22 Mar 2023
The Construction of Reality in an AI: A Review
The Construction of Reality in an AI: A Review
J. W. Johnston
3DV
13
1
0
03 Feb 2023
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
Soumyajit Guin
S. Bhatnagar
27
8
0
10 Oct 2022
Chaining Value Functions for Off-Policy Learning
Chaining Value Functions for Off-Policy Learning
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
OffRL
23
2
0
17 Jan 2022
Finite Horizon Q-learning: Stability, Convergence, Simulations and an
  application on Smart Grids
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids
V. Vivek
Dr.Shalabh Bhatnagar
16
6
0
27 Oct 2021
The Loss Surfaces of Multilayer Networks
The Loss Surfaces of Multilayer Networks
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
183
1,185
0
30 Nov 2014
1