arXiv:1909.03906
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
9 September 2019
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
Papers citing "Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning"
8 / 8 papers shown
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
44
0
0
17 Jan 2025
Tensor Low-rank Approximation of Finite-horizon Value Functions
Sergio Rozada
Antonio G. Marques
41
3
0
27 May 2024
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
31
0
0
22 Mar 2023
The Construction of Reality in an AI: A Review
J. W. Johnston
3DV
13
1
0
03 Feb 2023
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
Soumyajit Guin
S. Bhatnagar
27
8
0
10 Oct 2022
Chaining Value Functions for Off-Policy Learning
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
OffRL
23
2
0
17 Jan 2022
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids
V. Vivek
Dr. Shalabh Bhatnagar
16
6
0
27 Oct 2021
The Loss Surfaces of Multilayer Networks
A. Choromańska
Mikael Henaff
Michaël Mathieu
Gerard Ben Arous
Yann LeCun
ODL
183
1,185
0
30 Nov 2014