ResearchTrend.AI
Online Off-policy Prediction

6 November 2018
Sina Ghiassian
Andrew Patterson
Martha White
R. Sutton
Adam White
    OffRL

Papers citing "Online Off-policy Prediction"

7 / 7 papers shown
Gradient Descent Temporal Difference-difference Learning
Rong Zhu
James M. Murray
OffRL
32
1
0
10 Sep 2022
Importance Sampling Placement in Off-Policy Temporal-Difference Methods
Eric Graves
Sina Ghiassian
OffRL
49
2
0
18 Mar 2022
Schedule Based Temporal Difference Algorithms
Rohan Deb
Meet Gandhi
S. Bhatnagar
11
0
0
23 Nov 2021
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
25
12
0
15 Aug 2020
Learning predictive representations in autonomous driving to improve deep reinforcement learning
D. Graves
Nhat M. Nguyen
Kimia Hassanzadeh
Jun Jin
SSL
29
12
0
26 Jun 2020
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
27
32
0
09 Sep 2019
Planning with Expectation Models
Yi Wan
Zaheer Abbas
Adam White
Martha White
R. Sutton
OffRL
26
23
0
02 Apr 2019