1811.02597
Cited By
Online Off-policy Prediction
6 November 2018
Sina Ghiassian
Andrew Patterson
Martha White
R. Sutton
Adam White
OffRL
Papers citing "Online Off-policy Prediction"
7 / 7 papers shown
Title
Gradient Descent Temporal Difference-difference Learning
Rong Zhu
James M. Murray
OffRL
32
1
0
10 Sep 2022
Importance Sampling Placement in Off-Policy Temporal-Difference Methods
Eric Graves
Sina Ghiassian
OffRL
49
2
0
18 Mar 2022
Schedule Based Temporal Difference Algorithms
Rohan Deb
Meet Gandhi
S. Bhatnagar
11
0
0
23 Nov 2021
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
25
12
0
15 Aug 2020
Learning predictive representations in autonomous driving to improve deep reinforcement learning
D. Graves
Nhat M. Nguyen
Kimia Hassanzadeh
Jun Jin
SSL
29
12
0
26 Jun 2020
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
27
32
0
09 Sep 2019
Planning with Expectation Models
Yi Wan
M. Zaheer
Adam White
Martha White
R. Sutton
OffRL
26
23
0
02 Apr 2019