High-confidence error estimates for learned value functions

28 August 2018

Papers citing "High-confidence error estimates for learned value functions"

9 / 9 papers shown

Title
Mixing time estimation in reversible Markov chains from a single sample path Daniel J. Hsu A. Kontorovich D. A. Levin Yuval Peres Csaba Szepesvári 47 82 0 24 Aug 2017
Learning Sparse Representations in Reinforcement Learning with Sparse Coding Lei Le Raksha Kumaraswamy Martha White OffRL SSL 49 25 0 26 Jul 2017
Stochastic Variance Reduction Methods for Policy Evaluation S. Du Jianshu Chen Lihong Li Lin Xiao Dengyong Zhou OffRL 34 156 0 25 Feb 2017
Accelerated Gradient Temporal Difference Learning Yangchen Pan Adam White Martha White 31 27 0 28 Nov 2016
Investigating practical linear temporal difference learning Adam White Martha White OffRL 40 41 0 28 Feb 2016
Incremental Truncated LSTD Clement Gehring Yangchen Pan Martha White 33 10 0 26 Nov 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning R. Sutton A. R. Mahmood Martha White 72 269 0 14 Mar 2015
Off-policy Learning with Eligibility Traces: A Survey Matthieu Geist B. Scherrer OffRL 77 94 0 15 Apr 2013
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping R. Sutton Csaba Szepesvári A. Geramifard Michael Bowling OffRL 65 203 0 13 Jun 2012