ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.09127
  4. Cited By
High-confidence error estimates for learned value functions

High-confidence error estimates for learned value functions

28 August 2018
Touqir Sajed
Wesley Chung
Martha White
    OffRL
ArXivPDFHTML

Papers citing "High-confidence error estimates for learned value functions"

9 / 9 papers shown
Title
Mixing time estimation in reversible Markov chains from a single sample
  path
Mixing time estimation in reversible Markov chains from a single sample path
Daniel J. Hsu
A. Kontorovich
D. A. Levin
Yuval Peres
Csaba Szepesvári
47
82
0
24 Aug 2017
Learning Sparse Representations in Reinforcement Learning with Sparse
  Coding
Learning Sparse Representations in Reinforcement Learning with Sparse Coding
Lei Le
Raksha Kumaraswamy
Martha White
OffRL
SSL
49
25
0
26 Jul 2017
Stochastic Variance Reduction Methods for Policy Evaluation
Stochastic Variance Reduction Methods for Policy Evaluation
S. Du
Jianshu Chen
Lihong Li
Lin Xiao
Dengyong Zhou
OffRL
34
156
0
25 Feb 2017
Accelerated Gradient Temporal Difference Learning
Accelerated Gradient Temporal Difference Learning
Yangchen Pan
Adam White
Martha White
31
27
0
28 Nov 2016
Investigating practical linear temporal difference learning
Investigating practical linear temporal difference learning
Adam White
Martha White
OffRL
40
41
0
28 Feb 2016
Incremental Truncated LSTD
Incremental Truncated LSTD
Clement Gehring
Yangchen Pan
Martha White
33
10
0
26 Nov 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference
  Learning
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning
R. Sutton
A. R. Mahmood
Martha White
72
269
0
14 Mar 2015
Off-policy Learning with Eligibility Traces: A Survey
Off-policy Learning with Eligibility Traces: A Survey
Matthieu Geist
B. Scherrer
OffRL
77
94
0
15 Apr 2013
Dyna-Style Planning with Linear Function Approximation and Prioritized
  Sweeping
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
R. Sutton
Csaba Szepesvári
A. Geramifard
Michael Bowling
OffRL
65
203
0
13 Jun 2012
1