Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

21 January 2022
K. Khamaru
Eric Xia
Martin J. Wainwright
Michael I. Jordan

Papers citing "Instance-Dependent Confidence and Early Stopping for Reinforcement Learning"

5 / 5 papers shown
Enhancing Stochastic Optimization for Statistical Efficiency Using ROOT-SGD with Diminishing Stepsize
Tong Zhang
Chris Junchi Li
36
0
0
15 Jul 2024
A Framework for History-Aware Hyperparameter Optimisation in Reinforcement Learning
Juan Marcelo Parra Ullauri
Chen Zhen
A. García-Domínguez
Nelly Bencomo
Changgang Zheng
Juan Boubeta-Puig
Guadalupe Ortiz
Shufan Yang
OffRL
11
0
0
09 Mar 2023
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
Andrea Zanette
Martin J. Wainwright
OOD
31
5
0
01 Jun 2022
Optimal variance-reduced stochastic approximation in Banach spaces
Wenlong Mou
K. Khamaru
Martin J. Wainwright
Peter L. Bartlett
Michael I. Jordan
31
8
0
21 Jan 2022
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
Xiang Li
Wenhao Yang
Jiadong Liang
Zhihua Zhang
Michael I. Jordan
32
15
0
29 Dec 2021