2201.08536
Cited By
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning
21 January 2022
K. Khamaru
Eric Xia
Martin J. Wainwright
Michael I. Jordan
Papers citing "Instance-Dependent Confidence and Early Stopping for Reinforcement Learning"
5 / 5 papers shown
Title
Enhancing Stochastic Optimization for Statistical Efficiency Using ROOT-SGD with Diminishing Stepsize
Tong Zhang
Chris Junchi Li
36
0
0
15 Jul 2024
A Framework for History-Aware Hyperparameter Optimisation in Reinforcement Learning
Juan Marcelo Parra Ullauri
Chen Zhen
A. García-Domínguez
Nelly Bencomo
Changgang Zheng
Juan Boubeta-Puig
Guadalupe Ortiz
Shufan Yang
OffRL
11
0
0
09 Mar 2023
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
Andrea Zanette
Martin J. Wainwright
OOD
31
5
0
01 Jun 2022
Optimal variance-reduced stochastic approximation in Banach spaces
Wenlong Mou
K. Khamaru
Martin J. Wainwright
Peter L. Bartlett
Michael I. Jordan
31
8
0
21 Jan 2022
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
Xiang Li
Wenhao Yang
Jiadong Liang
Zhihua Zhang
Michael I. Jordan
32
15
0
29 Dec 2021