Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning

IEEE Transactions on Automatic Control (TAC), 2020

24 February 2020

Adithya M. Devraj

Sean P. Meyn

Papers citing "Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning"

11 / 11 papers shown

Deflated Dynamics Value Iteration

Jongmin Lee

Amin Rakhsha

Ernest K. Ryu

Amir-massoud Farahmand

322

15 Jul 2024

Regularized Q-Learning with Linear Function Approximation

IEEE Transactions on Automatic Control (TAC), 2024

Jiachen Xi

Alfredo Garcia

P. Momcilovic

548

26 Jan 2024

Convex Q Learning in a Stochastic Environment: Extended Version

IEEE Conference on Decision and Control (CDC), 2023

F. Lu

Sean P. Meyn

262

10 Sep 2023

Stability of Q-Learning Through Design and Optimism

Sean P. Meyn

305

05 Jul 2023

Efficiency Ordering of Stochastic Gradient Descent

Neural Information Processing Systems (NeurIPS), 2022

Jie Hu

Vishwaraj Doshi

Do Young Eun

248

15 Sep 2022

Examining average and discounted reward optimality criteria in reinforcement learning

Vektor Dewanto

M. Gallagher

OffRL

284

03 Jul 2021

Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis

Operational Research (OR), 2021

431

12 Feb 2021

The Mean-Squared Error of Double Q-Learning

Neural Information Processing Systems (NeurIPS), 2020

296

09 Jul 2020

Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction

568

133

04 Jun 2020

Zap Q-Learning With Nonlinear Function Approximation

Neural Information Processing Systems (NeurIPS), 2019

252

11 Oct 2019

Differential Temporal Difference Learning

Adithya M. Devraj

Ioannis Kontoyiannis

Sean P. Meyn

156

28 Dec 2018