ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.01567
  4. Cited By
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous
  Q-Learning and TD-Learning Variants
v1v2v3v4 (latest)

A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants

2 February 2021
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
    OffRL
ArXiv (abs)PDFHTML

Papers citing "A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants"

38 / 38 papers shown
Central Limit Theorems for Asynchronous Averaged Q-Learning
Central Limit Theorems for Asynchronous Averaged Q-Learning
Xingtu Liu
241
1
0
23 Sep 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
278
3
0
19 Jul 2025
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Zixuan Xie
Xinyu Liu
Rohan Chandra
Shangtong Zhang
450
3
0
27 May 2025
Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning
Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning
Emile Anand
Ishani Karmarkar
Guannan Qu
735
8
0
01 Dec 2024
A finite time analysis of distributed Q-learning
A finite time analysis of distributed Q-learning
Han-Dong Lim
Donghwan Lee
OffRL
422
1
0
23 May 2024
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Yi Xiong
Ningyuan Chen
Yi Xiong
360
0
0
23 May 2024
Compressed Federated Reinforcement Learning with a Generative Model
Compressed Federated Reinforcement Learning with a Generative Model
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
396
5
0
26 Mar 2024
Constant Stepsize Q-learning: Distributional Convergence, Bias and
  Extrapolation
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
Yixuan Zhang
Qiaomin Xie
350
16
0
25 Jan 2024
A Concentration Bound for TD(0) with Function Approximation
A Concentration Bound for TD(0) with Function Approximation
Siddharth Chandak
Vivek Borkar
566
4
0
16 Dec 2023
Convergence Rates for Stochastic Approximation: Biased Noise with
  Unbounded Variance, and Applications
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and ApplicationsJournal of Optimization Theory and Applications (JOTA), 2023
Rajeeva Laxman Karandikar
M. Vidyasagar
501
23
0
05 Dec 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum
  Markov Games: Switching System Approach
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
297
4
0
09 Jun 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup
  and Beyond
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and BeyondInternational Conference on Machine Learning (ICML), 2023
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
402
34
0
18 May 2023
Concentration of Contractive Stochastic Approximation: Additive and
  Multiplicative Noise
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise
Zaiwei Chen
S. T. Maguluri
Martin Zubeldia
301
25
0
28 Mar 2023
Convergence Rates for Localized Actor-Critic in Networked Markov
  Potential Games
Convergence Rates for Localized Actor-Critic in Networked Markov Potential GamesConference on Uncertainty in Artificial Intelligence (UAI), 2023
Zhaoyi Zhou
Zaiwei Chen
Yiheng Lin
Adam Wierman
368
9
0
08 Mar 2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in
  Zero-Sum Stochastic Games
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic GamesNeural Information Processing Systems (NeurIPS), 2023
Zaiwei Chen
Jianchao Tan
Eric Mazumdar
Asuman Ozdaglar
Adam Wierman
383
16
0
03 Mar 2023
Bias and Extrapolation in Markovian Linear Stochastic Approximation with
  Constant Stepsizes
Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant StepsizesMeasurement and Modeling of Computer Systems (SIGMETRICS), 2022
D. Huo
Yudong Chen
Qiaomin Xie
321
23
0
03 Oct 2022
First-order Policy Optimization for Robust Markov Decision Process
First-order Policy Optimization for Robust Markov Decision Process
Yan Li
Guanghui Lan
Tuo Zhao
510
34
0
21 Sep 2022
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
Zaiwei Chen
S. T. Maguluri
224
2
0
05 Aug 2022
Finite-Time Analysis of Asynchronous Q-learning under Diminishing
  Step-Size from Control-Theoretic View
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic ViewIEEE Access (IEEE Access), 2022
Han-Dong Lim
Dong-hwan Lee
158
3
0
25 Jul 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
The Efficacy of Pessimism in Asynchronous Q-LearningIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
397
45
0
14 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in $Q$-Learning
Target Network and Truncation Overcome The Deadly Triad in QQQ-LearningSIAM Journal on Mathematics of Data Science (SIMODS), 2022
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
309
31
0
05 Mar 2022
On the Convergence of SARSA with Linear Function Approximation
On the Convergence of SARSA with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2022
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
281
18
0
14 Feb 2022
Stochastic Gradient Descent with Dependent Data for Offline
  Reinforcement Learning
Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning
Jing-rong Dong
Xin T. Tong
OffRL
277
2
0
06 Feb 2022
Optimal variance-reduced stochastic approximation in Banach spaces
Optimal variance-reduced stochastic approximation in Banach spaces
Wenlong Mou
K. Khamaru
Martin J. Wainwright
Peter L. Bartlett
Sai Li
292
11
0
21 Jan 2022
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
A Statistical Analysis of Polyak-Ruppert Averaged Q-learningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Xiang Li
Wenhao Yang
Jiadong Liang
Zhihua Zhang
Michael I. Jordan
457
26
0
29 Dec 2021
Accelerated and instance-optimal policy evaluation with linear function
  approximation
Accelerated and instance-optimal policy evaluation with linear function approximationSIAM Journal on Mathematics of Data Science (SIMODS), 2021
Tianjiao Li
Guanghui Lan
A. Pananjady
OffRL
259
18
0
24 Dec 2021
A Concentration Bound for LSPE($λ$)
A Concentration Bound for LSPE(λλλ)Social Science Research Network (SSRN), 2021
Siddharth Chandak
Vivek Borkar
H. Dolhare
416
0
0
04 Nov 2021
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution MismatchJournal of machine learning research (JMLR), 2021
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
519
18
0
04 Nov 2021
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free
  Reinforcement Learning
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
OffRL
402
66
0
09 Oct 2021
Convergence of Batch Asynchronous Stochastic Approximation With
  Applications to Reinforcement Learning
Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement LearningCommunications in Optimization Theory (COT), 2021
Rajeeva Laxman Karandikar
M. Vidyasagar
309
0
0
08 Sep 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
504
40
0
08 Aug 2021
Concentration of Contractive Stochastic Approximation and Reinforcement
  Learning
Concentration of Contractive Stochastic Approximation and Reinforcement LearningStochastic Systems (SS), 2021
Siddharth Chandak
Vivek Borkar
Parth Dodhia
353
30
0
27 Jun 2021
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman
  Operators
Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
209
20
0
24 Jun 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear
  Function Approximation
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function ApproximationIEEE Control Systems Letters (L-CSS), 2021
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
301
33
0
26 May 2021
On the Linear convergence of Natural Policy Gradient Algorithm
On the Linear convergence of Natural Policy Gradient AlgorithmIEEE Conference on Decision and Control (CDC), 2021
S. Khodadadian
P. Jhunjhunwala
Sushil Mahavir Varma
S. T. Maguluri
369
66
0
04 May 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
Finite-Sample Analysis of Off-Policy Natural Actor-Critic AlgorithmInternational Conference on Machine Learning (ICML), 2021
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CMLOffRL
369
33
0
18 Feb 2021
A Discrete-Time Switching System Analysis of Q-learning
A Discrete-Time Switching System Analysis of Q-learningSIAM Journal of Control and Optimization (SICON), 2021
Donghwan Lee
Jianghai Hu
Niao He
535
20
0
17 Feb 2021
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Is Q-Learning Minimax Optimal? A Tight Sample Complexity AnalysisOperational Research (OR), 2021
Gen Li
Changxiao Cai
Ee
Yuting Wei
Yuejie Chi
OffRL
437
96
0
12 Feb 2021
1
Page 1 of 1