ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.00260
  4. Cited By
Finite-Time Analysis of Asynchronous Stochastic Approximation and
  $Q$-Learning

Finite-Time Analysis of Asynchronous Stochastic Approximation and QQQ-Learning

Annual Conference Computational Learning Theory (COLT), 2020
1 February 2020
Guannan Qu
Adam Wierman
ArXiv (abs)PDFHTML

Papers citing "Finite-Time Analysis of Asynchronous Stochastic Approximation and $Q$-Learning"

50 / 83 papers shown
Deep SOR Minimax Q-learning for Two-player Zero-sum Game
Deep SOR Minimax Q-learning for Two-player Zero-sum Game
Saksham Gautam
Lakshmi Mandal
Shalabh Bhatnagar
81
0
0
20 Nov 2025
Towards Formalizing Reinforcement Learning Theory
Towards Formalizing Reinforcement Learning Theory
Shangtong Zhang
155
3
0
05 Nov 2025
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
Phalguni Nanda
Zaiwei Chen
171
2
0
17 Oct 2025
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
Penghang Liu
Elizabeth Fons
Svitlana Vyetrenko
Daniel Borrajo
Vamsi K. Potluru
Manuela Veloso
Vamsi K. Potluru
Manuela Veloso
AI4TSAIFinLRM
272
2
0
08 Oct 2025
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
Xinyu Liu
Zixuan Xie
Shangtong Zhang
160
5
0
30 Sep 2025
Central Limit Theorems for Asynchronous Averaged Q-Learning
Central Limit Theorems for Asynchronous Averaged Q-Learning
Xingtu Liu
238
0
0
23 Sep 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
275
2
0
19 Jul 2025
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging
Sajad Khodadadian
Martin Zubeldia
340
2
0
27 May 2025
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling
Feng Zhu
Aritra Mitra
Robert W. Heath
FedML
285
1
0
15 Apr 2025
Semi-Gradient SARSA Routing with Theoretical Guarantee on Traffic Stability and Weight Convergence
Semi-Gradient SARSA Routing with Theoretical Guarantee on Traffic Stability and Weight Convergence
Yidan Wu
Yu Yu
Jianan Zhang
Li Jin
246
0
0
19 Mar 2025
Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications
Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications
Bar Light
282
0
0
02 Feb 2025
Robust Q-Learning under Corrupted Rewards
Robust Q-Learning under Corrupted RewardsIEEE Conference on Decision and Control (CDC), 2024
Sreejeet Maity
Aritra Mitra
AAML
248
0
0
05 Sep 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
256
3
0
25 May 2024
Computing the Bias of Constant-step Stochastic Approximation with
  Markovian Noise
Computing the Bias of Constant-step Stochastic Approximation with Markovian NoiseNeural Information Processing Systems (NeurIPS), 2024
Sebastian Allmeier
Nicolas Gast
408
8
0
23 May 2024
A finite time analysis of distributed Q-learning
A finite time analysis of distributed Q-learning
Han-Dong Lim
Donghwan Lee
OffRL
417
1
0
23 May 2024
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Yi Xiong
Ningyuan Chen
Yi Xiong
343
0
0
23 May 2024
Reward Centering
Reward Centering
Abhishek Naik
Yi Wan
Manan Tomar
Richard S. Sutton
266
20
0
16 May 2024
A Single Online Agent Can Efficiently Learn Mean Field Games
A Single Online Agent Can Efficiently Learn Mean Field GamesEuropean Conference on Artificial Intelligence (ECAI), 2024
Chenyu Zhang
Xu Chen
Xuan Di
OffRL
361
2
0
05 May 2024
Regularized Q-learning through Robust Averaging
Regularized Q-learning through Robust AveragingInternational Conference on Machine Learning (ICML), 2024
Peter Schmitt-Förster
Tobias Sutter
OOD
270
0
0
03 May 2024
Compressed Federated Reinforcement Learning with a Generative Model
Compressed Federated Reinforcement Learning with a Generative Model
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
388
5
0
26 Mar 2024
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
Finite-Time Error Analysis of Soft Q-Learning: Switching System ApproachIEEE Conference on Decision and Control (CDC), 2024
Narim Jeong
Donghwan Lee
205
2
0
11 Mar 2024
A Simple Finite-Time Analysis of TD Learning with Linear Function
  Approximation
A Simple Finite-Time Analysis of TD Learning with Linear Function Approximation
Aritra Mitra
345
11
0
04 Mar 2024
Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
Han-Dong Lim
HyeAnn Lee
Donghwan Lee
OffRLOnRL
275
0
0
19 Feb 2024
Stochastic Approximation with Delayed Updates: Finite-Time Rates under
  Markovian Sampling
Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling
Arman Adibi
Nicolò Dal Fabbro
Luca Schenato
Sanjeev R. Kulkarni
H. Vincent Poor
George J. Pappas
Hamed Hassani
A. Mitra
429
9
0
19 Feb 2024
Federated Offline Reinforcement Learning: Collaborative Single-Policy
  Coverage Suffices
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
Jiin Woo
Laixi Shi
Gauri Joshi
Yuejie Chi
OffRL
297
9
0
08 Feb 2024
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement
  Learning
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Chenyu Zhang
Han Wang
Aritra Mitra
James Anderson
336
32
0
27 Jan 2024
Constant Stepsize Q-learning: Distributional Convergence, Bias and
  Extrapolation
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
Yixuan Zhang
Qiaomin Xie
347
14
0
25 Jan 2024
A Concentration Bound for TD(0) with Function Approximation
A Concentration Bound for TD(0) with Function Approximation
Siddharth Chandak
Vivek Borkar
559
4
0
16 Dec 2023
Convergence Rates for Stochastic Approximation: Biased Noise with
  Unbounded Variance, and Applications
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and ApplicationsJournal of Optimization Theory and Applications (JOTA), 2023
Rajeeva Laxman Karandikar
M. Vidyasagar
491
23
0
05 Dec 2023
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
HyeAnn Lee
Donghwan Lee
254
2
0
10 Oct 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function ApproximationNeural Information Processing Systems (NeurIPS), 2023
Efstathia Soufleri
Jian Li
285
18
0
03 Oct 2023
Online covariance estimation for stochastic gradient descent under
  Markovian sampling
Online covariance estimation for stochastic gradient descent under Markovian sampling
Abhishek Roy
Krishnakumar Balasubramanian
371
7
0
03 Aug 2023
Robust Multi-Agent Reinforcement Learning with State Uncertainty
Robust Multi-Agent Reinforcement Learning with State Uncertainty
Sihong He
Songyang Han
Sanbao Su
Shuo Han
Shaofeng Zou
Fei Miao
OOD
339
66
0
30 Jul 2023
Settling the Sample Complexity of Online Reinforcement Learning
Settling the Sample Complexity of Online Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2023
Zihan Zhang
Yuxin Chen
Jason D. Lee
S. Du
OffRL
869
42
0
25 Jul 2023
A Central Limit Theorem for Algorithmic Estimator of Saddle Point
A Central Limit Theorem for Algorithmic Estimator of Saddle Point
Abhishek Roy
Yian Ma
417
1
0
09 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum
  Markov Games: Switching System Approach
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
294
3
0
09 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseInformation Sciences (Inf. Sci.), 2023
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
229
19
0
29 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup
  and Beyond
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and BeyondInternational Conference on Machine Learning (ICML), 2023
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
402
34
0
18 May 2023
Concentration of Contractive Stochastic Approximation: Additive and
  Multiplicative Noise
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise
Zaiwei Chen
S. T. Maguluri
Martin Zubeldia
301
23
0
28 Mar 2023
Convergence Rates for Localized Actor-Critic in Networked Markov
  Potential Games
Convergence Rates for Localized Actor-Critic in Networked Markov Potential GamesConference on Uncertainty in Artificial Intelligence (UAI), 2023
Zhaoyi Zhou
Zaiwei Chen
Yiheng Lin
Adam Wierman
361
9
0
08 Mar 2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in
  Zero-Sum Stochastic Games
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic GamesNeural Information Processing Systems (NeurIPS), 2023
Zaiwei Chen
Jianchao Tan
Eric Mazumdar
Asuman Ozdaglar
Adam Wierman
381
16
0
03 Mar 2023
Statistical Inference with Stochastic Gradient Methods under $ϕ$-mixing Data
Statistical Inference with Stochastic Gradient Methods under ϕϕϕ-mixing Data
Ruiqi Liu
Xinyu Chen
Zuofeng Shang
FedML
413
7
0
24 Feb 2023
A Survey on Reinforcement Learning in Aviation Applications
A Survey on Reinforcement Learning in Aviation ApplicationsEngineering applications of artificial intelligence (EAAI), 2022
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
236
82
0
03 Nov 2022
Oracle-free Reinforcement Learning in Mean-Field Games along a Single
  Sample Path
Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample PathInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Muhammad Aneeq uz Zaman
Alec Koppel
Sujay Bhatt
Tamer Basar
396
31
0
24 Aug 2022
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
Zaiwei Chen
S. T. Maguluri
224
2
0
05 Aug 2022
Finite-Time Analysis of Asynchronous Q-learning under Diminishing
  Step-Size from Control-Theoretic View
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic ViewIEEE Access (IEEE Access), 2022
Han-Dong Lim
Dong-hwan Lee
154
3
0
25 Jul 2022
Constrained Stochastic Nonconvex Optimization with State-dependent
  Markov Data
Constrained Stochastic Nonconvex Optimization with State-dependent Markov DataNeural Information Processing Systems (NeurIPS), 2022
Abhishek Roy
Krishnakumar Balasubramanian
Saeed Ghadimi
412
10
0
22 Jun 2022
Finite-Time Analysis of Temporal Difference Learning: Discrete-Time
  Linear System Perspective
Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective
Dong-hwan Lee
Do Wan Kim
OffRL
418
0
0
22 Apr 2022
Data Sampling Affects the Complexity of Online SGD over Dependent Data
Data Sampling Affects the Complexity of Online SGD over Dependent DataConference on Uncertainty in Artificial Intelligence (UAI), 2022
Shaocong Ma
Ziyi Chen
Yi Zhou
Kaiyi Ji
Yingbin Liang
334
6
0
31 Mar 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
The Efficacy of Pessimism in Asynchronous Q-LearningIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
392
45
0
14 Mar 2022
12
Next
Page 1 of 2