2002.00260
Cited By
Finite-Time Analysis of Asynchronous Stochastic Approximation and $Q$-Learning
Annual Conference on Computational Learning Theory (COLT), 2020
1 February 2020
Guannan Qu
Adam Wierman
Papers citing
"Finite-Time Analysis of Asynchronous Stochastic Approximation and $Q$-Learning"
50 / 83 papers shown
Deep SOR Minimax Q-learning for Two-player Zero-sum Game
Saksham Gautam
Lakshmi Mandal
Shalabh Bhatnagar
81
0
0
20 Nov 2025
Towards Formalizing Reinforcement Learning Theory
Shangtong Zhang
155
3
0
05 Nov 2025
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
Phalguni Nanda
Zaiwei Chen
171
2
0
17 Oct 2025
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
Penghang Liu
Elizabeth Fons
Svitlana Vyetrenko
Daniel Borrajo
Vamsi K. Potluru
Manuela Veloso
AI4TS
AIFin
LRM
272
2
0
08 Oct 2025
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
Xinyu Liu
Zixuan Xie
Shangtong Zhang
160
5
0
30 Sep 2025
Central Limit Theorems for Asynchronous Averaged Q-Learning
Xingtu Liu
238
0
0
23 Sep 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
275
2
0
19 Jul 2025
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging
Sajad Khodadadian
Martin Zubeldia
340
2
0
27 May 2025
Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling
Feng Zhu
Aritra Mitra
Robert W. Heath
FedML
285
1
0
15 Apr 2025
Semi-Gradient SARSA Routing with Theoretical Guarantee on Traffic Stability and Weight Convergence
Yidan Wu
Yu Yu
Jianan Zhang
Li Jin
246
0
0
19 Mar 2025
Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications
Bar Light
282
0
0
02 Feb 2025
Robust Q-Learning under Corrupted Rewards
IEEE Conference on Decision and Control (CDC), 2024
Sreejeet Maity
Aritra Mitra
AAML
248
0
0
05 Sep 2024
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
256
3
0
25 May 2024
Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise
Neural Information Processing Systems (NeurIPS), 2024
Sebastian Allmeier
Nicolas Gast
408
8
0
23 May 2024
A finite time analysis of distributed Q-learning
Han-Dong Lim
Donghwan Lee
OffRL
417
1
0
23 May 2024
Is Thompson Sampling Susceptible to Algorithmic Collusion?
Yi Xiong
Ningyuan Chen
343
0
0
23 May 2024
Reward Centering
Abhishek Naik
Yi Wan
Manan Tomar
Richard S. Sutton
266
20
0
16 May 2024
A Single Online Agent Can Efficiently Learn Mean Field Games
European Conference on Artificial Intelligence (ECAI), 2024
Chenyu Zhang
Xu Chen
Xuan Di
OffRL
361
2
0
05 May 2024
Regularized Q-learning through Robust Averaging
International Conference on Machine Learning (ICML), 2024
Peter Schmitt-Förster
Tobias Sutter
OOD
270
0
0
03 May 2024
Compressed Federated Reinforcement Learning with a Generative Model
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
388
5
0
26 Mar 2024
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
IEEE Conference on Decision and Control (CDC), 2024
Narim Jeong
Donghwan Lee
205
2
0
11 Mar 2024
A Simple Finite-Time Analysis of TD Learning with Linear Function Approximation
Aritra Mitra
345
11
0
04 Mar 2024
Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
Han-Dong Lim
HyeAnn Lee
Donghwan Lee
OffRL
OnRL
275
0
0
19 Feb 2024
Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling
Arman Adibi
Nicolò Dal Fabbro
Luca Schenato
Sanjeev R. Kulkarni
H. Vincent Poor
George J. Pappas
Hamed Hassani
A. Mitra
429
9
0
19 Feb 2024
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
Jiin Woo
Laixi Shi
Gauri Joshi
Yuejie Chi
OffRL
297
9
0
08 Feb 2024
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
International Conference on Learning Representations (ICLR), 2024
Chenyu Zhang
Han Wang
Aritra Mitra
James Anderson
336
32
0
27 Jan 2024
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
Yixuan Zhang
Qiaomin Xie
347
14
0
25 Jan 2024
A Concentration Bound for TD(0) with Function Approximation
Siddharth Chandak
Vivek Borkar
559
4
0
16 Dec 2023
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications
Journal of Optimization Theory and Applications (JOTA), 2023
Rajeeva Laxman Karandikar
M. Vidyasagar
491
23
0
05 Dec 2023
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
HyeAnn Lee
Donghwan Lee
254
2
0
10 Oct 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Efstathia Soufleri
Jian Li
285
18
0
03 Oct 2023
Online covariance estimation for stochastic gradient descent under Markovian sampling
Abhishek Roy
Krishnakumar Balasubramanian
371
7
0
03 Aug 2023
Robust Multi-Agent Reinforcement Learning with State Uncertainty
Sihong He
Songyang Han
Sanbao Su
Shuo Han
Shaofeng Zou
Fei Miao
OOD
339
66
0
30 Jul 2023
Settling the Sample Complexity of Online Reinforcement Learning
Annual Conference on Computational Learning Theory (COLT), 2023
Zihan Zhang
Yuxin Chen
Jason D. Lee
S. Du
OffRL
869
42
0
25 Jul 2023
A Central Limit Theorem for Algorithmic Estimator of Saddle Point
Abhishek Roy
Yian Ma
417
1
0
09 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
294
3
0
09 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Information Sciences (Inf. Sci.), 2023
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
229
19
0
29 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
International Conference on Machine Learning (ICML), 2023
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
402
34
0
18 May 2023
Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise
Zaiwei Chen
S. T. Maguluri
Martin Zubeldia
301
23
0
28 Mar 2023
Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Zhaoyi Zhou
Zaiwei Chen
Yiheng Lin
Adam Wierman
361
9
0
08 Mar 2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
Neural Information Processing Systems (NeurIPS), 2023
Zaiwei Chen
Jianchao Tan
Eric Mazumdar
Asuman Ozdaglar
Adam Wierman
381
16
0
03 Mar 2023
Statistical Inference with Stochastic Gradient Methods under $\phi$-mixing Data
Ruiqi Liu
Xinyu Chen
Zuofeng Shang
FedML
413
7
0
24 Feb 2023
A Survey on Reinforcement Learning in Aviation Applications
Engineering Applications of Artificial Intelligence (EAAI), 2022
Pouria Razzaghi
Amin Tabrizian
Wei Guo
Shulu Chen
Abenezer Taye
Ellis E. Thompson
Alexis Bregeon
Ali Baheri
Peng Wei
OffRL
236
82
0
03 Nov 2022
Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Muhammad Aneeq uz Zaman
Alec Koppel
Sujay Bhatt
Tamer Basar
396
31
0
24 Aug 2022
An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms
Zaiwei Chen
S. T. Maguluri
224
2
0
05 Aug 2022
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View
IEEE Access (IEEE Access), 2022
Han-Dong Lim
Dong-hwan Lee
154
3
0
25 Jul 2022
Constrained Stochastic Nonconvex Optimization with State-dependent Markov Data
Neural Information Processing Systems (NeurIPS), 2022
Abhishek Roy
Krishnakumar Balasubramanian
Saeed Ghadimi
412
10
0
22 Jun 2022
Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective
Dong-hwan Lee
Do Wan Kim
OffRL
418
0
0
22 Apr 2022
Data Sampling Affects the Complexity of Online SGD over Dependent Data
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Shaocong Ma
Ziyi Chen
Yi Zhou
Kaiyi Ji
Yingbin Liang
334
6
0
31 Mar 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
392
45
0
14 Mar 2022