Finite-Time Analysis of Asynchronous Stochastic Approximation and $Q$-Learning

Annual Conference Computational Learning Theory (COLT), 2020

1 February 2020

Guannan Qu

Adam Wierman

Papers citing "Finite-Time Analysis of Asynchronous Stochastic Approximation and $Q$-Learning"

50 / 83 papers shown

Deep SOR Minimax Q-learning for Two-player Zero-sum Game

Saksham Gautam

Lakshmi Mandal

Shalabh Bhatnagar

20 Nov 2025

Towards Formalizing Reinforcement Learning Theory

Shangtong Zhang

155

05 Nov 2025

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies

Phalguni Nanda

Zaiwei Chen

171

17 Oct 2025

TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering

Vamsi K. Potluru

Manuela Veloso

AI4TS AIFin LRM

272

08 Oct 2025

Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning

Xinyu Liu

Zixuan Xie

Shangtong Zhang

160

30 Sep 2025

Central Limit Theorems for Asynchronous Averaged Q-Learning

Xingtu Liu

238

23 Sep 2025

Statistical and Algorithmic Foundations of Reinforcement Learning

275

19 Jul 2025

A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging

Sajad Khodadadian

Martin Zubeldia

340

27 May 2025

Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling

285

15 Apr 2025

Semi-Gradient SARSA Routing with Theoretical Guarantee on Traffic Stability and Weight Convergence

246

19 Mar 2025

Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications

Bar Light

282

02 Feb 2025

Robust Q-Learning under Corrupted Rewards

IEEE Conference on Decision and Control (CDC), 2024

Sreejeet Maity

Aritra Mitra

AAML

248

05 Sep 2024

Pausing Policy Learning in Non-stationary Reinforcement Learning

Hyunin Lee

Ming Jin

Javad Lavaei

Somayeh Sojoudi

OffRL

256

25 May 2024

Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise

Neural Information Processing Systems (NeurIPS), 2024

Sebastian Allmeier

Nicolas Gast

408

23 May 2024

A finite time analysis of distributed Q-learning

Han-Dong Lim

Donghwan Lee

OffRL

417

23 May 2024

Is Thompson Sampling Susceptible to Algorithmic Collusion?

Yi Xiong

Ningyuan Chen

343

23 May 2024

Yi Wan

266

16 May 2024

A Single Online Agent Can Efficiently Learn Mean Field Games

European Conference on Artificial Intelligence (ECAI), 2024

361

05 May 2024

Regularized Q-learning through Robust Averaging

International Conference on Machine Learning (ICML), 2024

Peter Schmitt-Förster

Tobias Sutter

OOD

270

03 May 2024

Compressed Federated Reinforcement Learning with a Generative Model

388

26 Mar 2024

Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach

IEEE Conference on Decision and Control (CDC), 2024

Narim Jeong

Donghwan Lee

205

11 Mar 2024

A Simple Finite-Time Analysis of TD Learning with Linear Function Approximation

Aritra Mitra

345

04 Mar 2024

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ

275

19 Feb 2024

Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling

George J. Pappas

429

19 Feb 2024

Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

297

08 Feb 2024

Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning

International Conference on Learning Representations (ICLR), 2024

Chenyu Zhang

Han Wang

Aritra Mitra

James Anderson

336

27 Jan 2024

Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation

Yixuan Zhang

Qiaomin Xie

347

25 Jan 2024

A Concentration Bound for TD(0) with Function Approximation

Siddharth Chandak

Vivek Borkar

559

16 Dec 2023

Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications

Journal of Optimization Theory and Applications (JOTA), 2023

Rajeeva Laxman Karandikar

M. Vidyasagar

491

05 Dec 2023

Suppressing Overestimation in Q-Learning through Adversarial Behaviors

HyeAnn Lee

Donghwan Lee

254

10 Oct 2023

Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation

Neural Information Processing Systems (NeurIPS), 2023

Efstathia Soufleri

Jian Li

285

03 Oct 2023

Online covariance estimation for stochastic gradient descent under Markovian sampling

Abhishek Roy

Krishnakumar Balasubramanian

371

03 Aug 2023

Robust Multi-Agent Reinforcement Learning with State Uncertainty

339

30 Jul 2023

Settling the Sample Complexity of Online Reinforcement Learning

Annual Conference Computational Learning Theory (COLT), 2023

869

25 Jul 2023

A Central Limit Theorem for Algorithmic Estimator of Saddle Point

Abhishek Roy

Yian Ma

417

09 Jun 2023

Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach

Dong-hwan Lee

294

09 Jun 2023

Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse

Information Sciences (Inf. Sci.), 2023

229

29 May 2023

The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond

International Conference on Machine Learning (ICML), 2023

402

18 May 2023

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

Zaiwei Chen

S. T. Maguluri

Martin Zubeldia

301

28 Mar 2023

Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games

Conference on Uncertainty in Artificial Intelligence (UAI), 2023

Zhaoyi Zhou

Zaiwei Chen

Yiheng Lin

Adam Wierman

361

08 Mar 2023

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Neural Information Processing Systems (NeurIPS), 2023

Zaiwei Chen

Jianchao Tan

Eric Mazumdar

Asuman Ozdaglar

Adam Wierman

381

03 Mar 2023

Statistical Inference with Stochastic Gradient Methods under $\phi$-mixing Data

413

24 Feb 2023

A Survey on Reinforcement Learning in Aviation Applications

Engineering Applications of Artificial Intelligence (EAAI), 2022

236

03 Nov 2022

Oracle-free Reinforcement Learning in Mean-Field Games along a Single Sample Path

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Muhammad Aneeq uz Zaman

Alec Koppel

Sujay Bhatt

Tamer Basar

396

24 Aug 2022

An Approximate Policy Iteration Viewpoint of Actor-Critic Algorithms

Zaiwei Chen

S. T. Maguluri

224

05 Aug 2022

Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View

IEEE Access, 2022

Han-Dong Lim

Dong-hwan Lee

154

25 Jul 2022

Constrained Stochastic Nonconvex Optimization with State-dependent Markov Data

Neural Information Processing Systems (NeurIPS), 2022

Abhishek Roy

Krishnakumar Balasubramanian

Saeed Ghadimi

412

22 Jun 2022

Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective

Dong-hwan Lee

Do Wan Kim

OffRL

418

22 Apr 2022

Data Sampling Affects the Complexity of Online SGD over Dependent Data

Conference on Uncertainty in Artificial Intelligence (UAI), 2022

334

31 Mar 2022

The Efficacy of Pessimism in Asynchronous Q-Learning

IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2022

392

14 Mar 2022
