v1v2v3 (latest)

Finite-Sample Analysis for SARSA with Linear Function Approximation

6 February 2019

Papers citing "Finite-Sample Analysis for SARSA with Linear Function Approximation"

50 / 101 papers shown

Towards Formalizing Reinforcement Learning Theory

Shangtong Zhang

114

05 Nov 2025

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies

Phalguni Nanda

Zaiwei Chen

131

17 Oct 2025

Non-iid hypothesis testing: from classical to quantum

07 Oct 2025

Generalized Fitted Q-Iteration with Clustered Data

144

04 Oct 2025

Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning

Xinyu Liu

Zixuan Xie

Shangtong Zhang

30 Sep 2025

Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis

Sihan Zeng

Benjamin Patrick Evans

111

18 Sep 2025

Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features

371

27 May 2025

Natural Policy Gradient for Average Reward Non-Stationary RL

268

23 Apr 2025

A Hybrid Reinforcement Learning Framework for Hard Latency Constrained Resource SchedulingIEEE Internet of Things Journal (IEEE IoT J.), 2025

Luyuan Zhang

An Liu

Kexuan Wang

116

30 Mar 2025

Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global OptimalityInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

303

22 Mar 2025

Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative ModelInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

Zilong Deng

Simon Khan

Shaofeng Zou

499

11 Mar 2025

Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function ApproximationInternational Conference on Learning Representations (ICLR), 2024

Chenyu Zhang

Xu Chen

Xuan Di

371

17 Feb 2025

Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation

Xiping Hu

363

13 Aug 2024

Finite-Time Analysis of Simultaneous Double Q-learning

Hyunjun Na

Donghwan Lee

157

14 Jun 2024

SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Shuai Zhang

Heshan Devaka Fernando

240

24 May 2024

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2024

Sihan Zeng

Thinh T. Doan

362

15 May 2024

Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm

303

08 May 2024

An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks

Zhifa Ke

Zaiwen Wen

Junyu Zhang

251

07 May 2024

A Single Online Agent Can Efficiently Learn Mean Field GamesEuropean Conference on Artificial Intelligence (ECAI), 2024

319

05 May 2024

Enhancing Classification Performance via Reinforcement Learning for Feature Selection

Younes Ghazagh Jahed

Seyyed Ali Sadat Tavana

178

09 Mar 2024

Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024

Chenyu Zhang

Han Wang

Aritra Mitra

James Anderson

277

27 Jan 2024

Neural Network Approximation for Pessimistic Offline Reinforcement Learning

Yuling Jiao

268

19 Dec 2023

Lifting the Veil: Unlocking the Power of Depth in Q-learning

210

27 Oct 2023

On the Convergence and Sample Complexity Analysis of Deep Q-Networks with

ε

-Greedy ExplorationNeural Information Processing Systems (NeurIPS), 2023

Shuai Zhang

331

24 Oct 2023

Suppressing Overestimation in Q-Learning through Adversarial Behaviors

HyeAnn Lee

Donghwan Lee

182

10 Oct 2023

Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function ApproximationNeural Information Processing Systems (NeurIPS), 2023

Efstathia Soufleri

Jian Li

229

03 Oct 2023

TD Convergence: An Optimization PerspectiveNeural Information Processing Systems (NeurIPS), 2023

261

30 Jun 2023

Warm-Start Actor-Critic: From Approximation Error to Sub-optimality GapInternational Conference on Machine Learning (ICML), 2023

221

20 Jun 2023

A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence

Kexuan Wang

An Liu

Baishuo Liu

163

10 Jun 2023

A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic GamesNeural Information Processing Systems (NeurIPS), 2023

Zaiwei Chen

Jianchao Tan

Eric Mazumdar

Asuman Ozdaglar

Adam Wierman

350

03 Mar 2023

Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation

Zhifa Ke

Junyu Zhang

Zaiwen Wen

170

25 Feb 2023

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-CriticInternational Conference on Machine Learning (ICML), 2023

275

28 Jan 2023

A Policy Optimization Method Towards Optimal-time StabilityConference on Robot Learning (CoRL), 2023

241

02 Jan 2023

Offline Reinforcement Learning with Closed-Form Policy Improvement OperatorsInternational Conference on Machine Learning (ICML), 2022

Ming Yin

249

29 Nov 2022

Finite-time analysis of single-timescale actor-criticNeural Information Processing Systems (NeurIPS), 2022

Xu-yang Chen

Tianyuan Chen

OffRL

367

18 Oct 2022

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time GuaranteesNeural Information Processing Systems (NeurIPS), 2022

363

04 Oct 2022

Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time GuaranteesOperational Research (OR), 2022

269

04 Oct 2022

Finite-Time Error Bounds for Greedy-GQMachine-mediated learning (ML), 2022

Yue Wang

Yi Zhou

Shaofeng Zou

330

06 Sep 2022

Robust Knowledge Adaptation for Dynamic Graph Neural NetworksIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022

223

22 Jul 2022

q-Learning in Continuous TimeJournal of machine learning research (JMLR), 2022

Yanwei Jia

X. Zhou

OffRL

476

02 Jul 2022

Analysis of Stochastic Processes through Replay BuffersInternational Conference on Machine Learning (ICML), 2022

Shirli Di-Castro Shashua

Shie Mannor

Dotan Di-Castro

174

26 Jun 2022

A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled SequencesNeural Information Processing Systems (NeurIPS), 2022

Han Shen

Tianyi Chen

236

21 Jun 2022

Algorithm for Constrained Markov Decision Process with Linear ConvergenceInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

E. Gladin

Maksim Lavrik-Karmazin

K. Zainullina

Varvara Rudenko

Alexander V. Gasnikov

Martin Takáč

250

03 Jun 2022

Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective

Dong-hwan Lee

Do Wan Kim

OffRL

264

22 Apr 2022

Data Sampling Affects the Complexity of Online SGD over Dependent DataConference on Uncertainty in Artificial Intelligence (UAI), 2022

221

31 Mar 2022

Target Network and Truncation Overcome The Deadly Triad in

Q

-LearningSIAM Journal on Mathematics of Data Science (SIMODS), 2022

Zaiwei Chen

John-Paul Clarke

S. T. Maguluri

207

05 Mar 2022

Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite HorizonsJournal of the American Statistical Association (JASA), 2022

230

26 Feb 2022

A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided MarketsAnnals of Applied Statistics (AOAS), 2022

298

21 Feb 2022

Stochastic linear optimization never overfits with quadratically-bounded losses on general dataAnnual Conference Computational Learning Theory (COLT), 2022

Matus Telgarsky

245

14 Feb 2022

On the Convergence of SARSA with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2022

Shangtong Zhang

Rémi Tachet des Combes

Romain Laroche

217

14 Feb 2022