ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.02234
  4. Cited By
Finite-Sample Analysis for SARSA with Linear Function Approximation
v1v2v3 (latest)

Finite-Sample Analysis for SARSA with Linear Function Approximation

6 February 2019
Shaofeng Zou
Tengyu Xu
Yingbin Liang
ArXiv (abs)PDFHTML

Papers citing "Finite-Sample Analysis for SARSA with Linear Function Approximation"

50 / 101 papers shown
Title
Towards Formalizing Reinforcement Learning Theory
Towards Formalizing Reinforcement Learning Theory
Shangtong Zhang
90
0
0
05 Nov 2025
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
Phalguni Nanda
Zaiwei Chen
110
1
0
17 Oct 2025
Non-iid hypothesis testing: from classical to quantum
Non-iid hypothesis testing: from classical to quantum
Giacomo De Palma
Marco Fanizza
Connor Mowry
Ryan O'Donnell
80
0
0
07 Oct 2025
Generalized Fitted Q-Iteration with Clustered Data
Generalized Fitted Q-Iteration with Clustered Data
Liyuan Hu
Jitao Wang
Zhenke Wu
C. Shi
OffRL
128
0
0
04 Oct 2025
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
Xinyu Liu
Zixuan Xie
Shangtong Zhang
68
2
0
30 Sep 2025
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
Sihan Zeng
Benjamin Patrick Evans
Sujay Bhatt
Leo Ardon
Sumitra Ganesh
Alec Koppel
103
0
0
18 Sep 2025
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Zixuan Xie
Xinyu Liu
Rohan Chandra
Shangtong Zhang
303
1
0
27 May 2025
Natural Policy Gradient for Average Reward Non-Stationary RL
Natural Policy Gradient for Average Reward Non-Stationary RL
Neharika Jali
Eshika Pathak
Pranay Sharma
Guannan Qu
Gauri Joshi
244
1
0
23 Apr 2025
A Hybrid Reinforcement Learning Framework for Hard Latency Constrained Resource Scheduling
A Hybrid Reinforcement Learning Framework for Hard Latency Constrained Resource SchedulingIEEE Internet of Things Journal (IEEE IoT J.), 2025
Luyuan Zhang
An Liu
Kexuan Wang
108
2
0
30 Mar 2025
Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global Optimality
Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global OptimalityInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Ruijia Zhang
Siliang Zeng
Chenliang Li
Alfredo García
Mingyi Hong
283
0
0
22 Mar 2025
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative ModelInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Zilong Deng
Simon Khan
Shaofeng Zou
491
2
0
11 Mar 2025
Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation
Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function ApproximationInternational Conference on Learning Representations (ICLR), 2024
Chenyu Zhang
Xu Chen
Xuan Di
359
7
0
17 Feb 2025
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
319
2
0
13 Aug 2024
Finite-Time Analysis of Simultaneous Double Q-learning
Finite-Time Analysis of Simultaneous Double Q-learning
Hyunjun Na
Donghwan Lee
149
0
0
14 Jun 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep
  Reinforcement Learning
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
Shuai Zhang
Heshan Devaka Fernando
Miao Liu
K. Murugesan
Songtao Lu
Pin-Yu Chen
Tianyi Chen
Meng Wang
212
3
0
24 May 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2024
Sihan Zeng
Thinh T. Doan
331
9
0
15 May 2024
Graphon Mean Field Games with a Representative Player: Analysis and
  Learning Algorithm
Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm
Fuzhong Zhou
Chenyu Zhang
Xu Chen
Xuan Di
291
7
0
08 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with
  Deep Neural Networks
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
227
0
0
07 May 2024
A Single Online Agent Can Efficiently Learn Mean Field Games
A Single Online Agent Can Efficiently Learn Mean Field GamesEuropean Conference on Artificial Intelligence (ECAI), 2024
Chenyu Zhang
Xu Chen
Xuan Di
OffRL
295
2
0
05 May 2024
Enhancing Classification Performance via Reinforcement Learning for
  Feature Selection
Enhancing Classification Performance via Reinforcement Learning for Feature Selection
Younes Ghazagh Jahed
Seyyed Ali Sadat Tavana
162
3
0
09 Mar 2024
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement
  Learning
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Chenyu Zhang
Han Wang
Aritra Mitra
James Anderson
229
30
0
27 Jan 2024
Neural Network Approximation for Pessimistic Offline Reinforcement
  Learning
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu
Yuling Jiao
Li Shen
Haizhao Yang
Xiliang Lu
OffRL
258
1
0
19 Dec 2023
Lifting the Veil: Unlocking the Power of Depth in Q-learning
Lifting the Veil: Unlocking the Power of Depth in Q-learning
Shao-Bo Lin
Tao Li
Shaojie Tang
Yao Wang
Ding-Xuan Zhou
OffRLOOD
186
0
0
27 Oct 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks
  with $ε$-Greedy Exploration
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with εεε-Greedy ExplorationNeural Information Processing Systems (NeurIPS), 2023
Shuai Zhang
Hongkang Li
Meng Wang
Miao Liu
Pin-Yu Chen
Songtao Lu
Sijia Liu
K. Murugesan
Subhajit Chaudhury
307
38
0
24 Oct 2023
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
HyeAnn Lee
Donghwan Lee
174
1
0
10 Oct 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function ApproximationNeural Information Processing Systems (NeurIPS), 2023
Efstathia Soufleri
Jian Li
213
17
0
03 Oct 2023
TD Convergence: An Optimization Perspective
TD Convergence: An Optimization PerspectiveNeural Information Processing Systems (NeurIPS), 2023
Kavosh Asadi
Shoham Sabach
Yao Liu
Omer Gottesman
Rasool Fakoor
MU
245
12
0
30 Jun 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality GapInternational Conference on Machine Learning (ICML), 2023
Hang Wang
Sen Lin
Junshan Zhang
OffRLOnRL
217
3
0
20 Jun 2023
A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement
  Learning with Provable Convergence
A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence
Kexuan Wang
An Liu
Baishuo Liu
143
1
0
10 Jun 2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in
  Zero-Sum Stochastic Games
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic GamesNeural Information Processing Systems (NeurIPS), 2023
Zaiwei Chen
Jianchao Tan
Eric Mazumdar
Asuman Ozdaglar
Adam Wierman
325
12
0
03 Mar 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function
  Approximation
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
158
0
0
25 Feb 2023
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement
  Learning via Multi-Level Monte Carlo Actor-Critic
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-CriticInternational Conference on Machine Learning (ICML), 2023
Wesley A Suttle
Amrit Singh Bedi
Bhrij Patel
Brian M Sadler
Alec Koppel
Dinesh Manocha
255
20
0
28 Jan 2023
A Policy Optimization Method Towards Optimal-time Stability
A Policy Optimization Method Towards Optimal-time StabilityConference on Robot Learning (CoRL), 2023
Shengjie Wang
Lan Fengb
Xiang Zheng
Yu-wen Cao
Oluwatosin Oseni
Haotian Xu
Tao Zhang
Yang Gao
216
3
0
02 Jan 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement
  Operators
Offline Reinforcement Learning with Closed-Form Policy Improvement OperatorsInternational Conference on Machine Learning (ICML), 2022
Jiachen Li
Edwin Zhang
Ming Yin
Qinxun Bai
Yu Wang
William Yang Wang
OffRL
216
18
0
29 Nov 2022
Finite-time analysis of single-timescale actor-critic
Finite-time analysis of single-timescale actor-criticNeural Information Processing Systems (NeurIPS), 2022
Xu-yang Chen
Tianyuan Chen
OffRL
311
27
0
18 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time GuaranteesNeural Information Processing Systems (NeurIPS), 2022
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
347
49
0
04 Oct 2022
Structural Estimation of Markov Decision Processes in High-Dimensional
  State Space with Finite-Time Guarantees
Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time GuaranteesOperational Research (OR), 2022
Siliang Zeng
Mingyi Hong
Alfredo García
OffRL
265
15
0
04 Oct 2022
Finite-Time Error Bounds for Greedy-GQ
Finite-Time Error Bounds for Greedy-GQMachine-mediated learning (ML), 2022
Yue Wang
Yi Zhou
Shaofeng Zou
302
2
0
06 Sep 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Robust Knowledge Adaptation for Dynamic Graph Neural NetworksIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
195
20
0
22 Jul 2022
q-Learning in Continuous Time
q-Learning in Continuous TimeJournal of machine learning research (JMLR), 2022
Yanwei Jia
X. Zhou
OffRL
468
94
0
02 Jul 2022
Analysis of Stochastic Processes through Replay Buffers
Analysis of Stochastic Processes through Replay BuffersInternational Conference on Machine Learning (ICML), 2022
Shirli Di-Castro Shashua
Shie Mannor
Dotan Di-Castro
154
8
0
26 Jun 2022
A Single-Timescale Analysis For Stochastic Approximation With Multiple
  Coupled Sequences
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled SequencesNeural Information Processing Systems (NeurIPS), 2022
Han Shen
Tianyi Chen
224
21
0
21 Jun 2022
Algorithm for Constrained Markov Decision Process with Linear
  Convergence
Algorithm for Constrained Markov Decision Process with Linear ConvergenceInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
E. Gladin
Maksim Lavrik-Karmazin
K. Zainullina
Varvara Rudenko
Alexander V. Gasnikov
Martin Takáč
242
9
0
03 Jun 2022
Finite-Time Analysis of Temporal Difference Learning: Discrete-Time
  Linear System Perspective
Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective
Dong-hwan Lee
Do Wan Kim
OffRL
256
0
0
22 Apr 2022
Data Sampling Affects the Complexity of Online SGD over Dependent Data
Data Sampling Affects the Complexity of Online SGD over Dependent DataConference on Uncertainty in Artificial Intelligence (UAI), 2022
Shaocong Ma
Ziyi Chen
Yi Zhou
Kaiyi Ji
Yingbin Liang
201
6
0
31 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in $Q$-Learning
Target Network and Truncation Overcome The Deadly Triad in QQQ-LearningSIAM Journal on Mathematics of Data Science (SIMODS), 2022
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
179
26
0
05 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement
  Learning in Infinite Horizons
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite HorizonsJournal of the American Statistical Association (JASA), 2022
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRLOnRL
196
15
0
26 Feb 2022
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation
  in Two-sided Markets
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided MarketsAnnals of Applied Statistics (AOAS), 2022
C. Shi
Runzhe Wan
Ge Song
Shuang Luo
R. Song
Hongtu Zhu
OffRL
259
6
0
21 Feb 2022
Stochastic linear optimization never overfits with quadratically-bounded
  losses on general data
Stochastic linear optimization never overfits with quadratically-bounded losses on general dataAnnual Conference Computational Learning Theory (COLT), 2022
Matus Telgarsky
233
13
0
14 Feb 2022
On the Convergence of SARSA with Linear Function Approximation
On the Convergence of SARSA with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2022
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
209
16
0
14 Feb 2022
123
Next