1902.02234
Finite-Sample Analysis for SARSA with Linear Function Approximation
6 February 2019
Shaofeng Zou
Tengyu Xu
Yingbin Liang
Papers citing
"Finite-Sample Analysis for SARSA with Linear Function Approximation"
50 / 101 papers shown
Title
Towards Formalizing Reinforcement Learning Theory
Shangtong Zhang
90
0
0
05 Nov 2025
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
Phalguni Nanda
Zaiwei Chen
110
1
0
17 Oct 2025
Non-iid hypothesis testing: from classical to quantum
Giacomo De Palma
Marco Fanizza
Connor Mowry
Ryan O'Donnell
80
0
0
07 Oct 2025
Generalized Fitted Q-Iteration with Clustered Data
Liyuan Hu
Jitao Wang
Zhenke Wu
C. Shi
OffRL
128
0
0
04 Oct 2025
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
Xinyu Liu
Zixuan Xie
Shangtong Zhang
68
2
0
30 Sep 2025
Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
Sihan Zeng
Benjamin Patrick Evans
Sujay Bhatt
Leo Ardon
Sumitra Ganesh
Alec Koppel
103
0
0
18 Sep 2025
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
Zixuan Xie
Xinyu Liu
Rohan Chandra
Shangtong Zhang
303
1
0
27 May 2025
Natural Policy Gradient for Average Reward Non-Stationary RL
Neharika Jali
Eshika Pathak
Pranay Sharma
Guannan Qu
Gauri Joshi
244
1
0
23 Apr 2025
A Hybrid Reinforcement Learning Framework for Hard Latency Constrained Resource Scheduling
IEEE Internet of Things Journal (IEEE IoT J.), 2025
Luyuan Zhang
An Liu
Kexuan Wang
108
2
0
30 Mar 2025
Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global Optimality
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Ruijia Zhang
Siliang Zeng
Chenliang Li
Alfredo García
Mingyi Hong
283
0
0
22 Mar 2025
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Zilong Deng
Simon Khan
Shaofeng Zou
491
2
0
11 Mar 2025
Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation
International Conference on Learning Representations (ICLR), 2024
Chenyu Zhang
Xu Chen
Xuan Di
359
7
0
17 Feb 2025
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
319
2
0
13 Aug 2024
Finite-Time Analysis of Simultaneous Double Q-learning
Hyunjun Na
Donghwan Lee
149
0
0
14 Jun 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
Shuai Zhang
Heshan Devaka Fernando
Miao Liu
K. Murugesan
Songtao Lu
Pin-Yu Chen
Tianyi Chen
Meng Wang
212
3
0
24 May 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2024
Sihan Zeng
Thinh T. Doan
331
9
0
15 May 2024
Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm
Fuzhong Zhou
Chenyu Zhang
Xu Chen
Xuan Di
291
7
0
08 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
227
0
0
07 May 2024
A Single Online Agent Can Efficiently Learn Mean Field Games
European Conference on Artificial Intelligence (ECAI), 2024
Chenyu Zhang
Xu Chen
Xuan Di
OffRL
295
2
0
05 May 2024
Enhancing Classification Performance via Reinforcement Learning for Feature Selection
Younes Ghazagh Jahed
Seyyed Ali Sadat Tavana
162
3
0
09 Mar 2024
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
International Conference on Learning Representations (ICLR), 2024
Chenyu Zhang
Han Wang
Aritra Mitra
James Anderson
229
30
0
27 Jan 2024
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu
Yuling Jiao
Li Shen
Haizhao Yang
Xiliang Lu
OffRL
258
1
0
19 Dec 2023
Lifting the Veil: Unlocking the Power of Depth in Q-learning
Shao-Bo Lin
Tao Li
Shaojie Tang
Yao Wang
Ding-Xuan Zhou
OffRL
OOD
186
0
0
27 Oct 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ε-Greedy Exploration
Neural Information Processing Systems (NeurIPS), 2023
Shuai Zhang
Hongkang Li
Meng Wang
Miao Liu
Pin-Yu Chen
Songtao Lu
Sijia Liu
K. Murugesan
Subhajit Chaudhury
307
38
0
24 Oct 2023
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
HyeAnn Lee
Donghwan Lee
174
1
0
10 Oct 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Efstathia Soufleri
Jian Li
213
17
0
03 Oct 2023
TD Convergence: An Optimization Perspective
Neural Information Processing Systems (NeurIPS), 2023
Kavosh Asadi
Shoham Sabach
Yao Liu
Omer Gottesman
Rasool Fakoor
MU
245
12
0
30 Jun 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
International Conference on Machine Learning (ICML), 2023
Hang Wang
Sen Lin
Junshan Zhang
OffRL
OnRL
217
3
0
20 Jun 2023
A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence
Kexuan Wang
An Liu
Baishuo Liu
143
1
0
10 Jun 2023
A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games
Neural Information Processing Systems (NeurIPS), 2023
Zaiwei Chen
Jianchao Tan
Eric Mazumdar
Asuman Ozdaglar
Adam Wierman
325
12
0
03 Mar 2023
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
Zhifa Ke
Junyu Zhang
Zaiwen Wen
158
0
0
25 Feb 2023
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic
International Conference on Machine Learning (ICML), 2023
Wesley A Suttle
Amrit Singh Bedi
Bhrij Patel
Brian M Sadler
Alec Koppel
Dinesh Manocha
255
20
0
28 Jan 2023
A Policy Optimization Method Towards Optimal-time Stability
Conference on Robot Learning (CoRL), 2023
Shengjie Wang
Fengbo Lan
Xiang Zheng
Yu-wen Cao
Oluwatosin Oseni
Haotian Xu
Tao Zhang
Yang Gao
216
3
0
02 Jan 2023
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
International Conference on Machine Learning (ICML), 2022
Jiachen Li
Edwin Zhang
Ming Yin
Qinxun Bai
Yu Wang
William Yang Wang
OffRL
216
18
0
29 Nov 2022
Finite-time analysis of single-timescale actor-critic
Neural Information Processing Systems (NeurIPS), 2022
Xu-yang Chen
Tianyuan Chen
OffRL
311
27
0
18 Oct 2022
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
Neural Information Processing Systems (NeurIPS), 2022
Siliang Zeng
Chenliang Li
Alfredo García
Mingyi Hong
347
49
0
04 Oct 2022
Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees
Operational Research (OR), 2022
Siliang Zeng
Mingyi Hong
Alfredo García
OffRL
265
15
0
04 Oct 2022
Finite-Time Error Bounds for Greedy-GQ
Machine-mediated learning (ML), 2022
Yue Wang
Yi Zhou
Shaofeng Zou
302
2
0
06 Sep 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
195
20
0
22 Jul 2022
q-Learning in Continuous Time
Journal of machine learning research (JMLR), 2022
Yanwei Jia
X. Zhou
OffRL
468
94
0
02 Jul 2022
Analysis of Stochastic Processes through Replay Buffers
International Conference on Machine Learning (ICML), 2022
Shirli Di-Castro Shashua
Shie Mannor
Dotan Di-Castro
154
8
0
26 Jun 2022
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Neural Information Processing Systems (NeurIPS), 2022
Han Shen
Tianyi Chen
224
21
0
21 Jun 2022
Algorithm for Constrained Markov Decision Process with Linear Convergence
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
E. Gladin
Maksim Lavrik-Karmazin
K. Zainullina
Varvara Rudenko
Alexander V. Gasnikov
Martin Takáč
242
9
0
03 Jun 2022
Finite-Time Analysis of Temporal Difference Learning: Discrete-Time Linear System Perspective
Dong-hwan Lee
Do Wan Kim
OffRL
256
0
0
22 Apr 2022
Data Sampling Affects the Complexity of Online SGD over Dependent Data
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Shaocong Ma
Ziyi Chen
Yi Zhou
Kaiyi Ji
Yingbin Liang
201
6
0
31 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in Q-Learning
SIAM Journal on Mathematics of Data Science (SIMODS), 2022
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
179
26
0
05 Mar 2022
Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons
Journal of the American Statistical Association (JASA), 2022
C. Shi
Shuang Luo
Yuan Le
Hongtu Zhu
R. Song
OffRL
OnRL
196
15
0
26 Feb 2022
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets
Annals of Applied Statistics (AOAS), 2022
C. Shi
Runzhe Wan
Ge Song
Shuang Luo
R. Song
Hongtu Zhu
OffRL
259
6
0
21 Feb 2022
Stochastic linear optimization never overfits with quadratically-bounded losses on general data
Annual Conference Computational Learning Theory (COLT), 2022
Matus Telgarsky
233
13
0
14 Feb 2022
On the Convergence of SARSA with Linear Function Approximation
International Conference on Machine Learning (ICML), 2022
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
209
16
0
14 Feb 2022