Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.02591
Cited By
v1
v2
v3
v4 (latest)
A Small Gain Analysis of Single Timescale Actor Critic
SIAM Journal of Control and Optimization (SICON), 2022
4 March 2022
Alexander Olshevsky
Bahman Gharesifard
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"A Small Gain Analysis of Single Timescale Actor Critic"
16 / 16 papers shown
Stabilizing Policy Gradient Methods via Reward Profiling
Shihab Ahmed
El Houcine Bergou
A. Dutta
Yue Wang
OffRL
269
0
0
20 Nov 2025
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
Rui Hu
Yu Chen
Longbo Huang
181
0
0
14 Oct 2025
Finite Time Analysis of Constrained Natural Critic-Actor Algorithm with Improved Sample Complexity
Prashansa Panda
Shalabh Bhatnagar
146
0
0
05 Oct 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Xuyang Chen
Jingliang Duan
Tianyuan Chen
352
2
0
02 May 2025
On The Global Convergence Of Online RLHF With Neural Parametrization
Mudit Gaur
Amrit Singh Bedi
Raghu Pasupathy
Vaneet Aggarwal
370
1
0
21 Oct 2024
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
476
2
0
13 Aug 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2024
Sihan Zeng
Thinh T. Doan
452
11
0
15 May 2024
One-Shot Averaging for Distributed TD(
λ
λ
λ
) Under Markov Sampling
IEEE Control Systems Letters (L-CSS), 2024
Haoxing Tian
I. Paschalidis
Alexander Olshevsky
OffRL
297
6
0
13 Mar 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
492
5
0
02 Feb 2024
On the Second-Order Convergence of Biased Policy Gradient Algorithms
International Conference on Machine Learning (ICML), 2023
Siqiao Mu
Diego Klabjan
486
4
0
05 Nov 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Neural Information Processing Systems (NeurIPS), 2023
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
478
6
0
24 May 2023
Finite-time analysis of single-timescale actor-critic
Neural Information Processing Systems (NeurIPS), 2022
Xu-yang Chen
Tianyuan Chen
OffRL
459
31
0
18 Oct 2022
Global Convergence of Two-timescale Actor-Critic for Solving Linear Quadratic Regulator
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xu-yang Chen
Jingliang Duan
Yingbin Liang
Tianyuan Chen
290
11
0
18 Aug 2022
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Neural Information Processing Systems (NeurIPS), 2022
Han Shen
Tianyi Chen
292
23
0
21 Jun 2022
Finite-Time Analysis of Fully Decentralized Single-Timescale Actor-Critic
Qijun Luo
Xiao Li
370
3
0
12 Jun 2022
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Machine-mediated learning (ML), 2019
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
417
101
0
18 Oct 2019
1
Page 1 of 1