A Small Gain Analysis of Single Timescale Actor Critic

SIAM Journal of Control and Optimization (SICON), 2022
4 March 2022
Alexander Olshevsky
Bahman Gharesifard
arXiv:2203.02591

Papers citing "A Small Gain Analysis of Single Timescale Actor Critic"

16 / 16 papers shown
Stabilizing Policy Gradient Methods via Reward Profiling
Shihab Ahmed
El Houcine Bergou
A. Dutta
Yue Wang
OffRL
269
0
0
20 Nov 2025
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
Rui Hu
Yu Chen
Longbo Huang
181
0
0
14 Oct 2025
Finite Time Analysis of Constrained Natural Critic-Actor Algorithm with Improved Sample Complexity
Prashansa Panda
Shalabh Bhatnagar
146
0
0
05 Oct 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Xuyang Chen
Jingliang Duan
Tianyuan Chen
352
2
0
02 May 2025
On The Global Convergence Of Online RLHF With Neural Parametrization
Mudit Gaur
Amrit Singh Bedi
Raghu Pasupathy
Vaneet Aggarwal
370
1
0
21 Oct 2024
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
476
2
0
13 Aug 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2024
Sihan Zeng
Thinh T. Doan
452
11
0
15 May 2024
One-Shot Averaging for Distributed TD(λ) Under Markov Sampling
IEEE Control Systems Letters (L-CSS), 2024
Haoxing Tian
I. Paschalidis
Alexander Olshevsky
OffRL
297
6
0
13 Mar 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
492
5
0
02 Feb 2024
On the Second-Order Convergence of Biased Policy Gradient Algorithms
International Conference on Machine Learning (ICML), 2023
Siqiao Mu
Diego Klabjan
486
4
0
05 Nov 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Neural Information Processing Systems (NeurIPS), 2023
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
478
6
0
24 May 2023
Finite-time analysis of single-timescale actor-critic
Neural Information Processing Systems (NeurIPS), 2022
Xu-yang Chen
Tianyuan Chen
OffRL
459
31
0
18 Oct 2022
Global Convergence of Two-timescale Actor-Critic for Solving Linear Quadratic Regulator
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xu-yang Chen
Jingliang Duan
Yingbin Liang
Tianyuan Chen
290
11
0
18 Aug 2022
A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences
Neural Information Processing Systems (NeurIPS), 2022
Han Shen
Tianyi Chen
292
23
0
21 Jun 2022
Finite-Time Analysis of Fully Decentralized Single-Timescale Actor-Critic
Qijun Luo
Xiao Li
370
3
0
12 Jun 2022
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Machine-mediated learning (ML), 2019
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
417
101
0
18 Oct 2019