ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.09921
  4. Cited By
Finite-time analysis of single-timescale actor-critic
v1v2v3v4 (latest)

Finite-time analysis of single-timescale actor-critic

Neural Information Processing Systems (NeurIPS), 2022
18 October 2022
Xu-yang Chen
Tianyuan Chen
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Finite-time analysis of single-timescale actor-critic"

14 / 14 papers shown
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
Rui Hu
Yu Chen
Longbo Huang
176
0
0
14 Oct 2025
Finite Time Analysis of Constrained Natural Critic-Actor Algorithm with Improved Sample Complexity
Finite Time Analysis of Constrained Natural Critic-Actor Algorithm with Improved Sample Complexity
Prashansa Panda
Shalabh Bhatnagar
138
0
0
05 Oct 2025
Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential
Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential
Yuping Zheng
Andrew G. Lamperski
259
0
0
03 Oct 2025
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach
Regret Analysis of Average-Reward Unichain MDPs via an Actor-Critic Approach
Swetha Ganesh
Vaneet Aggarwal
273
6
0
26 May 2025
IISE PG&E Energy Analytics Challenge 2025: Hourly-Binned Regression Models Beat Transformers in Load Forecasting
IISE PG&E Energy Analytics Challenge 2025: Hourly-Binned Regression Models Beat Transformers in Load Forecasting
Millend Roy
Vladimir Pyltsov
Yinbo Hu
315
0
0
16 May 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic RegulatorInternational Joint Conference on Artificial Intelligence (IJCAI), 2024
Xuyang Chen
Jingliang Duan
Tianyuan Chen
330
1
0
02 May 2025
On The Global Convergence Of Online RLHF With Neural Parametrization
On The Global Convergence Of Online RLHF With Neural Parametrization
Mudit Gaur
Amrit Singh Bedi
Raghu Pasupathy
Vaneet Aggarwal
342
1
0
21 Oct 2024
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
427
2
0
13 Aug 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2024
Sihan Zeng
Thinh T. Doan
429
11
0
15 May 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
457
5
0
02 Feb 2024
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic AlgorithmsConference on Uncertainty in Artificial Intelligence (UAI), 2023
Prashansa Panda
Shalabh Bhatnagar
508
1
0
25 Oct 2023
On the Global Convergence of Natural Actor-Critic with Two-layer Neural
  Network Parametrization
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization
Mudit Gaur
Amrit Singh Bedi
Di-di Wang
Vaneet Aggarwal
284
8
0
18 Jun 2023
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement
  Learning via Multi-Level Monte Carlo Actor-Critic
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-CriticInternational Conference on Machine Learning (ICML), 2023
Wesley A Suttle
Amrit Singh Bedi
Bhrij Patel
Brian M Sadler
Alec Koppel
Dinesh Manocha
328
24
0
28 Jan 2023
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural
  Network Parametrization
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network ParametrizationInternational Conference on Machine Learning (ICML), 2022
Mudit Gaur
Vaneet Aggarwal
Mridul Agarwal
MLT
419
3
0
14 Nov 2022
1
Page 1 of 1