Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.03557
Cited By
Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms
7 May 2020
Tengyu Xu
Zhe Wang
Yingbin Liang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms"
18 / 18 papers shown
Title
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Xuyang Chen
Jingliang Duan
Lin Zhao
51
1
0
02 May 2025
How to Find the Exact Pareto Front for Multi-Objective MDPs?
Yining Li
Peizhong Ju
Ness B. Shroff
128
0
0
21 Oct 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
Wendelin Bohmer
OffRL
31
0
0
03 Jun 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
27
2
0
26 Jan 2024
Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards
Semih Cayci
A. Eryilmaz
18
2
0
20 Jun 2023
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Yanli Liu
K. Zhang
Tamer Basar
W. Yin
32
102
0
15 Nov 2022
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
27
165
0
08 Dec 2021
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis
Ziyi Chen
Yi Zhou
Rongrong Chen
Shaofeng Zou
13
24
0
08 Sep 2021
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat
Pascal Bianchi
Julien Lehmann
19
9
0
14 Jun 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
52
29
0
26 May 2021
On the Linear convergence of Natural Policy Gradient Algorithm
S. Khodadadian
P. Jhunjhunwala
Sushil Mahavir Varma
S. T. Maguluri
30
56
0
04 May 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CML
OffRL
69
26
0
18 Feb 2021
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
34
121
0
11 Nov 2020
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial
Amal Feriani
E. Hossain
22
236
0
06 Nov 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
15
42
0
02 Aug 2020
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Mingyi Hong
Hoi-To Wai
Zhaoran Wang
Zhuoran Yang
13
134
0
10 Jul 2020
Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward
Guannan Qu
Yiheng Lin
Adam Wierman
Na Li
15
69
0
11 Jun 2020
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
99
79
0
18 Oct 2019
1