Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.05039
Cited By
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
15 January 2018
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator"
46 / 46 papers shown
Title
Learning Stabilizing Policies via an Unstable Subspace Representation
Leonardo F. Toso
Lintao Ye
James Anderson
21
0
0
02 May 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
Xuyang Chen
Jingliang Duan
Lin Zhao
38
1
0
02 May 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
88
1
0
04 Feb 2025
A learning-based approach to stochastic optimal control under reach-avoid constraint
Tingting Ni
Maryam Kamgarpour
70
0
0
21 Dec 2024
Nash equilibria in scalar discrete-time linear quadratic games
Giulio Salizzoni
Reda Ouhamma
Maryam Kamgarpour
25
0
0
16 Oct 2024
Performance of NPG in Countable State-Space Average-Cost RL
Yashaswini Murthy
Isaac Grosof
S. T. Maguluri
R. Srikant
OffRL
19
1
0
30 May 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Sihan Zeng
Thinh T. Doan
49
5
0
15 May 2024
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Alessandro Montenegro
Marco Mussi
Alberto Maria Metelli
Matteo Papini
33
2
0
03 May 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Muhammad Aneeq uz Zaman
Alec Koppel
Mathieu Laurière
Tamer Basar
26
3
0
17 Mar 2024
Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Yuzi Yan
Yuan-Chung Shen
19
0
0
05 Mar 2024
Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems
Céline Comte
Matthieu Jonckheere
J. Sanders
Albert Senen-Cerda
20
0
0
05 Dec 2023
On the Hardness of Learning to Stabilize Linear Systems
Xiong Zeng
Zexiang Liu
Zhe Du
N. Ozay
Mario Sznaier
21
3
0
18 Nov 2023
A Large Deviations Perspective on Policy Gradient Algorithms
Wouter Jongeneel
Daniel Kuhn
Mengmeng Li
11
1
0
13 Nov 2023
Oracle Complexity Reduction for Model-free LQR: A Stochastic Variance-Reduced Policy Gradient Approach
Leonardo F. Toso
Han Wang
James Anderson
22
2
0
19 Sep 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
16
0
0
22 Mar 2023
Neural Operators of Backstepping Controller and Observer Gain Functions for Reaction-Diffusion PDEs
Miroslav Krstic
Luke Bhan
Yuanyuan Shi
26
28
0
18 Mar 2023
Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators
Yin-Huan Han
Meisam Razaviyayn
Renyuan Xu
14
5
0
15 Mar 2023
Learning the Kalman Filter with Fine-Grained Sample Complexity
Xiangyuan Zhang
Bin Hu
Tamer Bacsar
18
16
0
30 Jan 2023
Multi-Task Imitation Learning for Linear Dynamical Systems
Thomas T. Zhang
Katie Kang
Bruce D. Lee
Claire Tomlin
Sergey Levine
Stephen Tu
Nikolai Matni
28
23
0
01 Dec 2022
An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods
Yanli Liu
K. Zhang
Tamer Basar
W. Yin
19
102
0
15 Nov 2022
Statistical Learning Theory for Control: A Finite Sample Perspective
Anastasios Tsiamis
Ingvar M. Ziemann
Nikolai Matni
George J. Pappas
13
73
0
12 Sep 2022
A stabilizing reinforcement learning approach for sampled systems with partially unknown models
Lukas Beckenbach
Pavel Osinenko
S. Streif
OffRL
11
1
0
31 Aug 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization
Quan-Wu Xiao
Qing Ling
Tianyi Chen
20
0
0
14 Jun 2022
How are policy gradient methods affected by the limits of control?
Ingvar M. Ziemann
Anastasios Tsiamis
H. Sandberg
Nikolai Matni
20
14
0
14 Jun 2022
Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Asaf B. Cassel
Alon Cohen
Google Research
13
9
0
03 Jun 2022
Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization
Shicong Cen
Fan Chen
Yuejie Chi
16
15
0
12 Apr 2022
Do Differentiable Simulators Give Better Policy Gradients?
H. Suh
Max Simchowitz
K. Zhang
Russ Tedrake
25
94
0
02 Feb 2022
Safe Adaptive Learning-based Control for Constrained Linear Quadratic Regulators with Regret Guarantees
Yingying Li
Subhro Das
J. Shamma
Na Li
11
25
0
31 Oct 2021
Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective
Nai-Chieh Huang
Ping-Chun Hsieh
Kuo-Hao Ho
Hsuan-Yu Yao
Kai-Chun Hu
Liang-Chun Ouyang
I-Chen Wu
9
1
0
26 Oct 2021
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure
Lintao Ye
Haoqi Zhu
V. Gupta
6
14
0
14 Oct 2021
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation
Andrea Zanette
Ching-An Cheng
Alekh Agarwal
10
52
0
24 Mar 2021
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
18
119
0
11 Nov 2020
Adaptive Regret for Control of Time-Varying Dynamics
Paula Gradu
Elad Hazan
Edgar Minasyan
22
47
0
08 Jul 2020
Cooperative Multi-Agent Reinforcement Learning with Partial Observations
Yan Zhang
Michael M. Zavlanos
OffRL
11
22
0
18 Jun 2020
Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems
Junchi Yang
Negar Kiyavash
Niao He
16
83
0
22 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
17
35
0
10 Feb 2020
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem
Hesameddin Mohammadi
A. Zare
Mahdi Soltanolkotabi
M. Jovanović
22
121
0
26 Dec 2019
Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator
Yuwei Luo
Zhuoran Yang
Zhaoran Wang
Mladen Kolar
21
9
0
14 Dec 2019
Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods
René Carmona
Mathieu Laurière
Zongjun Tan
27
61
0
09 Oct 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
24
39
0
14 Jul 2019
On the Global Convergence of Imitation Learning: A Case for Linear Quadratic Regulator
Qi Cai
Mingyi Hong
Yongxin Chen
Zhaoran Wang
16
34
0
11 Jan 2019
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
9
291
0
06 Dec 2018
Input Perturbations for Adaptive Control and Learning
Mohamad Kazem Shirani Faradonbeh
Ambuj Tewari
George Michailidis
11
46
0
10 Nov 2018
Spectral Filtering for General Linear Dynamical Systems
Elad Hazan
Holden Lee
Karan Singh
Cyril Zhang
Yi Zhang
40
97
0
12 Feb 2018
On the Sample Complexity of the Linear Quadratic Regulator
Sarah Dean
Horia Mania
Nikolai Matni
Benjamin Recht
Stephen Tu
26
568
0
04 Oct 2017
Linear Convergence of Gradient and Proximal-Gradient Methods Under the Polyak-Łojasiewicz Condition
Hamed Karimi
J. Nutini
Mark W. Schmidt
119
1,190
0
16 Aug 2016
1