Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator

15 January 2018
Maryam Fazel
Rong Ge
Sham Kakade
M. Mesbahi

Papers citing "Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator"

50 / 279 papers shown
Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization
Vinay Kanakeri
Shivam Bajaj
Ashwin Verma
Vijay Gupta
Aritra Mitra
OffRL
143
0
0
21 Nov 2025
The Confusing Instance Principle for Online Linear Quadratic Control
Waris Radji
Odalric-Ambrym Maillard
OffRL
80
1
0
22 Oct 2025
Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach
Xin Guo
Zijiu Lyu
OffRL
76
0
0
16 Oct 2025
Global Convergence of Policy Gradient for Entropy Regularized Linear-Quadratic Control with Multiplicative Noise
Gabriel Diaz
Lucky Li
Wenhao Zhang
140
0
0
03 Oct 2025
On the System Theoretic Offline Learning of Continuous-Time LQR with Exogenous Disturbances
Sayak Mukherjee
Ramij-Raja Hossain
M. Halappanavar
OffRL
74
0
0
20 Sep 2025
Predictability Enables Parallelization of Nonlinear State Space Models
Xavier Gonzalez
Leo Kozachkov
D. Zoltowski
Kenneth L. Clarkson
Scott W. Linderman
125
3
0
22 Aug 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
157
2
0
19 Jul 2025
Suboptimality analysis of receding horizon quadratic control with unknown linear systems and its applications in learning-based control
IEEE Transactions on Automatic Control (TAC), 2023
Shengli Shi
Anastasios Tsiamis
B. de Schutter
125
2
0
01 Jul 2025
Online Multi-Agent Control with Adversarial Disturbances
Anas Barakat
John Lazarsfeld
Georgios Piliouras
Antonios Varvitsiotis
186
0
0
23 Jun 2025
Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games
Philipp Plank
Yufei Zhang
128
1
0
06 Jun 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Xuyang Chen
Jingliang Duan
Tianyuan Chen
238
1
0
02 May 2025
Learning Stabilizing Policies via an Unstable Subspace Representation
Leonardo F. Toso
Lintao Ye
James Anderson
230
1
0
02 May 2025
MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning
Luca Furieri
Sucheth Shenoy
Danilo Saccani
Andrea Martin
Giancarlo Ferrari-Trecate
135
2
0
03 Apr 2025
Policy Gradient for LQR with Domain Randomization
Tesshu Fujinami
Bruce D. Lee
Nikolai Matni
George J. Pappas
163
1
0
31 Mar 2025
Remarks on the Polyak-Lojasiewicz inequality and the convergence of gradient systems
A. C. B. D. Oliveira
Leilei Cui
Eduardo Sontag
130
1
0
31 Mar 2025
Enhanced Derivative-Free Optimization Using Adaptive Correlation-Induced Finite Difference Estimators
Guo Liang
Guangwu Liu
Kun Zhang
88
0
0
28 Feb 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
431
3
0
04 Feb 2025
A learning-based approach to stochastic optimal control under reach-avoid constraint
International Conference on Hybrid Systems: Computation and Control (HSCC), 2024
Tingting Ni
Maryam Kamgarpour
330
1
0
21 Dec 2024
Differentiable Quantum Computing for Large-scale Linear Control
Neural Information Processing Systems (NeurIPS), 2024
Connor Clayton
Jiaqi Leng
Gengzhi Yang
Yi-Ling Qiao
Ming Lin
Xiaodi Wu
129
2
0
03 Nov 2024
Approximate Feedback Nash Equilibria with Sparse Inter-Agent Dependencies
Xinjie Liu
Jingqi Li
Filippos Fotiadis
Mustafa O. Karabag
Jesse Milzman
David Fridovich-Keil
Ufuk Topcu
128
0
0
21 Oct 2024
Nash equilibria in scalar discrete-time linear quadratic games
European Control Conference (ECC), 2024
Giulio Salizzoni
Reda Ouhamma
Maryam Kamgarpour
242
3
0
16 Oct 2024
Towards Fast Rates for Federated and Multi-Task Reinforcement Learning
IEEE Conference on Decision and Control (CDC), 2024
Feng Zhu
Robert W. Heath Jr.
Aritra Mitra
151
1
0
09 Sep 2024
Exploratory Optimal Stopping: A Singular Control Formulation
Jodi Dianetti
Giorgio Ferrari
Renyuan Xu
152
11
0
18 Aug 2024
Nonlinear Perturbation-based Non-Convex Optimization over Time-Varying Networks
IEEE Transactions on Network Science and Engineering (TNSE), 2024
Mohammadreza Doostmohammadian
Zulfiya R. Gabidullina
Hamid R. Rabiee
135
13
0
05 Aug 2024
Robust Cooperative Multi-Agent Reinforcement Learning: A Mean-Field Type Game Perspective
Muhammad Aneeq uz Zaman
Mathieu Laurière
Alec Koppel
Tamer Basar
203
6
0
20 Jun 2024
Two-Timescale Optimization Framework for Decentralized Linear-Quadratic Optimal Control
Lechen Feng
Yuan-Hua Ni
Xuebo Zhang
275
0
0
17 Jun 2024
Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Ziyi Zhang
Yorie Nakahira
Guannan Qu
153
2
0
31 May 2024
Performance of NPG in Countable State-Space Average-Cost RL
Yashaswini Murthy
Isaac Grosof
S. T. Maguluri
R. Srikant
OffRL
156
1
0
30 May 2024
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
184
1
0
28 May 2024
Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators
Yunian Pan
Quanyan Zhu
140
2
0
27 May 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2024
Sihan Zeng
Thinh T. Doan
311
9
0
15 May 2024
Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning
Haobin Zhang
Zhuang Yang
174
0
0
08 May 2024
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
International Conference on Machine Learning (ICML), 2024
Alessandro Montenegro
Marco Mussi
Alberto Maria Metelli
Matteo Papini
233
5
0
03 May 2024
Stabilizing Backpropagation Through Time to Learn Complex Physics
International Conference on Learning Representations (ICLR), 2024
Patrick Schnell
Nils Thuerey
315
2
0
03 May 2024
Learning to Boost the Performance of Stable Nonlinear Systems
Luca Furieri
C. Galimberti
Giancarlo Ferrari-Trecate
138
15
0
01 May 2024
Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens
Amirreza Neshaei Moghaddam
A. Olshevsky
Bahman Gharesifard
180
6
0
16 Apr 2024
Decision Transformer as a Foundation Model for Partially Observable Continuous Control
American Control Conference (ACC), 2024
Xiangyuan Zhang
Weichao Mao
Haoran Qiu
Tamer Basar
OffRL
AI4CE
193
6
0
03 Apr 2024
A Moreau Envelope Approach for LQR Meta-Policy Estimation
Ashwin Aravind
Taha Toghani
César A. Uribe
192
3
0
26 Mar 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Muhammad Aneeq uz Zaman
Alec Koppel
Mathieu Laurière
Tamer Basar
195
5
0
17 Mar 2024
Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG
Ting-Jui Chang
Shahin Shahrampour
OffRL
195
1
0
13 Mar 2024
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
Navdeep Kumar
Yashaswini Murthy
Itai Shufaro
Kfir Y. Levy
R. Srikant
Shie Mannor
129
8
0
11 Mar 2024
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems
Wesley A Suttle
Vipul K Sharma
K. Kosaraju
S. Sivaranjani
Ji Liu
Vijay Gupta
Brian M Sadler
158
2
0
06 Mar 2024
Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Yuzi Yan
Yuan-Chung Shen
148
1
0
05 Mar 2024
Linear quadratic control of nonlinear systems with Koopman operator learning and the Nyström method
Edoardo Caldarelli
Antoine Chatalic
Adrià Colomé
C. Molinari
C. Ocampo‐Martinez
Carme Torras
Lorenzo Rosasco
364
3
0
05 Mar 2024
Policy Optimization for PDE Control with a Warm Start
Xiangyuan Zhang
S. Mowlavi
M. Benosman
Tamer Basar
145
2
0
01 Mar 2024
Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence
Ilyas Fatkhullin
Niao He
224
12
0
27 Feb 2024
Model-Free μ-Synthesis: A Nonsmooth Optimization Perspective
Darioush Keivan
Xing-ming Guo
Peter M. Seiler
Geir Dullerud
Bin Hu
143
0
0
18 Feb 2024
Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation
Sobihan Surendran
Antoine Godichon-Baggioni
Adeline Fermanian
Sylvain Le Corff
277
3
0
05 Feb 2024
On the Complexity of Finite-Sum Smooth Optimization under the Polyak-Łojasiewicz Condition
Yunyan Bai
Yuxing Liu
Luo Luo
124
1
0
04 Feb 2024
Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for Model-free LQR
Conference on Learning for Dynamics & Control (L4DC), 2024
Leonardo F. Toso
Donglin Zhan
James Anderson
Han Wang
226
16
0
25 Jan 2024