Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1801.05039
Cited By

Global Convergence of Policy Gradient Methods for the Linear Quadratic
Regulator

v1v2v3 (latest)

Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator

15 January 2018

ArXiv (abs)PDF HTML

Papers citing "Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator"

50 / 279 papers shown

Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization

Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization

250

0

0

21 Nov 2025

The Confusing Instance Principle for Online Linear Quadratic Control

The Confusing Instance Principle for Online Linear Quadratic Control

Odalric-Ambrym Maillard

178

1

0

22 Oct 2025

Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach

Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach

213

0

0

16 Oct 2025

Global Convergence of Policy Gradient for Entropy Regularized Linear-Quadratic Control with Multiplicative Noise

Global Convergence of Policy Gradient for Entropy Regularized Linear-Quadratic Control with Multiplicative Noise

364

0

0

03 Oct 2025

On the System Theoretic Offline Learning of Continuous-Time LQR with Exogenous Disturbances

On the System Theoretic Offline Learning of Continuous-Time LQR with Exogenous Disturbances

Sayak Mukherjee

Ramij-Raja Hossain

M. Halappanavar

172

0

0

20 Sep 2025

Predictability Enables Parallelization of Nonlinear State Space Models

Predictability Enables Parallelization of Nonlinear State Space Models

Xavier Gonzalez

Kenneth L. Clarkson

Scott W. Linderman

305

5

0

22 Aug 2025

Statistical and Algorithmic Foundations of Reinforcement Learning

Statistical and Algorithmic Foundations of Reinforcement Learning

275

2

0

19 Jul 2025

Suboptimality analysis of receding horizon quadratic control with unknown linear systems and its applications in learning-based control

Suboptimality analysis of receding horizon quadratic control with unknown linear systems and its applications in learning-based controlIEEE Transactions on Automatic Control (TAC), 2023

Anastasios Tsiamis

304

3

0

01 Jul 2025

Online Multi-Agent Control with Adversarial Disturbances

Online Multi-Agent Control with Adversarial Disturbances

John Lazarsfeld

Georgios Piliouras

Antonios Varvitsiotis

294

0

0

23 Jun 2025

Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games

Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games

209

3

0

06 Jun 2025

Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator

Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic RegulatorInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

342

1

0

02 May 2025

Learning Stabilizing Policies via an Unstable Subspace Representation

Learning Stabilizing Policies via an Unstable Subspace Representation

Leonardo F. Toso

432

2

0

02 May 2025

MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning

MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning

Giancarlo Ferrari-Trecate

241

5

0

03 Apr 2025

Policy Gradient for LQR with Domain Randomization

Policy Gradient for LQR with Domain Randomization

Tesshu Fujinami

George J. Pappas

269

4

0

31 Mar 2025

Remarks on the Polyak-Lojasiewicz inequality and the convergence of gradient systems

Remarks on the Polyak-Lojasiewicz inequality and the convergence of gradient systems

A. C. B. D. Oliveira

209

4

0

31 Mar 2025

Enhanced Derivative-Free Optimization Using Adaptive Correlation-Induced Finite Difference Estimators

Enhanced Derivative-Free Optimization Using Adaptive Correlation-Induced Finite Difference Estimators

169

0

0

28 Feb 2025

Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning

Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning

Leonardo F. Toso

596

5

0

04 Feb 2025

A learning-based approach to stochastic optimal control under reach-avoid constraint

A learning-based approach to stochastic optimal control under reach-avoid constraintInternational Conference on Hybrid Systems: Computation and Control (HSCC), 2024

Maryam Kamgarpour

493

2

0

21 Dec 2024

Differentiable Quantum Computing for Large-scale Linear Control

Differentiable Quantum Computing for Large-scale Linear ControlNeural Information Processing Systems (NeurIPS), 2024

191

5

0

03 Nov 2024

Approximate Feedback Nash Equilibria with Sparse Inter-Agent Dependencies

Approximate Feedback Nash Equilibria with Sparse Inter-Agent Dependencies

Filippos Fotiadis

Mustafa O. Karabag

David Fridovich-Keil

Ufuk Topcu

248

0

0

21 Oct 2024

Nash equilibria in scalar discrete-time linear quadratic games

Nash equilibria in scalar discrete-time linear quadratic gamesEuropean Control Conference (ECC), 2024

Giulio Salizzoni

Maryam Kamgarpour

325

4

0

16 Oct 2024

Towards Fast Rates for Federated and Multi-Task Reinforcement Learning

Towards Fast Rates for Federated and Multi-Task Reinforcement LearningIEEE Conference on Decision and Control (CDC), 2024

Robert W. Heath Jr.

250

3

0

09 Sep 2024

Exploratory Optimal Stopping: A Singular Control Formulation

Exploratory Optimal Stopping: A Singular Control Formulation

Giorgio Ferrari

319

16

0

18 Aug 2024

Nonlinear Perturbation-based Non-Convex Optimization over Time-Varying
Networks

Nonlinear Perturbation-based Non-Convex Optimization over Time-Varying NetworksIEEE Transactions on Network Science and Engineering (TNSE), 2024

Mohammadreza Doostmohammadian

Zulfiya R. Gabidullina

Hamid R. Rabiee

224

21

0

05 Aug 2024

Robust Cooperative Multi-Agent Reinforcement Learning:A Mean-Field Type
Game Perspective

Robust Cooperative Multi-Agent Reinforcement Learning:A Mean-Field Type Game Perspective

Muhammad Aneeq uz Zaman

Mathieu Laurière

313

8

0

20 Jun 2024

Two-Timescale Optimization Framework for Decentralized Linear-Quadratic
Optimal Control

Two-Timescale Optimization Framework for Decentralized Linear-Quadratic Optimal Control

403

0

0

17 Jun 2024

Learning to Stabilize Unknown LTI Systems on a Single Trajectory under
Stochastic Noise

Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise

306

2

0

31 May 2024

Performance of NPG in Countable State-Space Average-Cost RL

Performance of NPG in Countable State-Space Average-Cost RL

Yashaswini Murthy

R. Srikant

285

2

0

30 May 2024

Mollification Effects of Policy Gradient Methods

Mollification Effects of Policy Gradient Methods

304

2

0

28 May 2024

Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of
Ergodic Linear Quadratic Regulators

Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators

219

2

0

27 May 2024

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2024

437

11

0

15 May 2024

Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement
Learning

Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning

247

0

0

08 May 2024

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

Learning Optimal Deterministic Policies with Stochastic Policy GradientsInternational Conference on Machine Learning (ICML), 2024

Alessandro Montenegro

Alberto Maria Metelli

379

9

0

03 May 2024

Stabilizing Backpropagation Through Time to Learn Complex Physics

Stabilizing Backpropagation Through Time to Learn Complex PhysicsInternational Conference on Learning Representations (ICLR), 2024

Patrick Schnell

406

2

0

03 May 2024

Learning to Boost the Performance of Stable Nonlinear Systems

Learning to Boost the Performance of Stable Nonlinear Systems

Giancarlo Ferrari-Trecate

252

21

0

01 May 2024

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Amirreza Neshaei Moghaddam

Bahman Gharesifard

302

2

0

16 Apr 2024

Decision Transformer as a Foundation Model for Partially Observable
Continuous Control

Decision Transformer as a Foundation Model for Partially Observable Continuous ControlAmerican Control Conference (ACC), 2024

Xiangyuan Zhang

291

8

0

03 Apr 2024

A Moreau Envelope Approach for LQR Meta-Policy Estimation

A Moreau Envelope Approach for LQR Meta-Policy Estimation

César A. Uribe

276

3

0

26 Mar 2024

Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective

Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective

Muhammad Aneeq uz Zaman

Mathieu Laurière

273

5

0

17 Mar 2024

Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG

Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG

Shahin Shahrampour

329

1

0

13 Mar 2024

On the Global Convergence of Policy Gradient in Average Reward Markov
Decision Processes

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

Yashaswini Murthy

R. Srikant

228

11

0

11 Mar 2024

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical
Systems

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

Wesley A Suttle

230

3

0

06 Mar 2024

Distributed Policy Gradient for Linear Quadratic Networked Control with
Limited Communication Range

Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range

Yuan-Chung Shen

251

1

0

05 Mar 2024

Linear quadratic control of nonlinear systems with Koopman operator learning and the Nyström method

Linear quadratic control of nonlinear systems with Koopman operator learning and the Nyström method

Edoardo Caldarelli

Antoine Chatalic

C. Ocampo‐Martinez

Lorenzo Rosasco

531

4

0

05 Mar 2024

Policy Optimization for PDE Control with a Warm Start

Policy Optimization for PDE Control with a Warm Start

Xiangyuan Zhang

265

4

0

01 Mar 2024

Taming Nonconvex Stochastic Mirror Descent with General Bregman
Divergence

Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence

Ilyas Fatkhullin

379

16

0

27 Feb 2024

Model-Free $μ$-Synthesis: A Nonsmooth Optimization Perspective

μ

-Synthesis: A Nonsmooth Optimization Perspective

Darioush Keivan

Peter M. Seiler

Geir Dullerud

229

0

0

18 Feb 2024

Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

Sobihan Surendran

Antoine Godichon-Baggioni

Adeline Fermanian

Sylvain Le Corff

348

5

0

05 Feb 2024

On the Complexity of Finite-Sum Smooth Optimization under the
Polyak-Łojasiewicz Condition

On the Complexity of Finite-Sum Smooth Optimization under the Polyak-Łojasiewicz Condition

250

2

0

04 Feb 2024

Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML
Approach for Model-free LQR

Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for Model-free LQRConference on Learning for Dynamics & Control (L4DC), 2024

Leonardo F. Toso

James Anderson

321

17

0

25 Jan 2024

Page 1 of 6