$Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret$

Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret

International Conference on Machine Learning (ICML), 2021

25 February 2021

Papers citing "Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret"

8 / 8 papers shown

Learning Stabilizing Policies via an Unstable Subspace Representation

Leonardo F. Toso

Lintao Ye

James Anderson

433

02 May 2025

Optimal Rates for Bandit Nonstochastic ControlNeural Information Processing Systems (NeurIPS), 2023

Y. Jennifer Sun

Stephen Newman

Elad Hazan

443

24 May 2023

$Learning Decentralized Linear Quadratic Regulators with $\sqrt{T}$ Regret$

Learning Decentralized Linear Quadratic Regulators with

\sqrt{T}

RegretSIAM Journal of Control and Optimization (SICON), 2022

386

17 Oct 2022

How are policy gradient methods affected by the limits of control?IEEE Conference on Decision and Control (CDC), 2022

304

14 Jun 2022

Learning to Control under Time-Varying Environment

168

06 Jun 2022

Rate-Optimal Online Convex Optimization in Adaptive Linear ControlNeural Information Processing Systems (NeurIPS), 2022

Asaf B. Cassel

Alon Cohen

Google Research

219

03 Jun 2022

On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure

Lintao Ye

Haoqi Zhu

V. Gupta

347

14 Oct 2021

Regret-Optimal Filtering for Prediction and EstimationIEEE Transactions on Signal Processing (IEEE TSP), 2021

Oron Sabag

B. Hassibi

398

25 Jan 2021

Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with T\sqrt{T}T​ Regret

Papers citing "Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret"

Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret