ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.12608
  4. Cited By
Online Policy Gradient for Model Free Learning of Linear Quadratic
  Regulators with $\sqrt{T}$ Regret

Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with T\sqrt{T}T​ Regret

International Conference on Machine Learning (ICML), 2021
25 February 2021
Asaf B. Cassel
Tomer Koren
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret"

8 / 8 papers shown
Learning Stabilizing Policies via an Unstable Subspace Representation
Learning Stabilizing Policies via an Unstable Subspace Representation
Leonardo F. Toso
Lintao Ye
James Anderson
433
2
0
02 May 2025
Optimal Rates for Bandit Nonstochastic Control
Optimal Rates for Bandit Nonstochastic ControlNeural Information Processing Systems (NeurIPS), 2023
Y. Jennifer Sun
Stephen Newman
Elad Hazan
443
7
0
24 May 2023
Learning Decentralized Linear Quadratic Regulators with $\sqrt{T}$
  Regret
Learning Decentralized Linear Quadratic Regulators with T\sqrt{T}T​ RegretSIAM Journal of Control and Optimization (SICON), 2022
Lintao Ye
Ming Chi
Ruiquan Liao
V. Gupta
386
3
0
17 Oct 2022
How are policy gradient methods affected by the limits of control?
How are policy gradient methods affected by the limits of control?IEEE Conference on Decision and Control (CDC), 2022
Ingvar M. Ziemann
Anastasios Tsiamis
H. Sandberg
Nikolai Matni
304
14
0
14 Jun 2022
Learning to Control under Time-Varying Environment
Learning to Control under Time-Varying Environment
Yuzhen Han
Rubén Solozabal
Jing Dong
Xingyu Zhou
Martin Takáč
B. Gu
168
3
0
06 Jun 2022
Rate-Optimal Online Convex Optimization in Adaptive Linear Control
Rate-Optimal Online Convex Optimization in Adaptive Linear ControlNeural Information Processing Systems (NeurIPS), 2022
Asaf B. Cassel
Alon Cohen
Google Research
219
9
0
03 Jun 2022
On the Sample Complexity of Decentralized Linear Quadratic Regulator
  with Partially Nested Information Structure
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure
Lintao Ye
Haoqi Zhu
V. Gupta
347
18
0
14 Oct 2021
Regret-Optimal Filtering for Prediction and Estimation
Regret-Optimal Filtering for Prediction and EstimationIEEE Transactions on Signal Processing (IEEE TSP), 2021
Oron Sabag
B. Hassibi
398
9
0
25 Jan 2021
1
Page 1 of 1