
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator

15 January 2018

Papers citing "Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator"

50 / 279 papers shown

Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization Vinay Kanakeri Shivam Bajaj Ashwin Verma Vijay Gupta Aritra Mitra OffRL 143 0 0 21 Nov 2025
The Confusing Instance Principle for Online Linear Quadratic Control Waris Radji Odalric-Ambrym Maillard OffRL 80 1 0 22 Oct 2025
Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach Xin Guo Zijiu Lyu OffRL 76 0 0 16 Oct 2025
Global Convergence of Policy Gradient for Entropy Regularized Linear-Quadratic Control with Multiplicative Noise Gabriel Diaz Lucky Li Wenhao Zhang 140 0 0 03 Oct 2025
On the System Theoretic Offline Learning of Continuous-Time LQR with Exogenous Disturbances Sayak Mukherjee Ramij-Raja Hossain M. Halappanavar OffRL 74 0 0 20 Sep 2025
Predictability Enables Parallelization of Nonlinear State Space Models Xavier Gonzalez Leo Kozachkov D. Zoltowski Kenneth L. Clarkson Scott W. Linderman 125 3 0 22 Aug 2025
Statistical and Algorithmic Foundations of Reinforcement Learning Yuejie Chi Yuxin Chen Yuting Wei OffRL 157 2 0 19 Jul 2025
Suboptimality analysis of receding horizon quadratic control with unknown linear systems and its applications in learning-based control IEEE Transactions on Automatic Control (TAC), 2023 Shengli Shi Anastasios Tsiamis B. de Schutter 125 2 0 01 Jul 2025
Online Multi-Agent Control with Adversarial Disturbances Anas Barakat John Lazarsfeld Georgios Piliouras Antonios Varvitsiotis 186 0 0 23 Jun 2025
Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games Philipp Plank Yufei Zhang 128 1 0 06 Jun 2025
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator International Joint Conference on Artificial Intelligence (IJCAI), 2024 Xuyang Chen Jingliang Duan Tianyuan Chen 238 1 0 02 May 2025
Learning Stabilizing Policies via an Unstable Subspace Representation Leonardo F. Toso Lintao Ye James Anderson 230 1 0 02 May 2025
MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning Luca Furieri Sucheth Shenoy Danilo Saccani Andrea Martin Giancarlo Ferrari-Trecate 135 2 0 03 Apr 2025
Policy Gradient for LQR with Domain Randomization Tesshu Fujinami Bruce D. Lee Nikolai Matni George J. Pappas 163 1 0 31 Mar 2025
Remarks on the Polyak-Lojasiewicz inequality and the convergence of gradient systems A. C. B. D. Oliveira Leilei Cui Eduardo Sontag 130 1 0 31 Mar 2025
Enhanced Derivative-Free Optimization Using Adaptive Correlation-Induced Finite Difference Estimators Guo Liang Guangwu Liu Kun Zhang 88 0 0 28 Feb 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning Donglin Zhan Leonardo F. Toso James Anderson 431 3 0 04 Feb 2025
A learning-based approach to stochastic optimal control under reach-avoid constraint International Conference on Hybrid Systems: Computation and Control (HSCC), 2024 Tingting Ni Maryam Kamgarpour 330 1 0 21 Dec 2024
Differentiable Quantum Computing for Large-scale Linear Control Neural Information Processing Systems (NeurIPS), 2024 Connor Clayton Jiaqi Leng Gengzhi Yang Yi-Ling Qiao Ming Lin Xiaodi Wu 129 2 0 03 Nov 2024
Approximate Feedback Nash Equilibria with Sparse Inter-Agent Dependencies Xinjie Liu Jingqi Li Filippos Fotiadis Mustafa O. Karabag Jesse Milzman David Fridovich-Keil Ufuk Topcu 128 0 0 21 Oct 2024
Nash equilibria in scalar discrete-time linear quadratic games European Control Conference (ECC), 2024 Giulio Salizzoni Reda Ouhamma Maryam Kamgarpour 242 3 0 16 Oct 2024
Towards Fast Rates for Federated and Multi-Task Reinforcement Learning IEEE Conference on Decision and Control (CDC), 2024 Feng Zhu Robert W. Heath Jr. Aritra Mitra 151 1 0 09 Sep 2024
Exploratory Optimal Stopping: A Singular Control Formulation Jodi Dianetti Giorgio Ferrari Renyuan Xu 152 11 0 18 Aug 2024
Nonlinear Perturbation-based Non-Convex Optimization over Time-Varying Networks IEEE Transactions on Network Science and Engineering (TNSE), 2024 Mohammadreza Doostmohammadian Zulfiya R. Gabidullina Hamid R. Rabiee 135 13 0 05 Aug 2024
Robust Cooperative Multi-Agent Reinforcement Learning: A Mean-Field Type Game Perspective Muhammad Aneeq uz Zaman Mathieu Laurière Alec Koppel Tamer Basar 203 6 0 20 Jun 2024
Two-Timescale Optimization Framework for Decentralized Linear-Quadratic Optimal Control Lechen Feng Yuan-Hua Ni Xuebo Zhang 275 0 0 17 Jun 2024
Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise Ziyi Zhang Yorie Nakahira Guannan Qu 153 2 0 31 May 2024
Performance of NPG in Countable State-Space Average-Cost RL Yashaswini Murthy Isaac Grosof S. T. Maguluri R. Srikant OffRL 156 1 0 30 May 2024
Mollification Effects of Policy Gradient Methods Tao Wang Sylvia Herbert Sicun Gao 184 1 0 28 May 2024
Model-Agnostic Zeroth-Order Policy Optimization for Meta-Learning of Ergodic Linear Quadratic Regulators Yunian Pan Quanyan Zhu 140 2 0 27 May 2024
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning Annual Conference Computational Learning Theory (COLT), 2024 Sihan Zeng Thinh T. Doan 311 9 0 15 May 2024
Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning Haobin Zhang Zhuang Yang 174 0 0 08 May 2024
Learning Optimal Deterministic Policies with Stochastic Policy Gradients International Conference on Machine Learning (ICML), 2024 Alessandro Montenegro Marco Mussi Alberto Maria Metelli Matteo Papini 233 5 0 03 May 2024
Stabilizing Backpropagation Through Time to Learn Complex Physics International Conference on Learning Representations (ICLR), 2024 Patrick Schnell Nils Thuerey 315 2 0 03 May 2024
Learning to Boost the Performance of Stable Nonlinear Systems Luca Furieri C. Galimberti Giancarlo Ferrari-Trecate 138 15 0 01 May 2024
Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens Amirreza Neshaei Moghaddam A. Olshevsky Bahman Gharesifard 180 6 0 16 Apr 2024
Decision Transformer as a Foundation Model for Partially Observable Continuous Control American Control Conference (ACC), 2024 Xiangyuan Zhang Weichao Mao Haoran Qiu Tamer Basar OffRL AI4CE 193 6 0 03 Apr 2024
A Moreau Envelope Approach for LQR Meta-Policy Estimation Ashwin Aravind Taha Toghani César A. Uribe 192 3 0 26 Mar 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective Muhammad Aneeq uz Zaman Alec Koppel Mathieu Laurière Tamer Basar 195 5 0 17 Mar 2024
Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG Ting-Jui Chang Shahin Shahrampour OffRL 195 1 0 13 Mar 2024
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes Navdeep Kumar Yashaswini Murthy Itai Shufaro Kfir Y. Levy R. Srikant Shie Mannor 129 8 0 11 Mar 2024
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems Wesley A Suttle Vipul K Sharma K. Kosaraju S. Sivaranjani Ji Liu Vijay Gupta Brian M Sadler 158 2 0 06 Mar 2024
Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range Yuzi Yan Yuan-Chung Shen 148 1 0 05 Mar 2024
Linear quadratic control of nonlinear systems with Koopman operator learning and the Nyström method Edoardo Caldarelli Antoine Chatalic Adrià Colomé C. Molinari C. Ocampo‐Martinez Carme Torras Lorenzo Rosasco 364 3 0 05 Mar 2024
Policy Optimization for PDE Control with a Warm Start Xiangyuan Zhang S. Mowlavi M. Benosman Tamer Basar 145 2 0 01 Mar 2024
Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence Ilyas Fatkhullin Niao He 224 12 0 27 Feb 2024
Model-Free $μ$-Synthesis: A Nonsmooth Optimization Perspective Darioush Keivan Xing-ming Guo Peter M. Seiler Geir Dullerud Bin Hu 143 0 0 18 Feb 2024
Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation Sobihan Surendran Antoine Godichon-Baggioni Adeline Fermanian Sylvain Le Corff 277 3 0 05 Feb 2024
On the Complexity of Finite-Sum Smooth Optimization under the Polyak-Łojasiewicz Condition Yunyan Bai Yuxing Liu Luo Luo 124 1 0 04 Feb 2024
Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for Model-free LQR Conference on Learning for Dynamics & Control (L4DC), 2024 Leonardo F. Toso Donglin Zhan James Anderson Han Wang 226 16 0 25 Jan 2024