Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach

19 December 2019
Yingying Li
Yujie Tang
Runyu Zhang
Na Li

Papers citing "Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach"

41 / 41 papers shown
Title
Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR
Shahbaz P Qadri Syed
He Bai
43
0
0
29 Apr 2025
Scalable spectral representations for multi-agent reinforcement learning in network MDPs
Zhaolin Ren
Runyu Zhang
Bo Dai
17
0
0
22 Oct 2024
Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation
Yanjie Dong
Haijun Zhang
Gang Wang
Shisheng Cui
Xiping Hu
48
1
0
13 Aug 2024
System stabilization with policy optimization on unstable latent manifolds
Steffen W. R. Werner
Benjamin Peherstorfer
30
1
0
08 Jul 2024
Robust Cooperative Multi-Agent Reinforcement Learning: A Mean-Field Type Game Perspective
Muhammad Aneeq uz Zaman
Mathieu Laurière
Alec Koppel
Tamer Basar
45
3
0
20 Jun 2024
Two-Timescale Optimization Framework for Decentralized Linear-Quadratic Optimal Control
Lechen Feng
Yuan-Hua Ni
Xuebo Zhang
31
0
0
17 Jun 2024
Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
Ziyi Zhang
Yorie Nakahira
Guannan Qu
17
2
0
31 May 2024
Achieving Dimension-Free Communication in Federated Learning via Zeroth-Order Optimization
Zhe Li
Bicheng Ying
Zidong Liu
Haibo Yang
FedML
59
3
0
24 May 2024
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective
Muhammad Aneeq uz Zaman
Alec Koppel
Mathieu Laurière
Tamer Basar
39
3
0
17 Mar 2024
Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Yuzi Yan
Yuan-Chung Shen
34
0
0
05 Mar 2024
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms
Xiangyuan Zhang
Weichao Mao
S. Mowlavi
M. Benosman
Tamer Basar
OffRL
AI4CE
24
2
0
30 Nov 2023
On the Hardness of Learning to Stabilize Linear Systems
Xiong Zeng
Zexiang Liu
Zhe Du
N. Ozay
Mario Sznaier
26
3
0
18 Nov 2023
Learning the Uncertainty Sets for Control Dynamics via Set Membership: A Non-Asymptotic Analysis
Yingying Li
Jing Yu
Lauren Conger
Taylan Kargin
Adam Wierman
43
5
0
26 Sep 2023
Global Convergence of Receding-Horizon Policy Search in Learning Estimator Designs
Xiangyuan Zhang
S. Mowlavi
M. Benosman
Tamer Basar
32
7
0
09 Sep 2023
Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity and Last-Iterate Convergence
Jiduan Wu
Anas Barakat
Ilyas Fatkhullin
Niao He
34
5
0
08 Sep 2023
Policy Evaluation in Distributional LQR
Zifan Wang
Yulong Gao
Si Wang
Michael M. Zavlanos
Alessandro Abate
Karl H. Johansson
OffRL
26
3
0
23 Mar 2023
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
Xiangyuan Zhang
Tamer Basar
36
19
0
25 Feb 2023
Policy Evaluation in Decentralized POMDPs with Belief Sharing
Mert Kayaalp
Fatima Ghadieh
Ali H. Sayed
16
2
0
08 Feb 2023
Learning the Kalman Filter with Fine-Grained Sample Complexity
Xiangyuan Zhang
Bin Hu
Tamer Basar
26
16
0
30 Jan 2023
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xing-ming Guo
Bin Hu
41
12
0
20 Oct 2022
Learning Decentralized Linear Quadratic Regulators with $\sqrt{T}$ Regret
Lintao Ye
Ming Chi
Ruiquan Liao
V. Gupta
16
1
0
17 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaipeng Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Basar
87
27
0
10 Oct 2022
Minibatch Stochastic Three Points Method for Unconstrained Smooth Minimization
Soumia Boucherouite
Grigory Malinovsky
Peter Richtárik
El Houcine Bergou
21
3
0
16 Sep 2022
Secure Distributed/Federated Learning: Prediction-Privacy Trade-Off for Multi-Agent System
Mohamed Ridha Znaidi
Gaurav Gupta
P. Bogdan
FedML
8
1
0
24 Apr 2022
Distributed Multi-Agent Reinforcement Learning Based on Graph-Induced Local Value Functions
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
30
2
0
26 Feb 2022
On the Sample Complexity of Decentralized Linear Quadratic Regulator with Partially Nested Information Structure
Lintao Ye
Haoqi Zhu
V. Gupta
30
14
0
14 Oct 2021
Scalable regret for learning to control network-coupled subsystems with unknown dynamics
Sagar Sudhakara
Aditya Mahajan
A. Nayyar
Yi Ouyang
25
4
0
18 Aug 2021
Mean-Field Multi-Agent Reinforcement Learning: A Decentralized Network Approach
Haotian Gu
Xin Guo
Xiaoli Wei
Renyuan Xu
OOD
40
36
0
05 Aug 2021
Asynchronous Distributed Reinforcement Learning for LQR Control via Zeroth-Order Block Coordinate Descent
Gangshan Jing
H. Bai
Jemin George
A. Chakrabortty
P. Sharma
22
8
0
26 Jul 2021
Gradient play in stochastic games: stationary points, convergence, and sample complexity
Runyu Zhang
Zhaolin Ren
Na Li
26
43
0
01 Jun 2021
Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
Kaipeng Zhang
Xiangyuan Zhang
Bin Hu
Tamer Basar
21
19
0
04 Jan 2021
Distributed Q-Learning with State Tracking for Multi-agent Networked Control
Hang Wang
Sen Lin
Hamid Jafarkhani
Junshan Zhang
OffRL
16
3
0
22 Dec 2020
Leveraging Predictions in Smoothed Online Convex Optimization via Gradient-based Algorithms
Yingying Li
Na Li
9
19
0
25 Nov 2020
Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
19
8
0
24 Nov 2020
Primal-dual Learning for the Model-free Risk-constrained Linear Quadratic Regulator
Feiran Zhao
Keyou You
21
20
0
22 Nov 2020
LQR with Tracking: A Zeroth-order Approach and Its Global Convergence
Zhaolin Ren
Aoxiao Zhong
Na Li
17
3
0
03 Nov 2020
Online Optimal Control with Affine Constraints
Yingying Li
Subhro Das
Na Li
12
40
0
10 Oct 2020
Robust Reinforcement Learning: A Case Study in Linear Quadratic Regulation
Bo Pang
Zhong-Ping Jiang
40
34
0
25 Aug 2020
Cooperative Multi-Agent Reinforcement Learning with Partial Observations
Yan Zhang
Michael M. Zavlanos
OffRL
30
22
0
18 Jun 2020
Combining Model-Based and Model-Free Methods for Nonlinear Control: A Provably Convergent Policy Gradient Approach
Guannan Qu
Chenkai Yu
S. Low
Adam Wierman
6
19
0
12 Jun 2020
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning
Dimitri Bertsekas
38
38
0
04 May 2020