Multi-Agent Trust Region Policy Optimization
arXiv:2010.07916, versions v1, v2, v3 (latest)

15 October 2020
Hepeng Li
Haibo He

Papers citing "Multi-Agent Trust Region Policy Optimization"

18 / 18 papers shown
Collaborative AI Teaming in Unknown Environments via Active Goal Deduction
Zuyuan Zhang
Hanhan Zhou
Mahdi Imani
Taeyoung Lee
Tian-Shing Lan
171
11
0
22 Mar 2024
Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey
Jiechuan Jiang
Kefan Su
Zongqing Lu
224
8
0
10 Jan 2024
MARC: A multi-agent robots control framework for enhancing reinforcement learning in construction tasks
Kangkang Duan
C. W. Suen
Zhengbo Zou
104
2
0
23 May 2023
How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory
Ziqing Zhu
S. Bu
K. Chan
Bin Zhou
S. Xia
168
0
0
04 May 2023
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
192
98
0
19 Apr 2023
Order Matters: Agent-by-agent Policy Optimization
International Conference on Learning Representations (ICLR), 2023
Xihuai Wang
Zheng Tian
Bo Liu
Ying Wen
Jun Wang
Weinan Zhang
289
42
0
13 Feb 2023
Best Possible Q-Learning
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Jiechuan Jiang
Zongqing Lu
OffRL
220
7
0
02 Feb 2023
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
J. Kuba
Xidong Feng
Shiyao Ding
Hao Dong
Jun Wang
Yaodong Yang
157
28
0
02 Aug 2022
Learning Distributed and Fair Policies for Network Load Balancing as Markov Potential Game
Neural Information Processing Systems (NeurIPS), 2022
Zhiyuan Yao
Zihan Ding
OffRL
271
2
0
03 Jun 2022
DM$^2$: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching
AAAI Conference on Artificial Intelligence (AAAI), 2022
Caroline Wang
Ishan Durugkar
Elad Liebman
Peter Stone
226
7
0
01 Jun 2022
Trust Region Bounds for Decentralized PPO Under Non-stationarity
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Mingfei Sun
Sam Devlin
Jacob Beck
Katja Hofmann
Shimon Whiteson
297
13
0
31 Jan 2022
Coordinated Proximal Policy Optimization
Zifan Wu
Chao Yu
Deheng Ye
Junge Zhang
Haiyin Piao
H. Zhuo
173
60
0
07 Nov 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization
Sahar Roostaie
M. Ebadzadeh
198
6
0
26 Oct 2021
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
International Conference on Learning Representations (ICLR), 2021
J. Kuba
Ruiqing Chen
Munning Wen
Ying Wen
Fanglei Sun
Jun Wang
Yaodong Yang
359
324
0
23 Sep 2021
Policy Regularization via Noisy Advantage Values for Cooperative Multi-agent Actor-Critic methods
Jian Hu
Siyue Hu
Shih-Wei Liao
569
20
0
27 Jun 2021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
International Conference on Distributed Artificial Intelligence (DAI), 2021
Ying Wen
Hui Chen
Yaodong Yang
Zheng Tian
Minne Li
Xu Chen
Jun Wang
199
13
0
12 Jun 2021
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces
International Conference on Machine Learning (ICML), 2021
Chi Jin
Qinghua Liu
Tiancheng Yu
209
55
0
07 Jun 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
International Conference on Learning Representations (ICLR), 2021
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
352
14
0
21 Feb 2021