QPLEX: Duplex Dueling Multi-Agent Q-Learning


3 August 2020
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
    OffRL

Papers citing "QPLEX: Duplex Dueling Multi-Agent Q-Learning"

50 / 207 papers shown
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning
Cong Guan
F. Chen
Lei Yuan
Zongzhang Zhang
Yang Yu
OffRL
37
4
0
19 Feb 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
26
17
0
15 Feb 2023
Adaptive Value Decomposition with Greedy Marginal Contribution Computation for Cooperative Multi-Agent Reinforcement Learning
Shanqi Liu
Yujing Hu
Runze Wu
Dongxian Xing
Yu Xiong
Changjie Fan
Kun Kuang
Y. Liu
19
0
0
14 Feb 2023
Order Matters: Agent-by-agent Policy Optimization
Xihuai Wang
Zheng Tian
Ziyu Wan
Ying Wen
J. Wang
Weinan Zhang
25
26
0
13 Feb 2023
MANSA: Learning Fast and Slow in Multi-Agent Systems
D. Mguni
Hao Chen
Taher Jafferjee
Jianhong Wang
Long Fei
Xidong Feng
Stephen Marcus McAleer
Feifei Tong
Jun Wang
Yaodong Yang
30
1
0
12 Feb 2023
ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning
Yongsheng Mei
Hanhan Zhou
Tian-Shing Lan
30
11
0
11 Feb 2023
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence
Simin Li
Jun Guo
Jingqiao Xiu
Pu Feng
Xin Yu
Aishan Liu
Wenjun Wu
Xianglong Liu
AAML
42
13
0
07 Feb 2023
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Bin Zhang
Dapeng Li
Guangchong Zhou
Zeren Zhang
Guoliang Fan
28
3
0
04 Feb 2023
Best Possible Q-Learning
Jiechuan Jiang
Zongqing Lu
OffRL
20
5
0
02 Feb 2023
DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning
Xu Hu
Jian Zhao
Wen-gang Zhou
Ruili Feng
Houqiang Li
29
1
0
25 Jan 2023
TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems
Matteo Gallici
Mario Martin
Ivan Masmitja
OffRL
11
9
0
13 Jan 2023
Self-Motivated Multi-Agent Exploration
Shaowei Zhang
Jiahan Cao
Lei Yuan
Yang Yu
De-Chuan Zhan
41
5
0
05 Jan 2023
Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability
Thomy Phan
Fabian Ritz
Philipp Altmann
Maximilian Zorn
Jonas Nusslein
Michael Kolle
Thomas Gabor
Claudia Linnhoff-Popien
22
12
0
04 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
27
25
0
29 Dec 2022
Strangeness-driven Exploration in Multi-Agent Reinforcement Learning
Ju-Bong Kim
Ho-bin Choi
Youn-Hee Han
14
4
0
27 Dec 2022
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
19
83
0
14 Dec 2022
Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning
M. Ibrahim
Ammar Fayad
22
1
0
14 Dec 2022
Curriculum Learning for Relative Overgeneralization
Lin Shi
Bei Peng
25
1
0
06 Dec 2022
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Chuming Li
Jie Liu
Yinmin Zhang
Yuhong Wei
Yazhe Niu
Yaodong Yang
Y. Liu
Wanli Ouyang
46
23
0
29 Nov 2022
Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
Shunyu Liu
Yihe Zhou
Jie Song
Tongya Zheng
Kaixuan Chen
Tongtian Zhu
Zunlei Feng
Mingli Song
40
17
0
23 Nov 2022
Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning
Lipeng Wan
Zeyang Liu
Xingyu Chen
Xuguang Lan
Han Wang
37
12
0
22 Nov 2022
Decision-making with Speculative Opponent Models
Jing-rong Sun
Shuo Chen
Cong Zhang
Yining Ma
Jie Zhang
28
1
0
22 Nov 2022
Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition
Pascal Leroy
J. Pisane
D. Ernst
17
3
0
21 Nov 2022
Decentralized Policy Optimization
Kefan Su
Zongqing Lu
11
8
0
06 Nov 2022
Non-Linear Coordination Graphs
Yipeng Kang
Tonghan Wang
Xiao-Ren Wu
Qianlan Yang
Chongjie Zhang
29
9
0
26 Oct 2022
RPM: Generalizable Behaviors for Multi-Agent Reinforcement Learning
Wei Qiu
Xiao Ma
Bo An
S. Obraztsova
Shuicheng Yan
Zhongwen Xu
11
1
0
18 Oct 2022
PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
Yiqun Chen
Hangyu Mao
Jiaxin Mao
Shiguang Wu
Tianle Zhang
Bin Zhang
Bin Wang
Hong Chang
OffRL
36
7
0
17 Oct 2022
Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient
Wubing Chen
Wenbin Li
Xiao Liu
Shangdong Yang
Yang Gao
40
5
0
10 Oct 2022
Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu
Vedant Shah
Oussama Boussif
Cristian Meo
Anirudh Goyal
Tianmin Shu
Michael C. Mozer
N. Heess
Yoshua Bengio
24
7
0
04 Oct 2022
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Filippos Christianos
Georgios Papoudakis
Stefano V. Albrecht
27
4
0
28 Sep 2022
More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization
Jiangxing Wang
Deheng Ye
Zongqing Lu
OffRL
39
18
0
26 Sep 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
39
49
0
21 Sep 2022
Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning
Yi-Te Hong
Yaochu Jin
Yang Tang
17
22
0
20 Sep 2022
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning
Kefan Su
Siyuan Zhou
Jiechuan Jiang
Chuang Gan
Xiangjun Wang
Zongqing Lu
OffRL
28
6
0
17 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
57
5
0
15 Sep 2022
Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction
Taher Jafferjee
Juliusz Ziomek
Tianpei Yang
Zipeng Dai
Jianhong Wang
Matthew E. Taylor
Kun Shao
J. Wang
D. Mguni
32
0
0
02 Sep 2022
A Policy Resonance Approach to Solve the Problem of Responsibility Diffusion in Multiagent Reinforcement Learning
Qing Fu
Tenghai Qiu
Jianqiang Yi
Zhiqiang Pu
Xiaolin Ai
Wanmai Yuan
24
0
0
16 Aug 2022
Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft
Muhammad Junaid Khan
Syed Hammad Ahmed
G. Sukthankar
23
15
0
15 Aug 2022
Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning
Kai Liu
Tianxian Zhang
L. Kong
28
0
0
07 Aug 2022
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework
Jianing Ye
Chenghao Li
Jianhao Wang
Chongjie Zhang
37
2
0
12 Jul 2022
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning
Shunyu Liu
Jie Song
Yihe Zhou
Na Yu
Kaixuan Chen
Zunlei Feng
Mingli Song
26
7
0
08 Jul 2022
PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning
Hanhan Zhou
Tian-Shing Lan
Vaneet Aggarwal
16
4
0
22 Jun 2022
S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?
Shuang Luo
Yinchuan Li
Jiahui Li
Kun Kuang
Furui Liu
Yunfeng Shao
Chao-Xiang Wu
OffRL
16
6
0
20 Jun 2022
From Multi-agent to Multi-robot: A Scalable Training and Evaluation Platform for Multi-robot Reinforcement Learning
Zhixuan Liang
Jiannong Cao
Shan Jiang
Divya Saxena
Jinlin Chen
Huafeng Xu
22
9
0
20 Jun 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
Wei Fu
Chao Yu
Zelai Xu
Jiaqi Yang
Yi Wu
34
32
0
15 Jun 2022
RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning
Haoxing Chen
Guang Yang
Junge Zhang
Qiyue Yin
Kaiqi Huang
17
2
0
02 Jun 2022
Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games
Ziyi Liu
Xian Guo
Yongchun Fang
18
0
0
31 May 2022
Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning
Rafael Pina
V. D. Silva
Joosep Hook
A. Kondoz
12
13
0
30 May 2022
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
R. Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
26
2
0
27 May 2022
QGNN: Value Function Factorisation with Graph Neural Networks
Ryan Kortvelesy
Amanda Prorok
19
15
0
25 May 2022