ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.03433
  4. Cited By
Reward Design in Cooperative Multi-agent Reinforcement Learning for
  Packet Routing

Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing

5 March 2020
Hangyu Mao
Zhibo Gong
Zhen Xiao
ArXiv (abs)PDFHTML

Papers citing "Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing"

19 / 19 papers shown
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies
Jiajie Yu
Yuhong Wang
Wei Ma
OffRL
375
5
0
14 Oct 2024
Dynamic neighbourhood optimisation for task allocation using multi-agent
Dynamic neighbourhood optimisation for task allocation using multi-agent
N. Creech
Natalia Criado
S. Miles
344
2
0
16 Feb 2021
Learning Agent Communication under Limited Bandwidth by Message Pruning
Learning Agent Communication under Limited Bandwidth by Message PruningAAAI Conference on Artificial Intelligence (AAAI), 2019
Hangyu Mao
Zhengchao Zhang
Zhen Xiao
Zhibo Gong
Yan Ni
181
109
0
03 Dec 2019
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning
Neighborhood Cognition Consistent Multi-Agent Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2019
Hangyu Mao
Wulong Liu
Jianye Hao
Jun Luo
Dong Li
Zhengchao Zhang
Jun Wang
Zhen Xiao
OffRL
263
82
0
03 Dec 2019
Hierarchical Deep Double Q-Routing
Hierarchical Deep Double Q-Routing
Ramy E. Ali
B. Erman
Ejder Bastug
Bruce Cilli
196
18
0
09 Oct 2019
Learning Multi-agent Communication under Limited-bandwidth Restriction
  for Internet Packet Routing
Learning Multi-agent Communication under Limited-bandwidth Restriction for Internet Packet Routing
Hangyu Mao
Zhibo Gong
Zhengchao Zhang
Zhen Xiao
Yan Ni
AI4CE
192
21
0
26 Feb 2019
Modelling the Dynamic Joint Policy of Teammates with Attention
  Multi-agent DDPG
Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG
Hangyu Mao
Zhengchao Zhang
Zhen Xiao
Zhibo Gong
227
96
0
13 Nov 2018
A Policy Search Method For Temporal Logic Specified Reinforcement
  Learning Tasks
A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks
Xiao Li
Yao Ma
C. Belta
203
61
0
27 Sep 2017
Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in
  Microgrids
Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids
Raghuram Bharadwaj Diddigi
Sai Koti Reddy Danda
S. Bhatnagar
99
4
0
25 Aug 2017
A Distributional Perspective on Reinforcement Learning
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
410
1,769
0
21 Jul 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
561
1,251
0
16 Jun 2017
Hybrid Reward Architecture for Reinforcement Learning
Hybrid Reward Architecture for Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2017
H. V. Seijen
Mehdi Fatemi
Joshua Romoff
Romain Laroche
Tavian Barnes
Jeffrey Tsang
302
281
0
13 Jun 2017
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with
  Deep Multi-agent Reinforcement Learning
ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning
Hangyu Mao
Zhibo Gong
Yan Ni
Zhen Xiao
283
46
0
10 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
862
2,454
0
24 May 2017
Analysing Congestion Problems in Multi-agent Reinforcement Learning
Analysing Congestion Problems in Multi-agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2017
Roxana Rădulescu
Peter Vrancx
A. Nowé
190
12
0
28 Feb 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
646
1,288
0
16 Nov 2016
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2016
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
620
1,398
0
05 Jun 2016
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Ardi Tampuu
Tambet Matiisen
Dorian Kodelja
Ilya Kuzovkin
Kristjan Korjus
Juhan Aru
Jaan Aru
Raul Vicente
404
972
0
27 Nov 2015
Deep Reinforcement Learning in Parameterized Action Space
Deep Reinforcement Learning in Parameterized Action Space
Matthew J. Hausknecht
Peter Stone
501
333
0
13 Nov 2015
1
Page 1 of 1