ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.01641
  4. Cited By
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
v1v2v3 (latest)

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

3 June 2024
John L. Zhou
Weizhe Hong
Jonathan C. Kao
ArXiv (abs)PDFHTMLGithub (8★)

Papers citing "Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents"

25 / 25 papers shown
The Benefits of Power Regularization in Cooperative Reinforcement
  Learning
The Benefits of Power Regularization in Cooperative Reinforcement Learning
Michelle Li
Michael Dennis
310
3
0
17 Jun 2024
LOQA: Learning with Opponent Q-Learning Awareness
LOQA: Learning with Opponent Q-Learning Awareness
Milad Aghajohari
Juan Agustin Duque
Tim Cooijmans
Rameswar Panda
246
9
0
02 May 2024
Aligning Individual and Collective Objectives in Multi-Agent Cooperation
Aligning Individual and Collective Objectives in Multi-Agent Cooperation
Yang Li
Wenhao Zhang
Jianhong Wang
Shao Zhang
Yali Du
Ying Wen
Wei Pan
219
7
0
19 Feb 2024
Scaling Opponent Shaping to High Dimensional Games
Scaling Opponent Shaping to High Dimensional Games
Akbir Khan
Timon Willi
Newton Kwan
Andrea Tacchetti
Chris Xiaoxuan Lu
Edward Grefenstette
Tim Rocktaschel
Jakob N. Foerster
372
14
0
19 Dec 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware AbstractionInternational Conference on Learning Representations (ICLR), 2023
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
499
82
0
13 Oct 2023
Proximal Learning With Opponent-Learning Awareness
Proximal Learning With Opponent-Learning AwarenessNeural Information Processing Systems (NeurIPS), 2022
S. Zhao
Chris Xiaoxuan Lu
Roger C. Grosse
Jakob N. Foerster
342
26
0
18 Oct 2022
The emergence of division of labor through decentralized social
  sanctioning
The emergence of division of labor through decentralized social sanctioning
Anil Yaman
Joel Z Leibo
Giovanni Iacca
Sang Wan Lee
351
9
0
10 Aug 2022
Model-Free Opponent Shaping
Model-Free Opponent ShapingInternational Conference on Machine Learning (ICML), 2022
Chris Xiaoxuan Lu
Timon Willi
Christian Schroeder de Witt
Jakob N. Foerster
415
53
0
03 May 2022
COLA: Consistent Learning with Opponent-Learning Awareness
COLA: Consistent Learning with Opponent-Learning AwarenessInternational Conference on Machine Learning (ICML), 2022
Timon Willi
Alistair Letcher
Johannes Treutlein
Jakob N. Foerster
350
61
0
08 Mar 2022
Model-Based Opponent Modeling
Model-Based Opponent Modeling
Xiaopeng Yu
Jiechuan Jiang
Wanpeng Zhang
Haobin Jiang
Zongqing Lu
OffRL
336
44
0
04 Aug 2021
A Policy Gradient Algorithm for Learning to Learn in Multiagent
  Reinforcement Learning
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement LearningInternational Conference on Machine Learning (ICML), 2020
Dong-Ki Kim
Miao Liu
Matthew D Riemer
Chuangchuang Sun
Marwa Abdulhai
Golnaz Habibi
Sebastian Lopez-Cot
Gerald Tesauro
Jonathan P. How
646
66
0
31 Oct 2020
Learning to Incentivize Other Learning Agents
Learning to Incentivize Other Learning AgentsNeural Information Processing Systems (NeurIPS), 2020
Jiachen Yang
Ang Li
Mehrdad Farajtabar
P. Sunehag
Edward Hughes
H. Zha
416
88
0
10 Jun 2020
Influence-Based Multi-Agent Exploration
Influence-Based Multi-Agent ExplorationInternational Conference on Learning Representations (ICLR), 2019
Tonghan Wang
Jianhao Wang
Yi Wu
Chongjie Zhang
229
153
0
12 Oct 2019
Towards Empathic Deep Q-Learning
Towards Empathic Deep Q-Learning
Bart Bussmann
Jacqueline Heinerman
Joel Lehman
AI4CE
176
12
0
26 Jun 2019
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
775
1,952
0
30 Mar 2018
Inequity aversion improves cooperation in intertemporal social dilemmas
Inequity aversion improves cooperation in intertemporal social dilemmas
Edward Hughes
Joel Z Leibo
Matthew Phillips
K. Tuyls
Edgar A. Duénez-Guzmán
...
Tina Zhu
Kevin R. McKee
Raphael Köster
H. Roff
T. Graepel
403
254
0
23 Mar 2018
Time Limits in Reinforcement Learning
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
358
184
0
01 Dec 2017
Learning with Opponent-Learning Awareness
Learning with Opponent-Learning Awareness
Jakob N. Foerster
Richard Y. Chen
Maruan Al-Shedivat
Shimon Whiteson
Pieter Abbeel
Igor Mordatch
571
595
0
13 Sep 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
1.5K
26,647
0
20 Jul 2017
Maintaining cooperation in complex social dilemmas using deep
  reinforcement learning
Maintaining cooperation in complex social dilemmas using deep reinforcement learning
Adam Lerer
A. Peysakhovich
459
170
0
04 Jul 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
573
1,272
0
16 Jun 2017
Counterfactual Multi-Agent Policy Gradients
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
929
2,486
0
24 May 2017
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Multi-agent Reinforcement Learning in Sequential Social DilemmasAdaptive Agents and Multi-Agent Systems (AAMAS), 2017
Joel Z Leibo
V. Zambaldi
Marc Lanctot
J. Marecki
T. Graepel
465
682
0
10 Feb 2017
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
898
9,865
0
04 Feb 2016
Prioritized Experience Replay
Prioritized Experience Replay
Tom Schaul
John Quan
Ioannis Antonoglou
David Silver
OffRL
777
4,322
0
18 Nov 2015
1
Page 1 of 1