Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

3 June 2024

John L. Zhou

Weizhe Hong

Jonathan C. Kao

Papers citing "Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents"

The Benefits of Power Regularization in Cooperative Reinforcement Learning

Michelle Li

Michael Dennis

310

17 Jun 2024

LOQA: Learning with Opponent Q-Learning Awareness

246

02 May 2024

Aligning Individual and Collective Objectives in Multi-Agent Cooperation

Yang Li

Wenhao Zhang

Jianhong Wang

Shao Zhang

Yali Du

Ying Wen

Wei Pan

219

19 Feb 2024

Scaling Opponent Shaping to High Dimensional Games

372

19 Dec 2023

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction

International Conference on Learning Representations (ICLR), 2023

499

13 Oct 2023

Proximal Learning With Opponent-Learning Awareness

Neural Information Processing Systems (NeurIPS), 2022

342

18 Oct 2022

The emergence of division of labor through decentralized social sanctioning

351

10 Aug 2022

Model-Free Opponent Shaping

International Conference on Machine Learning (ICML), 2022

Chris Xiaoxuan Lu

Timon Willi

Christian Schroeder de Witt

Jakob N. Foerster

415

03 May 2022

COLA: Consistent Learning with Opponent-Learning Awareness

International Conference on Machine Learning (ICML), 2022

350

08 Mar 2022

Model-Based Opponent Modeling

336

04 Aug 2021

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

International Conference on Machine Learning (ICML), 2020

646

31 Oct 2020

Learning to Incentivize Other Learning Agents

Neural Information Processing Systems (NeurIPS), 2020

416

10 Jun 2020

Influence-Based Multi-Agent Exploration

International Conference on Learning Representations (ICLR), 2019

Tonghan Wang

Jianhao Wang

Yi Wu

Chongjie Zhang

229

153

12 Oct 2019

Towards Empathic Deep Q-Learning

176

26 Jun 2019

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Tabish Rashid

Mikayel Samvelyan

Christian Schroeder de Witt

Gregory Farquhar

Jakob N. Foerster

Shimon Whiteson

775

1,952

30 Mar 2018

Inequity aversion improves cooperation in intertemporal social dilemmas

Edgar A. Duéñez-Guzmán

...

403

254

23 Mar 2018

Time Limits in Reinforcement Learning

358

184

01 Dec 2017

Learning with Opponent-Learning Awareness

Pieter Abbeel

571

595

13 Sep 2017

Proximal Policy Optimization Algorithms

1.5K

26,647

20 Jul 2017

Maintaining cooperation in complex social dilemmas using deep reinforcement learning

Adam Lerer

A. Peysakhovich

459

170

04 Jul 2017

Value-Decomposition Networks For Cooperative Multi-Agent Learning

P. Sunehag

Guy Lever

A. Gruslys

Wojciech M. Czarnecki

...

573

1,272

16 Jun 2017

Counterfactual Multi-Agent Policy Gradients

Jakob N. Foerster

Gregory Farquhar

Triantafyllos Afouras

Nantas Nardelli

Shimon Whiteson

929

2,486

24 May 2017

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

Adaptive Agents and Multi-Agent Systems (AAMAS), 2017

465

682

10 Feb 2017

Asynchronous Methods for Deep Reinforcement Learning

Volodymyr Mnih

Adria Puigdomenech Badia

David Silver

898

9,865

04 Feb 2016

Prioritized Experience Replay

David Silver

777

4,322

18 Nov 2015