LOQA: Learning with Opponent Q-Learning Awareness

2 May 2024

Papers citing "LOQA: Learning with Opponent Q-Learning Awareness"

3 / 3 papers shown

Title
Multi-agent cooperation through learning-aware policy gradients Alexander Meulemans Seijin Kobayashi J. Oswald Nino Scherrer Eric Elmoznino Blake A. Richards Guillaume Lajoie Blaise Agüera y Arcas João Sacramento 39 0 0 24 Oct 2024
Advantage Alignment Algorithms Juan Agustin Duque Milad Aghajohari Tim Cooijmans Tianyu Zhang Aaron C. Courville Gauthier Gidel Aaron Courville 18 0 0 20 Jun 2024
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents John L. Zhou Weizhe Hong Jonathan C. Kao 28 0 0 03 Jun 2024