Learning with Opponent-Learning Awareness


13 September 2017
Jakob N. Foerster
Richard Y. Chen
Maruan Al-Shedivat
Shimon Whiteson
Pieter Abbeel
Igor Mordatch

Papers citing "Learning with Opponent-Learning Awareness"

48 / 98 papers shown
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
33
8
0
13 Dec 2021
Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization
Zhenghao Peng
Quanyi Li
Ka-Ming Hui
Chunxiao Liu
Bolei Zhou
31
58
0
26 Oct 2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games
Roy Fox
Stephen Marcus McAleer
W. Overman
Ioannis Panageas
24
49
0
20 Oct 2021
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents
Shivansh Patel
Saim Wani
Unnat Jain
A. Schwing
Svetlana Lazebnik
Manolis Savva
Angel X. Chang
LM&Ro
24
25
0
12 Oct 2021
Influencing Towards Stable Multi-Agent Interactions
Woodrow Z. Wang
Andy Shih
Annie Xie
Dorsa Sadigh
38
35
0
05 Oct 2021
Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams
Erdem Biyik
Anusha Lalitha
R. Saha
Andrea J. Goldsmith
Dorsa Sadigh
15
4
0
02 Oct 2021
Emergence of Theory of Mind Collaboration in Multiagent Systems
Luyao Yuan
Zipeng Fu
Linqi Zhou
Kexin Yang
Song-Chun Zhu
46
10
0
30 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
32
38
0
25 Sep 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation
Chuangchuang Sun
Dong-Ki Kim
Jonathan P. How
AAML
31
18
0
14 Sep 2021
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games
B. Hambly
Renyuan Xu
Huining Yang
13
25
0
27 Jul 2021
Social Coordination and Altruism in Autonomous Driving
Behrad Toghi
Rodolfo Valiente
Dorsa Sadigh
Ramtin Pedarsani
Y. P. Fallah
18
66
0
01 Jul 2021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
Ying Wen
Hui Chen
Yaodong Yang
Zheng Tian
Minne Li
Xu Chen
Jun Wang
28
11
0
12 Jun 2021
Gradient play in stochastic games: stationary points, convergence, and sample complexity
Runyu Zhang
Zhaolin Ren
Na Li
20
43
0
01 Jun 2021
Who/What is My Teammate? Team Composition Considerations in Human-AI Teaming
Nathan J. Mcneese
Beau G. Schelble
L. Canonico
Mustafa Demir
108
48
0
23 May 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
19
35
0
07 May 2021
Deep Interpretable Models of Theory of Mind
Ini Oguntola
Dana Hughes
Katia P. Sycara
HAI
25
23
0
07 Apr 2021
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z. Leibo
Kate Larson
T. Graepel
21
199
0
15 Dec 2020
Learning in two-player games between transparent opponents
A. Hutter
15
5
0
04 Dec 2020
Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games
Roxana Rădulescu
T. Verstraeten
Yijie Zhang
Patrick Mannion
D. Roijers
A. Nowé
20
14
0
14 Nov 2020
Learning Latent Representations to Influence Multi-Agent Interaction
Annie Xie
Dylan P. Losey
R. Tolsma
Chelsea Finn
Dorsa Sadigh
DRL
13
132
0
12 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Bowen Baker
LRM
13
33
0
10 Nov 2020
Learning to Play against Any Mixture of Opponents
Max O. Smith
Thomas W. Anthony
Yongzhao Wang
Michael P. Wellman
OffRL
17
9
0
29 Sep 2020
Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation
Yue Guan
Qifan Zhang
Panagiotis Tsiotras
4
7
0
01 Sep 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
29
19
0
14 Aug 2020
Reinforcement Communication Learning in Different Social Network Structures
M. Dubova
A. Moskvichev
Robert L. Goldstone
GNN
11
9
0
19 Jul 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Thomas W. Anthony
Tom Eccles
Andrea Tacchetti
János Kramár
I. Gemp
...
Richard Everett
Roman Werpachowski
Satinder Singh
T. Graepel
Yoram Bachrach
11
42
0
08 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
22
50
0
30 May 2020
On the Impossibility of Global Convergence in Multi-Loss Optimization
Alistair Letcher
11
32
0
26 May 2020
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
11
65
0
17 May 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
23
131
0
28 Apr 2020
Interactive AI with a Theory of Mind
M. Çelikok
Tomi Peltola
Pedram Daee
Samuel Kaski
20
19
0
01 Dec 2019
Towards Deployment of Robust AI Agents for Human-Machine Partnerships
Ahana Ghosh
Sebastian Tschiatschek
Hamed Mahdavi
Adish Singla
21
9
0
05 Oct 2019
The Differentiable Cross-Entropy Method
Brandon Amos
Denis Yarats
21
54
0
27 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
25
57
0
04 Sep 2019
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Yuhang Song
Andrzej Wojcicki
Thomas Lukasiewicz
Jianyi Wang
Abi Aryan
Zhenghua Xu
Mai Xu
Zihan Ding
Lianlong Wu
AI4CE
ELM
17
33
0
17 May 2019
Differentiable Game Mechanics
Alistair Letcher
David Balduzzi
S. Racanière
James Martens
Jakob N. Foerster
K. Tuyls
T. Graepel
29
79
0
13 May 2019
How Shall I Drive? Interaction Modeling and Motion Planning towards Empathetic and Socially-Graceful Driving
Yi Ren
Steven Elliott
Yiwei Wang
Yezhou Yang
Wenlong Zhang
17
12
0
28 Jan 2019
Learning to Collaborate in Markov Decision Processes
Goran Radanović
R. Devidze
David C. Parkes
Adish Singla
27
33
0
23 Jan 2019
Evolving intrinsic motivations for altruistic behavior
Jane X. Wang
Edward Hughes
Chrisantha Fernando
Wojciech M. Czarnecki
Edgar A. Duénez-Guzmán
Joel Z. Leibo
19
76
0
14 Nov 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
27
549
0
12 Oct 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravcík
Rudolf Kadlec
Michael H. Bowling
16
64
0
09 Sep 2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Max Jaderberg
Wojciech M. Czarnecki
Iain Dunning
Luke Marris
Guy Lever
...
Joel Z. Leibo
David Silver
Demis Hassabis
Koray Kavukcuoglu
T. Graepel
OffRL
21
713
0
03 Jul 2018
Adaptive Mechanism Design: Learning to Promote Cooperation
T. Baumann
T. Graepel
John Shawe-Taylor
14
26
0
11 Jun 2018
Emergent Communication through Negotiation
Kris Cao
Angeliki Lazaridou
Marc Lanctot
Joel Z. Leibo
K. Tuyls
S. Clark
16
153
0
11 Apr 2018
Inequity aversion improves cooperation in intertemporal social dilemmas
Edward Hughes
Joel Z. Leibo
Matthew Phillips
K. Tuyls
Edgar A. Duénez-Guzmán
...
Tina Zhu
Kevin R. McKee
Raphael Köster
H. Roff
T. Graepel
19
204
0
23 Mar 2018
The Mechanics of n-Player Differentiable Games
David Balduzzi
S. Racanière
James Martens
Jakob N. Foerster
K. Tuyls
T. Graepel
MLT
16
273
0
15 Feb 2018
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
Xingyu Wang
Diego Klabjan
16
39
0
07 Jan 2018
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Jakob N. Foerster
Nantas Nardelli
Gregory Farquhar
Triantafyllos Afouras
Philip H. S. Torr
Pushmeet Kohli
Shimon Whiteson
OffRL
109
595
0
28 Feb 2017