Learning with Opponent-Learning Awareness

13 September 2017

Pieter Abbeel

Papers citing "Learning with Opponent-Learning Awareness"

48 / 98 papers shown

Title | Authors | Tags | (three unlabeled counts) | Date
Continual Learning In Environments With Polynomial Mixing Times | Matthew D Riemer, Sharath Chandra Raparthy, Ignacio Cases, G. Subbaraj, M. P. Touzel, Irina Rish | CLL | 33 8 0 | 13 Dec 2021
Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization | Zhenghao Peng, Quanyi Li, Ka-Ming Hui, Chunxiao Liu, Bolei Zhou | | 31 58 0 | 26 Oct 2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games | Roy Fox, Stephen Marcus McAleer, W. Overman, Ioannis Panageas | | 24 49 0 | 20 Oct 2021
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents | Shivansh Patel, Saim Wani, Unnat Jain, A. Schwing, Svetlana Lazebnik, Manolis Savva, Angel X. Chang | LM&Ro | 24 25 0 | 12 Oct 2021
Influencing Towards Stable Multi-Agent Interactions | Woodrow Z. Wang, Andy Shih, Annie Xie, Dorsa Sadigh | | 38 35 0 | 05 Oct 2021
Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams | Erdem Biyik, Anusha Lalitha, R. Saha, Andrea J. Goldsmith, Dorsa Sadigh | | 15 4 0 | 02 Oct 2021
Emergence of Theory of Mind Collaboration in Multiagent Systems | Luyao Yuan, Zipeng Fu, Linqi Zhou, Kexin Yang, Song-Chun Zhu | | 46 10 0 | 30 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms | Liyuan Zheng, Tanner Fiez, Zane Alumbaugh, Benjamin J. Chasnov, Lillian J. Ratliff | OffRL | 32 38 0 | 25 Sep 2021
ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation | Chuangchuang Sun, Dong-Ki Kim, Jonathan P. How | AAML | 31 18 0 | 14 Sep 2021
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games | B. Hambly, Renyuan Xu, Huining Yang | | 13 25 0 | 27 Jul 2021
Social Coordination and Altruism in Autonomous Driving | Behrad Toghi, Rodolfo Valiente, Dorsa Sadigh, Ramtin Pedarsani, Y. P. Fallah | | 18 66 0 | 01 Jul 2021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang | | 28 11 0 | 12 Jun 2021
Gradient play in stochastic games: stationary points, convergence, and sample complexity | Runyu Zhang, Zhaolin Ren, Na Li | | 20 43 0 | 01 Jun 2021
Who/What is My Teammate? Team Composition Considerations in Human-AI Teaming | Nathan J. Mcneese, Beau G. Schelble, L. Canonico, Mustafa Demir | | 108 48 0 | 23 May 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts | Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou | | 19 35 0 | 07 May 2021
Deep Interpretable Models of Theory of Mind | Ini Oguntola, Dana Hughes, Katia P. Sycara | HAI | 25 23 0 | 07 Apr 2021
Open Problems in Cooperative AI | Allan Dafoe, Edward Hughes, Yoram Bachrach, Tantum Collins, Kevin R. McKee, Joel Z. Leibo, Kate Larson, T. Graepel | | 21 199 0 | 15 Dec 2020
Learning in two-player games between transparent opponents | A. Hutter | | 15 5 0 | 04 Dec 2020
Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games | Roxana Rădulescu, T. Verstraeten, Yijie Zhang, Patrick Mannion, D. Roijers, A. Nowé | | 20 14 0 | 14 Nov 2020
Learning Latent Representations to Influence Multi-Agent Interaction | Annie Xie, Dylan P. Losey, R. Tolsma, Chelsea Finn, Dorsa Sadigh | DRL | 13 132 0 | 12 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences | Bowen Baker | LRM | 13 33 0 | 10 Nov 2020
Learning to Play against Any Mixture of Opponents | Max O. Smith, Thomas W. Anthony, Yongzhao Wang, Michael P. Wellman | OffRL | 17 9 0 | 29 Sep 2020
Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation | Yue Guan, Qifan Zhang, Panagiotis Tsiotras | | 4 7 0 | 01 Sep 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information | Yuandong Tian, Qucheng Gong, Tina Jiang | | 29 19 0 | 14 Aug 2020
Reinforcement Communication Learning in Different Social Network Structures | M. Dubova, A. Moskvichev, Robert L. Goldstone | GNN | 11 9 0 | 19 Jul 2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Thomas W. Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, I. Gemp, ..., Richard Everett, Roman Werpachowski, Satinder Singh, T. Graepel, Yoram Bachrach | | 11 42 0 | 08 Jun 2020
AI Research Considerations for Human Existential Safety (ARCHES) | Andrew Critch, David M. Krueger | | 22 50 0 | 30 May 2020
On the Impossibility of Global Convergence in Multi-Loss Optimization | Alistair Letcher | | 11 32 0 | 26 May 2020
Optimizing for the Future in Non-Stationary MDPs | Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas | OffRL | 11 65 0 | 17 May 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies | Stephan Zheng, Alexander R. Trott, Sunil Srinivasa, Nikhil Naik, Melvin Gruesbeck, David C. Parkes, R. Socher | | 23 131 0 | 28 Apr 2020
Interactive AI with a Theory of Mind | M. Çelikok, Tomi Peltola, Pedram Daee, Samuel Kaski | | 20 19 0 | 01 Dec 2019
Towards Deployment of Robust AI Agents for Human-Machine Partnerships | Ahana Ghosh, Sebastian Tschiatschek, Hamed Mahdavi, Adish Singla | | 21 9 0 | 05 Oct 2019
The Differentiable Cross-Entropy Method | Brandon Amos, Denis Yarats | | 21 54 0 | 27 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay | Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagné, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville | | 25 57 0 | 04 Sep 2019
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence | Yuhang Song, Andrzej Wojcicki, Thomas Lukasiewicz, Jianyi Wang, Abi Aryan, Zhenghua Xu, Mai Xu, Zihan Ding, Lianlong Wu | AI4CE, ELM | 17 33 0 | 17 May 2019
Differentiable Game Mechanics | Alistair Letcher, David Balduzzi, S. Racanière, James Martens, Jakob N. Foerster, K. Tuyls, T. Graepel | | 29 79 0 | 13 May 2019
How Shall I Drive? Interaction Modeling and Motion Planning towards Empathetic and Socially-Graceful Driving | Yi Ren, Steven Elliott, Yiwei Wang, Yezhou Yang, Wenlong Zhang | | 17 12 0 | 28 Jan 2019
Learning to Collaborate in Markov Decision Processes | Goran Radanović, R. Devidze, David C. Parkes, Adish Singla | | 27 33 0 | 23 Jan 2019
Evolving intrinsic motivations for altruistic behavior | Jane X. Wang, Edward Hughes, Chrisantha Fernando, Wojciech M. Czarnecki, Edgar A. Duénez-Guzmán, Joel Z. Leibo | | 19 76 0 | 14 Nov 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning | Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor | OffRL | 27 549 0 | 12 Oct 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines | Martin Schmid, Neil Burch, Marc Lanctot, Matej Moravcík, Rudolf Kadlec, Michael H. Bowling | | 16 64 0 | 09 Sep 2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning | Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, ..., Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, T. Graepel | OffRL | 21 713 0 | 03 Jul 2018
Adaptive Mechanism Design: Learning to Promote Cooperation | T. Baumann, T. Graepel, John Shawe-Taylor | | 14 26 0 | 11 Jun 2018
Emergent Communication through Negotiation | Kris Cao, Angeliki Lazaridou, Marc Lanctot, Joel Z. Leibo, K. Tuyls, S. Clark | | 16 153 0 | 11 Apr 2018
Inequity aversion improves cooperation in intertemporal social dilemmas | Edward Hughes, Joel Z. Leibo, Matthew Phillips, K. Tuyls, Edgar A. Duénez-Guzmán, ..., Tina Zhu, Kevin R. McKee, Raphael Köster, H. Roff, T. Graepel | | 19 204 0 | 23 Mar 2018
The Mechanics of n-Player Differentiable Games | David Balduzzi, S. Racanière, James Martens, Jakob N. Foerster, K. Tuyls, T. Graepel | MLT | 16 273 0 | 15 Feb 2018
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations | Xingyu Wang, Diego Klabjan | | 16 39 0 | 07 Jan 2018
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning | Jakob N. Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson | OffRL | 109 595 0 | 28 Feb 2017