Emergent Tool Use From Multi-Agent Autocurricula

17 September 2019

Papers citing "Emergent Tool Use From Multi-Agent Autocurricula"

50 / 121 papers shown

Title
Large Language Models are Pretty Good Zero-Shot Video Game Bug Detectors Mohammad Reza Taesiri Finlay Macklon Yihe Wang Hengshuo Shen C. Bezemer ELM LLMAG MLLM 39 13 0 05 Oct 2022
Disentangling Transfer in Continual Reinforcement Learning Maciej Wołczyk Michal Zajkac Razvan Pascanu Lukasz Kuciñski Piotr Milo's CLL 62 27 0 28 Sep 2022
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members Daphne Cornelisse Thomas Rood Mateusz Malinowski Yoram Bachrach Tal Kachman 35 10 0 18 Aug 2022
Human Decision Makings on Curriculum Reinforcement Learning with Difficulty Adjustment Yilei Zeng Jiali Duan Y. Li Emilio Ferrara Lerrel Pinto Chloe Kuo S. Nikolaidis 38 3 0 04 Aug 2022
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games Zihan Ding DiJia Su Qinghua Liu Chi Jin 33 3 0 18 Jul 2022
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning Matteo Bettini Ryan Kortvelesy J. Blumenkamp Amanda Prorok 18 36 0 07 Jul 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning Wei Fu Chao Yu Zelai Xu Jiaqi Yang Yi Wu 34 32 0 15 Jun 2022
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks Josip Josifovski M. Malmir Noah Klarmann B. L. Žagar Nicolás Navarro-Guerrero Alois C. Knoll 24 17 0 13 Jun 2022
Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization Marco Pleines Matthias Pallasch F. Zimmer Mike Preuss 23 13 0 23 May 2022
Exploring the Benefits of Teams in Multiagent Learning David Radke Kate Larson Timothy B. Brecht AI4TS 27 10 0 04 May 2022
The Importance of Credo in Multiagent Learning David Radke Kate Larson Timothy B. Brecht 27 11 0 15 Apr 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization Zihan Zhou Wei Fu Bingliang Zhang Yi Wu 25 28 0 04 Apr 2022
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning Seyed Kamyar Seyed Ghasemipour Daniel Freeman Byron David S. Gu Satoshi Kataoka Igor Mordatch OffRL 27 25 0 15 Mar 2022
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits Qinghua Liu Yuanhao Wang Chi Jin AAML 24 15 0 14 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data Cultural General Intelligence Team Avishkar Bhoopchand Bethanie Brownfield Adrian Collister Agustin Dal Lago ... Alex Platonov Evan Senter Sukhdeep Singh Alexander Zacherl Lei M. Zhang VLM 40 11 0 01 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 34 9 0 23 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions Robert Meier Asier Mujika 37 7 0 16 Feb 2022
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models Alexander Pan Kush S. Bhatia Jacob Steinhardt 41 168 0 10 Jan 2022
Building Human-like Communicative Intelligence: A Grounded Perspective M. Dubova 24 12 0 02 Jan 2022
Sequential memory improves sample and memory efficiency in Episodic Control Ismael T. Freire A. F. Amil P. Verschure OffRL 11 3 0 29 Dec 2021
Collective Intelligence for Deep Learning: A Survey of Recent Developments David R Ha Yu Tang AI4CE 25 68 0 29 Nov 2021
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning Andrew Cohen Ervin Teng Vincent-Pierre Berges Ruo-Ping Dong Hunter Henry Marwan Mattar Alexander Zook Sujoy Ganguly 16 33 0 10 Nov 2021
Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization Zhenghao Peng Quanyi Li Ka-Ming Hui Chunxiao Liu Bolei Zhou 44 58 0 26 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task Chuang Gan Abhishek Bhandwaldar Antonio Torralba J. Tenenbaum Phillip Isola LRM 133 4 0 13 Oct 2021
Cooperative Assistance in Robotic Surgery through Multi-Agent Reinforcement Learning Paul Maria Scheikl B. Gyenes Tornike Davitashvili Rayan Younis A. Schulze Beat P. Müller-Stich Gerhard Neumann M. Wagner F. Mathis-Ullrich 19 12 0 10 Oct 2021
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? Ziang Song Song Mei Yu Bai 74 67 0 08 Oct 2021
SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots Alexander Schperberg Stephanie Tsuei Stefano Soatto Dennis W. Hong 17 10 0 03 Aug 2021
Open-Ended Learning Leads to Generally Capable Agents Open-Ended Learning Team Adam Stooke Anuj Mahajan Catarina Barros Charlie Deck ... Nicolas Porcel Roberta Raileanu Steph Hughes-Fitt Valentin Dalibard Wojciech M. Czarnecki 26 181 0 27 Jul 2021
Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number Francesco Borra Luca Biferale M. Cencini A. Celani 19 21 0 16 Jun 2021
TempoRL: Learning When to Act André Biedenkapp Raghunandan Rajan Frank Hutter Marius Lindauer OffRL 13 27 0 09 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning Oriol Corcoll Youssef Mohamed Raul Vicente 18 3 0 01 Jun 2021
On the Critical Role of Conventions in Adaptive Human-AI Collaboration Andy Shih Arjun Sawhney J. Kondic Stefano Ermon Dorsa Sadigh 36 37 0 07 Apr 2021
Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World Florian Laurent Manuel Schneider Christian Scheller J. Watson Jiaoyang Li ... Nilabha Bhattacharya Shivam Agarwal A. Egli Erik Nygren Sharada Mohanty 33 28 0 30 Mar 2021
Modelling Behavioural Diversity for Learning in Open-Ended Games Nicolas Perez Nieves Yaodong Yang Oliver Slumbers D. Mguni Ying Wen Jun Wang 22 67 0 14 Mar 2021
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach Shashi Suman Ali Etemad F. Rivest 24 15 0 26 Feb 2021
Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning Jianzhun Shao Hongchang Zhang Yuhang Jiang Shuncheng He Xiangyang Ji 29 5 0 24 Feb 2021
Open Problems in Cooperative AI Allan Dafoe Edward Hughes Yoram Bachrach Tantum Collins Kevin R. McKee Joel Z. Leibo Kate Larson T. Graepel 24 199 0 15 Dec 2020
Grounding Artificial Intelligence in the Origins of Human Behavior Eleni Nisioti Clément Moulin-Frier AI4CE 34 5 0 15 Dec 2020
An overview of 11 proposals for building safe advanced AI Evan Hubinger AAML 16 23 0 04 Dec 2020
Applied Machine Learning for Games: A Graduate School Course Yilei Zeng Aayush Shah Jameson Thai M. Zyda AI4CE 9 3 0 30 Nov 2020
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences Bowen Baker LRM 13 33 0 10 Nov 2020
Continual Learning of Control Primitives: Skill Discovery via Reset-Games Kelvin Xu Siddharth Verma Chelsea Finn Sergey Levine CLL 28 33 0 10 Nov 2020
Learning a Decentralized Multi-arm Motion Planner Huy Ha Jingxi Xu Shuran Song 21 51 0 05 Nov 2020
A Generative Model based Adversarial Security of Deep Learning and Linear Classifier Models Ferhat Ozgur Catak Samed Sivaslioglu Kevser Sahinbas AAML 21 7 0 17 Oct 2020
Decentralized Multi-Agent Pursuit using Deep Reinforcement Learning C. de Souza Rhys Newbury Akansel Cosgun P. Castillo B. Vidolov Dana Kulić 53 90 0 16 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning Ossama Ahmed Frederik Trauble Anirudh Goyal Alexander Neitz Yoshua Bengio Bernhard Schölkopf M. Wuthrich Stefan Bauer CML 27 120 0 08 Oct 2020
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play Qinghua Liu Tiancheng Yu Yu Bai Chi Jin 29 121 0 04 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks Tonghan Wang Tarun Gupta Anuj Mahajan Bei Peng Shimon Whiteson Chongjie Zhang OffRL 22 203 0 04 Oct 2020
Competing AI: How does competition feedback affect machine learning? Antonio A. Ginart Eva Zhang Yongchan Kwon James Y. Zou AAML 13 0 0 15 Sep 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information Yuandong Tian Qucheng Gong Tina Jiang 29 19 0 14 Aug 2020