Emergent Tool Use From Multi-Agent Autocurricula

International Conference on Learning Representations (ICLR), 2019

17 September 2019

Papers citing "Emergent Tool Use From Multi-Agent Autocurricula"

50 / 379 papers shown

Stay Focused: Problem Drift in Multi-Agent Debate

Jonas Becker

Lars Benedikt Kaesberg

600

10 Apr 2026

Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia

...

Dylan Hadfield-Menell

Natasha Jaques

Tim Baarslag

José Hernández-Orallo

Joel Z Leibo

LLMAG

197

03 Dec 2025

Emergent Coordination and Phase Structure in Independent Multi-Agent Reinforcement Learning

Azusa Yamaguchi

171

28 Nov 2025

Adversarial Training for Process Reward Models

212

28 Nov 2025

A Negotiation-Based Multi-Agent Reinforcement Learning Approach for Dynamic Scheduling of Reconfigurable Manufacturing Systems

NASA Formal Methods (NFM), 2025

Manonmani Sekar

Nasim Nezamoddini

11 Nov 2025

SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories

440

11 Nov 2025

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

...

188

10 Nov 2025

OpenSIR: Open-Ended Self-Improving Reasoner

296

01 Nov 2025

X-Ego: Acquiring Team-Level Tactical Situational Awareness via Cross-Egocentric Contrastive Video Representation Learning

245

22 Oct 2025

Heterogeneous Adversarial Play in Interactive Environments

202

21 Oct 2025

The Emergence of Complex Behavior in Large-Scale Ecological Environments

258

21 Oct 2025

Combining Reinforcement Learning and Behavior Trees for NPCs in Video Games with AMD Schola

101

15 Oct 2025

Inclusive Fitness as a Key Step Towards More Advanced Social Behaviors in Multi-Agent Reinforcement Learning Settings

Andries Rosseau

Raphael Avalos

Ann Nowé

113

14 Oct 2025

TROLL: Trust Regions improve Reinforcement Learning for Large Language Models

138

04 Oct 2025

LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits

Sanket Badhe

AILaw

179

03 Oct 2025

Fidelity-Aware Data Composition for Robust Robot Generalization

168

29 Sep 2025

A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab

190

26 Sep 2025

Collaborative Loco-Manipulation for Pick-and-Place Tasks with Dynamic Reward Curriculum

220

16 Sep 2025

A Data-Driven Discretized CS:GO Simulation Environment to Facilitate Strategic Multi-Agent Planning Research

Yunzhe Wang

Volkan Ustun

Chris McGroarty

142

08 Sep 2025

Legal Zero-Days: A Novel Risk Vector for Advanced AI Systems

Greg Sadler

Nathan Sherburn

AILaw

12 Aug 2025

Successor Features for Transfer in Alternating Markov Games

205

29 Jul 2025

Emergent interactions lead to collective frustration in robotic matter

Onurcan Bektas

Adolfo Alsina

Steffen Rulands

181

29 Jul 2025

AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction

365

13 Jun 2025

Uncertainty Prioritized Experience Replay

Rodrigo Carrasco-Davis

Sebastian Lee

Claudia Clopath

Will Dabney

258

10 Jun 2025

Leveraging Reward Models for Guiding Code Review Comment Generation

215

04 Jun 2025

Learned Controllers for Agile Quadrotors in Pursuit-Evasion Games

Alejandro Sanchez Roncero

Olov Andersson

Petter Ogren

221

03 Jun 2025

ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork

444

29 May 2025

Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors

Adaptive Agents and Multi-Agent Systems (AAMAS), 2025

292

16 May 2025

Interpretable Risk Mitigation in LLM Agent Systems

Jan Chojnacki

LLMAG

506

15 May 2025

Reciprocity as the Foundational Substrate of Society: How Reciprocal Dynamics Scale into Social Systems

Egil Diau

247

13 May 2025

Adversarial Coevolutionary Illumination with Generational Adversarial MAP-Elites

328

10 May 2025

CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution

International Conference on Intelligent Computing (ICIC), 2025

311

08 May 2025

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Christian Schroeder de Witt

AAML AI4CE

1.2K

04 May 2025

Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey

Artificial Intelligence Review (AIR), 2025

287

29 Apr 2025

An Efficient Approach for Cooperative Multi-Agent Learning Problems

IEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2024

Ángel Aso-Mollar

Eva Onaindia

206

07 Apr 2025

Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review

238

25 Mar 2025

Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation

555

190

14 Mar 2025

ForceGrip: Reference-Free Curriculum Learning for Realistic Grip Force Control in VR Hand Manipulation

676

11 Mar 2025

Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation

IEEE International Conference on Robotics and Automation (ICRA), 2025

Adam Labiosa

Josiah P. Hanna

243

07 Mar 2025

Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning

IEEE Internet of Things Journal (IEEE IoT J.), 2024

320

19 Jan 2025

Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey

ACM Computing Surveys (ACM CSUR), 2024

336

08 Nov 2024

Eurekaverse: Environment Curriculum Generation via Large Language Models

Conference on Robot Learning (CoRL), 2024

400

04 Nov 2024

Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge

Neural Information Processing Systems (NeurIPS), 2024

...

604

04 Nov 2024

Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms

Neural Information Processing Systems (NeurIPS), 2024

Thanh Nguyen-Tang

Raman Arora

441

01 Nov 2024

Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration

Adaptive Agents and Multi-Agent Systems (AAMAS), 2024

266

25 Oct 2024

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Weize Chen

Qixin Xu

Chen Qian

Cheng Yang

Zhiyuan Liu

Maosong Sun

LLMAG

343

10 Oct 2024

RL, but don't do anything I wouldn't do

Conference on Uncertainty in Artificial Intelligence (UAI), 2024

Michael K. Cohen

204

08 Oct 2024

Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning

Ruochen Liu

Elvis S. Liu

290

07 Oct 2024

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Chang Liu

399

03 Oct 2024

Enabling Multi-Robot Collaboration from Single-Human Guidance

IEEE International Conference on Robotics and Automation (ICRA), 2024

309

30 Sep 2024