ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.07528
  4. Cited By
Emergent Tool Use From Multi-Agent Autocurricula
v1v2 (latest)

Emergent Tool Use From Multi-Agent Autocurricula

International Conference on Learning Representations (ICLR), 2019
17 September 2019
Bowen Baker
I. Kanitscheider
Todor Markov
Yi Wu
Glenn Powell
Bob McGrew
Igor Mordatch
    LRM
ArXiv (abs)PDFHTML

Papers citing "Emergent Tool Use From Multi-Agent Autocurricula"

50 / 379 papers shown
Stay Focused: Problem Drift in Multi-Agent Debate
Stay Focused: Problem Drift in Multi-Agent Debate
Jonas Becker
Lars Benedikt Kaesberg
Andreas Stephan
Jan Philip Wahle
Terry Ruas
Bela Gipp
LRM
600
10
0
10 Apr 2026
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia
Chandler Smith
Marwa Abdulhai
Manfred Diaz
Marko Tesic
Rakshit Trivedi
...
Dylan Hadfield-Menell
Natasha Jaques
Tim Baarslag
José Hernández-Orallo
Joel Z Leibo
LLMAG
197
7
0
03 Dec 2025
Emergent Coordination and Phase Structure in Independent Multi-Agent Reinforcement Learning
Emergent Coordination and Phase Structure in Independent Multi-Agent Reinforcement Learning
Azusa Yamaguchi
171
0
0
28 Nov 2025
Adversarial Training for Process Reward Models
Adversarial Training for Process Reward Models
Gurusha Juneja
Deepak Nathani
William Yang Wang
LRM
212
0
0
28 Nov 2025
A Negotiation-Based Multi-Agent Reinforcement Learning Approach for Dynamic Scheduling of Reconfigurable Manufacturing Systems
A Negotiation-Based Multi-Agent Reinforcement Learning Approach for Dynamic Scheduling of Reconfigurable Manufacturing SystemsNASA Formal Methods (NFM), 2025
Manonmani Sekar
Nasim Nezamoddini
88
0
0
11 Nov 2025
SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Returaj Burnwal
Nirav P. Bhatt
Balaraman Ravindran
OffRL
440
0
0
11 Nov 2025
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Zhiyuan Zeng
Hamish Ivison
Yiping Wang
Lifan Yuan
Shuyue Stella Li
...
S. Du
Natasha Jaques
Hao Peng
Pang Wei Koh
Hannaneh Hajishirzi
OffRLLRM
188
17
0
10 Nov 2025
OpenSIR: Open-Ended Self-Improving Reasoner
OpenSIR: Open-Ended Self-Improving Reasoner
Wai-Chung Kwan
Joshua Ong Jun Leang
Pavlos Vougiouklis
Jeff Z. Pan
Marco Valentino
Pasquale Minervini
LRMReLM
296
3
0
01 Nov 2025
X-Ego: Acquiring Team-Level Tactical Situational Awareness via Cross-Egocentric Contrastive Video Representation Learning
X-Ego: Acquiring Team-Level Tactical Situational Awareness via Cross-Egocentric Contrastive Video Representation Learning
Yunzhe Wang
Soham Hans
Volkan Ustun
EgoV
245
1
0
22 Oct 2025
Heterogeneous Adversarial Play in Interactive Environments
Heterogeneous Adversarial Play in Interactive Environments
Manjie Xu
Xinyi Yang
Jiayu Zhan
Wei Liang
Chi Zhang
Yixin Zhu
202
1
0
21 Oct 2025
The Emergence of Complex Behavior in Large-Scale Ecological Environments
The Emergence of Complex Behavior in Large-Scale Ecological Environments
Joseph Bejjani
Chase Van Amburg
Chengrui Wang
Chloe Huangyuan Su
Sarah M Pratt
Yasin Mazloumi
Naeem Khoshnevis
Sham Kakade
Kianté Brantley
Aaron Walsman
258
0
0
21 Oct 2025
Combining Reinforcement Learning and Behavior Trees for NPCs in Video Games with AMD Schola
Combining Reinforcement Learning and Behavior Trees for NPCs in Video Games with AMD Schola
Tian Liu
Alex Cann
Ian Colbert
Mehdi Saeedi
OffRL
101
0
0
15 Oct 2025
Inclusive Fitness as a Key Step Towards More Advanced Social Behaviors in Multi-Agent Reinforcement Learning Settings
Inclusive Fitness as a Key Step Towards More Advanced Social Behaviors in Multi-Agent Reinforcement Learning Settings
Andries Rosseau
Raphael Avalos
Ann Nowé
113
0
0
14 Oct 2025
TROLL: Trust Regions improve Reinforcement Learning for Large Language Models
TROLL: Trust Regions improve Reinforcement Learning for Large Language Models
P. Becker
Niklas Freymuth
Serge Thilges
Fabian Otto
Gerhard Neumann
138
3
0
04 Oct 2025
LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits
LegalSim: Multi-Agent Simulation of Legal Systems for Discovering Procedural Exploits
Sanket Badhe
AILaw
179
1
0
03 Oct 2025
Fidelity-Aware Data Composition for Robust Robot Generalization
Fidelity-Aware Data Composition for Robust Robot Generalization
Zizhao Tong
Di Chen
Sicheng Hu
Hongwei Fan
Liliang Chen
Maoqing Yao
Hao Tang
Hao Dong
Ling Shao
168
1
0
29 Sep 2025
A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab
A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab
Isaac Peterson
Christopher Allred
Jacob Morrey
Mario Harper
190
0
0
26 Sep 2025
Collaborative Loco-Manipulation for Pick-and-Place Tasks with Dynamic Reward Curriculum
Collaborative Loco-Manipulation for Pick-and-Place Tasks with Dynamic Reward Curriculum
Tianxu An
Flavio De Vincenti
Yuntao Ma
Marco Hutter
Stelian Coros
220
3
0
16 Sep 2025
A Data-Driven Discretized CS:GO Simulation Environment to Facilitate Strategic Multi-Agent Planning Research
A Data-Driven Discretized CS:GO Simulation Environment to Facilitate Strategic Multi-Agent Planning Research
Yunzhe Wang
Volkan Ustun
Chris McGroarty
142
1
0
08 Sep 2025
Legal Zero-Days: A Novel Risk Vector for Advanced AI Systems
Legal Zero-Days: A Novel Risk Vector for Advanced AI Systems
Greg Sadler
Nathan Sherburn
AILaw
80
1
0
12 Aug 2025
Successor Features for Transfer in Alternating Markov Games
Successor Features for Transfer in Alternating Markov Games
Sunny Amatya
Yi Ren
Z. Xu
Wenlong Zhang
205
0
0
29 Jul 2025
Emergent interactions lead to collective frustration in robotic matter
Emergent interactions lead to collective frustration in robotic matter
Onurcan Bektas
Adolfo Alsina
Steffen Rulands
181
1
0
29 Jul 2025
AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction
AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction
Syeda Kisaa Fatima
Tehreem Zubair
Noman Ahmed
Asifullah Khan
LLMAGSyDa
365
2
0
13 Jun 2025
Uncertainty Prioritized Experience Replay
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
258
2
0
10 Jun 2025
Leveraging Reward Models for Guiding Code Review Comment Generation
Leveraging Reward Models for Guiding Code Review Comment Generation
Oussama Ben Sghaier
Rosalia Tufano
Gabriele Bavota
Houari Sahraoui
215
2
0
04 Jun 2025
Learned Controllers for Agile Quadrotors in Pursuit-Evasion Games
Learned Controllers for Agile Quadrotors in Pursuit-Evasion Games
Alejandro Sanchez Roncero
Olov Andersson
Olov Andersson
Petter Ogren
221
0
0
03 Jun 2025
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
Caroline Wang
Arrasy Rahman
Jiaxun Cui
Yoonchang Sung
Peter Stone
444
4
0
29 May 2025
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors
Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable BehaviorsAdaptive Agents and Multi-Agent Systems (AAMAS), 2025
Lang Feng
Jiahao Lin
Dong Xing
Li Zhang
De Ma
Gang Pan
292
0
0
16 May 2025
Interpretable Risk Mitigation in LLM Agent Systems
Interpretable Risk Mitigation in LLM Agent Systems
Jan Chojnacki
LLMAG
506
4
0
15 May 2025
Reciprocity as the Foundational Substrate of Society: How Reciprocal Dynamics Scale into Social Systems
Reciprocity as the Foundational Substrate of Society: How Reciprocal Dynamics Scale into Social Systems
Egil Diau
247
0
0
13 May 2025
Adversarial Coevolutionary Illumination with Generational Adversarial MAP-Elites
Adversarial Coevolutionary Illumination with Generational Adversarial MAP-Elites
Timothée Anne
Noah Syrkis
Meriem Elhosni
Florian Turati
Franck Legendre
Alain Jaquier
Sebastian Risi
328
0
0
10 May 2025
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task Evolution
CCL: Collaborative Curriculum Learning for Sparse-Reward Multi-Agent Reinforcement Learning via Co-evolutionary Task EvolutionInternational Conference on Intelligent Computing (ICIC), 2025
Yufei Lin
Chengwei Ye
Ning Yang
Kangsheng Wang
Linuo Xu
Shuyan Liu
Zeyu Zhang
311
3
0
08 May 2025
Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents
Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents
Christian Schroeder de Witt
AAMLAI4CE
1.2K
50
0
04 May 2025
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey
Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A SurveyArtificial Intelligence Review (AIR), 2025
Mohamad Abdul Hady
Siyi Hu
Mahardhika Pratama
Jimmy Cao
Ryszard Kowalczyk
287
39
0
29 Apr 2025
An Efficient Approach for Cooperative Multi-Agent Learning Problems
An Efficient Approach for Cooperative Multi-Agent Learning ProblemsIEEE International Conference on Tools with Artificial Intelligence (ICTAI), 2024
Ángel Aso-Mollar
Eva Onaindia
206
0
0
07 Apr 2025
Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review
Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review
Edward Gu
H. Siu
Melanie Platt
Isabelle Hurley
Jaime D. Peña
Rohan R. Paleja
238
2
0
25 Mar 2025
Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation
Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation
Bowen Baker
Joost Huizinga
Leo Gao
Zehao Dou
M. Guan
Aleksander Mądry
Wojciech Zaremba
J. Pachocki
David Farhi
LRM
555
190
0
14 Mar 2025
ForceGrip: Reference-Free Curriculum Learning for Realistic Grip Force Control in VR Hand Manipulation
ForceGrip: Reference-Free Curriculum Learning for Realistic Grip Force Control in VR Hand Manipulation
DongHeun Han
Byungmin Kim
RoUn Lee
KyeongMin Kim
Hyoseok Hwang
HyeongYeop Kang
676
0
0
11 Mar 2025
Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation
Multi-Robot Collaboration through Reinforcement Learning and Abstract SimulationIEEE International Conference on Robotics and Automation (ICRA), 2025
Adam Labiosa
Josiah P. Hanna
243
1
0
07 Mar 2025
Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning
Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement LearningIEEE Internet of Things Journal (IEEE IoT J.), 2024
Ahmed Alagha
Jamal Bentahar
Hadi Otrok
Shakti Singh
R. Mizouni
320
3
0
19 Jan 2025
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A SurveyACM Computing Surveys (ACM CSUR), 2024
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
336
20
0
08 Nov 2024
Eurekaverse: Environment Curriculum Generation via Large Language Models
Eurekaverse: Environment Curriculum Generation via Large Language ModelsConference on Robot Learning (CoRL), 2024
William Liang
Sam Wang
Hung-Ju Wang
Osbert Bastani
Dinesh Jayaraman
Yecheng Jason Ma
SyDa
400
5
0
04 Nov 2024
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence ChallengeNeural Information Processing Systems (NeurIPS), 2024
Weihua Du
Qiushi Lyu
Jiaming Shan
Zhenting Qi
Hongxin Zhang
...
Andi Peng
Tianmin Shu
Kwonjoon Lee
Behzad Dariush
Chuang Gan
604
10
0
04 Nov 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret,
  Fundamental Barriers, and Efficient Algorithms
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient AlgorithmsNeural Information Processing Systems (NeurIPS), 2024
Thanh Nguyen-Tang
Raman Arora
441
1
0
01 Nov 2024
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential ExplorationAdaptive Agents and Multi-Agent Systems (AAMAS), 2024
Hai Zhong
Xun Wang
Zhuoran Li
Longbo Huang
OffRLOnRL
266
2
0
25 Oct 2024
Optima: Optimizing Effectiveness and Efficiency for LLM-Based
  Multi-Agent System
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent SystemAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Weize Chen
Qixin Xu
Chen Qian
Cheng Yang
Zhiyuan Liu
Maosong Sun
LLMAG
343
34
0
10 Oct 2024
RL, but don't do anything I wouldn't do
RL, but don't do anything I wouldn't doConference on Uncertainty in Artificial Intelligence (UAI), 2024
Michael K. Cohen
Marcus Hutter
Yoshua Bengio
Stuart J. Russell
OffRL
204
2
0
08 Oct 2024
Training Interactive Agent in Large FPS Game Map with Rule-enhanced
  Reinforcement Learning
Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning
Chen Zhang
Huan Hu
Yuan Zhou
Qiyang Cao
Ruochen Liu
Wenya Wei
Elvis S. Liu
AI4CE
290
5
0
07 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient
  Multi-Agent Exploration
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
399
12
0
03 Oct 2024
Enabling Multi-Robot Collaboration from Single-Human Guidance
Enabling Multi-Robot Collaboration from Single-Human GuidanceIEEE International Conference on Robotics and Automation (ICRA), 2024
Zhengran Ji
Lingyu Zhang
Paul Sajda
Boyuan Chen
309
4
0
30 Sep 2024
12345678
Next
Page 1 of 8