ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2508.08574
128
1
v1v2 (latest)

DeepFleet: Multi-Agent Foundation Models for Mobile Robots

12 August 2025
Ameya Agaskar
Sriram Siva
William Pickering
Kyle O'Brien
Charles Kekeh
Ang Li
Brianna Gallo Sarker
Alicia Chua
Mayur Nemade
Charun Thattai
Jiaming Di
Isaac Iyengar
Ramya Dharoor
Dino Kirouani
Jimmy Erskine
Tamir Hegazy
Scott Niekum
Usman A. Khan
Federico Pecora
Joseph W. Durham
ArXiv (abs)PDFHTML
Main:20 Pages
6 Figures
Bibliography:6 Pages
2 Tables
Appendix:1 Pages
Abstract

We introduce DeepFleet, a suite of foundation models designed to support coordination and planning for large-scale mobile robot fleets. These models are trained on fleet movement data, including robot positions, goals, and interactions, from hundreds of thousands of robots in Amazon warehouses worldwide. DeepFleet consists of four architectures that each embody a distinct inductive bias and collectively explore key points in the design space for multi-agent foundation models: the robot-centric (RC) model is an autoregressive decision transformer operating on neighborhoods of individual robots; the robot-floor (RF) model uses a transformer with cross-attention between robots and the warehouse floor; the image-floor (IF) model applies convolutional encoding to a multi-channel image representation of the full fleet; and the graph-floor (GF) model combines temporal attention with graph neural networks for spatial relationships. In this paper, we describe these models and present our evaluation of the impact of these design choices on prediction task performance. We find that the robot-centric and graph-floor models, which both use asynchronous robot state updates and incorporate the localized structure of robot interactions, show the most promise. We also present experiments that show that these two models can make effective use of larger warehouses operation datasets as the models are scaled up.

View on arXiv
Comments on this paper