ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.03270
25
23

Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

5 May 2023
Alexander Herzog
Kanishka Rao
Karol Hausman
Yao Lu
Paul Wohlhart
Mengyuan Yan
Jessica Lin
Montse Gonzalez Arenas
Ted Xiao
Daniel Kappler
Daniel Ho
Jarek Rettinghouse
Yevgen Chebotar
Kuang-Huei Lee
K. Gopalakrishnan
Ryan C. Julian
A. Li
Chuyuan Fu
Bo Wei
S. Ramesh
K. Holden
Kim Kleiven
David Rendleman
Sean Kirmani
Jeffrey Bingham
Jonathan Weisz
Ying Xu
Wenlong Lu
Matthew Bennice
Cody Fong
David Do
Jessica Lam
Yunfei Bai
Benjie Holson
Michael J. Quinlan
Noah Brown
Mrinal Kalakrishnan
Julian Ibarz
P. Pastor
Sergey Levine
    OffRL
ArXivPDFHTML
Abstract

We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL from real-world data with bootstrapping from training in simulation, and incorporates auxiliary inputs from existing computer vision systems as a way to boost generalization to novel objects, while retaining the benefits of end-to-end training. We analyze the tradeoffs of different design decisions in our system, and present a large-scale empirical validation that includes training on real-world data gathered over the course of 24 months of experimentation, across a fleet of 23 robots in three office buildings, with a total training set of 9527 hours of robotic experience. Our final validation also consists of 4800 evaluation trials across 240 waste station configurations, in order to evaluate in detail the impact of the design decisions in our system, the scaling effects of including more real-world data, and the performance of the method on novel objects. The projects website and videos can be found at \href{http://rl-at-scale.github.io}{rl-at-scale.github.io}.

View on arXiv
Comments on this paper