ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.01656
21
0

Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations

3 August 2024
Sasan Mahmoudinazlou
Abhay Sobhanan
Hadi Charkhgard
A. Eshragh
George Dunn
ArXivPDFHTML
Abstract

Order picking is a pivotal operation in warehouses that directly impacts overall efficiency and profitability. This study addresses the dynamic order picking problem, a significant concern in modern warehouse management, where real-time adaptation to fluctuating order arrivals and efficient picker routing are crucial. Traditional methods, which often depend on static optimization algorithms designed around fixed order sets for the picker routing, fall short in addressing the challenges of this dynamic environment. To overcome these challenges, we propose a Deep Reinforcement Learning (DRL) framework tailored for single-block warehouses equipped with an autonomous picking device. By dynamically optimizing picker routes, our approach significantly reduces order throughput times and unfulfilled orders, particularly under high order arrival rates. We benchmark our DRL model against established algorithms, utilizing instances generated based on standard practices in the order picking literature. Experimental results demonstrate the superiority of our DRL model over benchmark algorithms. For example, at a high order arrival rate of 0.09 (i.e., 9 orders per 100 units of time on average), our approach achieves an order fulfillment rate of approximately 98%, compared to the 82% fulfillment rate observed with benchmarking algorithms. We further investigate the integration of a hyperparameter in the reward function that allows for flexible balancing between distance traveled and order completion time. Finally, we demonstrate the robustness of our DRL model on out-of-sample test instances.

View on arXiv
@article{mahmoudinazlou2025_2408.01656,
  title={ Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations },
  author={ Sasan Mahmoudinazlou and Abhay Sobhanan and Hadi Charkhgard and Ali Eshragh and George Dunn },
  journal={arXiv preprint arXiv:2408.01656},
  year={ 2025 }
}
Comments on this paper