A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses

Detailed description

Bibliographic details
Published in:	Transportation research. Part E, Logistics and transportation review, 2024-05, Vol.185, p.1-27, Article 103518
Main authors:	Li, Kunpeng; Liu, Tengbo; Ram Kumar, P.N.; Han, Xuefang
Format:	Article
Language:	English
Subjects:	Automated Guided Vehicles; Hyper-heuristic; Parts-to-picker picking system; Reinforcement learning; Task scheduling
Online access:	Full text

Description
Summary:
• This paper formulates the task assignment and route planning of multiple AGVs in an intelligent warehouse as a fixed-destination Multiple Depot Traveling Salesman Problem (MDTSP). Considering the layout of the automated warehouse and the driving characteristics of the AGVs, a MILP model is developed to minimize the total completion time. A series of valid inequalities are presented to strengthen the model by analyzing the characteristics of the problem.
• We develop a hyper-heuristic that uses a novel selection strategy based on the improved Multi-Armed Bandits algorithm called Co-SLMAB. It applies the Exponential Monte Carlo with counters (EMCQ) as the acceptance criterion. To our knowledge, ours is the first study to investigate the suitability of a reinforcement learning-based hyper-heuristic (RLHH) method for solving the multi-AGV task assignment and scheduling problem.
• We introduce a novel scheduling approach that optimizes the allocation of racks among AGVs and the handling sequence in the RMFS. Then, an efficient method for conflict-free AGV path planning is proposed, which takes different collision avoidance measures depending on the situation. Practicality is achieved by integrating task scheduling and path planning for multi-AGV systems.
• We demonstrate the superior performance of the RLHH on various problem instances through a comparative analysis with other algorithms. Furthermore, we analyze the efficiency of the proposed algorithm based on real-life warehouse layouts and perform a sensitivity analysis on AGV configurations. Numerical investigations reveal that the proposed approach greatly enhances the productivity of RMFS and the coordination of AGVs in an actual intelligent warehouse scenario.

Globally, e-commerce warehouses have begun implementing robotic mobile fulfillment systems (RMFS), which can improve order-picking efficiency by using automated guided vehicles (AGVs) to realize operations from parts to pickers. AGVs depart from their initial points, move to a target rack position, and subsequently transport racks to picking stations. The AGVs return the racks to their original positions after the workers pick them up. When all tasks are completed, the AGVs return to their starting point. In this context, the main challenge is the task assignment and route planning of multiple AGVs to minimize travel times. We formulate a mixed-integer linear programming (MILP) model with valid inequalities to solve small problem instances optimally. We in
ISSN:	1366-5545 1878-5794
DOI:	10.1016/j.tre.2024.103518
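
Illustrative note: the summary describes a selection hyper-heuristic that picks low-level heuristics with a multi-armed bandit and filters moves with a Monte Carlo acceptance rule. The Python sketch below shows that general pattern only. It is not the paper's Co-SLMAB or its exact EMCQ rule: the bandit is plain UCB1, the acceptance probability exp(-delta * step / stall) is a simplified stand-in in the spirit of EMCQ, the toy problem is a single closed tour rather than the MDTSP, and all function names and parameters are hypothetical.

```python
import math
import random


def tour_length(tour, pts):
    """Length of the closed tour visiting pts in the given order."""
    n = len(tour)
    return sum(math.dist(pts[tour[i]], pts[tour[(i + 1) % n]]) for i in range(n))


# Low-level heuristics (hypothetical choices): each returns a modified copy.
def swap(tour, rng):
    a, b = rng.sample(range(len(tour)), 2)
    new = tour[:]
    new[a], new[b] = new[b], new[a]
    return new


def relocate(tour, rng):
    new = tour[:]
    node = new.pop(rng.randrange(len(new)))
    new.insert(rng.randrange(len(new) + 1), node)
    return new


def reverse_segment(tour, rng):
    a, b = sorted(rng.sample(range(len(tour)), 2))
    return tour[:a] + tour[a:b + 1][::-1] + tour[b + 1:]


def ucb1_select(counts, rewards, step):
    """Plain UCB1: try each arm once, then maximize mean reward plus bonus."""
    for k, n in enumerate(counts):
        if n == 0:
            return k
    return max(
        range(len(counts)),
        key=lambda k: rewards[k] / counts[k]
        + math.sqrt(2 * math.log(step) / counts[k]),
    )


def hyper_heuristic(pts, iterations=2000, seed=0):
    rng = random.Random(seed)
    heuristics = [swap, relocate, reverse_segment]
    counts = [0] * len(heuristics)
    rewards = [0.0] * len(heuristics)

    current = list(range(len(pts)))
    rng.shuffle(current)
    cur_cost = tour_length(current, pts)
    best, best_cost = current[:], cur_cost
    stall = 0  # consecutive non-improving iterations

    for step in range(1, iterations + 1):
        k = ucb1_select(counts, rewards, step)
        cand = heuristics[k](current, rng)
        cand_cost = tour_length(cand, pts)
        delta = cand_cost - cur_cost

        # Reward the chosen heuristic only when it improves the solution.
        counts[k] += 1
        rewards[k] += 1.0 if delta < 0 else 0.0

        if delta < 0:
            accept, stall = True, 0
        else:
            # Assumed simplification: worse moves become harder to accept
            # as the search ages, easier the longer it has stalled.
            stall += 1
            accept = rng.random() < math.exp(-delta * step / stall)

        if accept:
            current, cur_cost = cand, cand_cost
            if cur_cost < best_cost:
                best, best_cost = current[:], cur_cost

    return best, best_cost
```

Calling `hyper_heuristic(points)` with a list of `(x, y)` tuples returns the best tour found and its length. Binary improvement rewards keep the bandit statistics in [0, 1], which is the range UCB1 assumes.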