Deep reinforcement learning based AGVs real-time scheduling with mixed rule for flexible shop floor in industry 4.0

•A deep reinforcement learning based real-time scheduling for Automated Guided Vehicles is proposed.•Useful policy can be achieved through continuous training process.•Adaptive and efficient decisions can be made based on the proposed approach. Driven by the recent advances in industry 4.0 and indus...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers & industrial engineering 2020-11, Vol.149, p.106749, Article 106749
Hauptverfasser: Hu, Hao, Jia, Xiaoliang, He, Qixuan, Fu, Shifeng, Liu, Kuo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A deep reinforcement learning based real-time scheduling for Automated Guided Vehicles is proposed.•Useful policy can be achieved through continuous training process.•Adaptive and efficient decisions can be made based on the proposed approach. Driven by the recent advances in industry 4.0 and industrial artificial intelligence, Automated Guided Vehicles (AGVs) has been widely used in flexible shop floor for material handling. However, great challenges aroused by the high dynamics, complexity, and uncertainty of the shop floor environment still exists on AGVs real-time scheduling. To address these challenges, an adaptive deep reinforcement learning (DRL) based AGVs real-time scheduling approach with mixed rule is proposed to the flexible shop floor to minimize the makespan and delay ratio. Firstly, the problem of AGVs real-time scheduling is formulated as a Markov Decision Process (MDP) in which state representation, action representation, reward function, and optimal mixed rule policy, are described in detail. Then a novel deep q-network (DQN) method is further developed to achieve the optimal mixed rule policy with which the suitable dispatching rules and AGVs can be selected to execute the scheduling towards various states. Finally, the case study based on a real-world flexible shop floor is illustrated and the results validate the feasibility and effectiveness of the proposed approach.
ISSN:0360-8352
1879-0550
DOI:10.1016/j.cie.2020.106749