Imitation Learning with Graph Neural Networks for Improving Swarm Robustness under Restricted Communications

This paper focuses on generating distributed flocking strategies via imitation learning. The primary motivation is to improve the swarm robustness and achieve better consistency while respecting the communication constraints. This paper first proposes a quantitative metric of swarm robustness based...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Applied sciences 2021-10, Vol.11 (19), p.9055
Hauptverfasser:	Guo, Ce, Zhu, Pengming, Zhou, Zhiqian, Lang, Lin, Zeng, Zhiwen, Lu, Huimin
Format:	Artikel
Sprache:	eng
Schlagworte:	Behavior Communication Control algorithms graph convolutional networks graph importance entropy Graph neural networks Graph theory imitation learning Motivation Neural networks Observational learning Researchers Robots swarm robustness Velocity
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper focuses on generating distributed flocking strategies via imitation learning. The primary motivation is to improve the swarm robustness and achieve better consistency while respecting the communication constraints. This paper first proposes a quantitative metric of swarm robustness based on entropy evaluation. Then, the graph importance consistency is also proposed, which is one of the critical goals of the flocking task. Moreover, the importance-correlated directed graph convolutional networks (IDGCNs) are constructed for multidimensional feature extraction and structure-related aggregation of graph data. Next, by employing IDGCNs-based imitation learning, a distributed and scalable flocking strategy is obtained, and its performance is very close to the centralized strategy template while considering communication constraints. To speed up and simplify the training process, we train the flocking strategy with a small number of agents and set restrictions on communication. Finally, various simulation experiments are executed to verify the advantages of the obtained strategy in terms of realizing the swarm consistency and improving the swarm robustness. The results also show that the performance is well maintained while the scale of agents expands (tested with 20, 30, 40 robots).
ISSN:	2076-3417 2076-3417
DOI:	10.3390/app11199055