Safe navigation for UAV-enabled data dissemination by deep reinforcement learning in unknown environments

Unmanned aerial vehicles (UAVs) are increasingly considered in safe autonomous navigation systems to explore unknown environments where UAVs are equipped with multiple sensors to perceive the surroundings. However, how to achieve UAV-enabled data dissemination and also ensure safe navigation synchro...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:China communications 2022-01, Vol.19 (1), p.202-217
Hauptverfasser: Huang, Fei, Li, Guangxia, Tian, Shiwei, Chen, Jin, Fan, Guangteng, Chang, Jinghui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Unmanned aerial vehicles (UAVs) are increasingly considered in safe autonomous navigation systems to explore unknown environments where UAVs are equipped with multiple sensors to perceive the surroundings. However, how to achieve UAV-enabled data dissemination and also ensure safe navigation synchronously is a new challenge. In this paper, our goal is minimizing the whole weighted sum of the UAV's task completion time while satisfying the data transmission task requirement and the UAV's feasible flight region constraints. However, it is unable to be solved via standard optimization methods mainly on account of lacking a tractable and accurate system model in practice. To overcome this tough issue, we propose a new solution approach by utilizing the most advanced dueling double deep Q network (dueling DDQN) with multi-step learning. Specifically, to improve the algorithm, the extra labels are added to the primitive states. Simulation results indicate the validity and performance superiority of the proposed algorithm under different data thresholds compared with two other benchmarks.
ISSN:1673-5447
DOI:10.23919/JCC.2022.01.015