Deep Reinforcement Learning-Based Distributed 3D UAV Trajectory Design

The deployment of UAVs as aerial base stations (BSs) has been considered as a promising supplement to the ground networks, which can quickly build an emergency communication network in a disaster area or significantly relief the communication burden imposed by hot-spots. However, the application of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on communications 2024-06, Vol.72 (6), p.3736-3751
Hauptverfasser:	He, Huasen, Yuan, Wenke, Chen, Shuangwu, Jiang, Xiaofeng, Yang, Feng, Yang, Jian
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Autonomous aerial vehicles Communication Communication networks Deep learning Deep reinforcement learning Energy consumption Flight altitude multi-objective optimization Multiple objective analysis Optimization Solid modeling Three-dimensional displays Trajectory Trajectory optimization UAV communication UAV trajectory design Unmanned aerial vehicles
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The deployment of UAVs as aerial base stations (BSs) has been considered as a promising supplement to the ground networks, which can quickly build an emergency communication network in a disaster area or significantly relief the communication burden imposed by hot-spots. However, the application of UAVs as aerial BSs is constrained by the limited onboard energy and communication coverage of UAVs. In particular, for a large target area, multiple UAVs should be deployed to meet the communication requirements. Therefore, designing the optimal trajectories of multiple UAVs is crucial to boost the UAV network performance. Inspired by the promising future of UAV BSs, this paper aims at proposing a distributed 3-dimensional (3D) trajectory design algorithm for multiple UAVs to optimize the system performance. We formulate the trajectory design problem as a multi-objective optimization problem to improve the user equipment (UE) access rate, ensure fair access opportunities, increase transmitted data volume and reduce energy consumption. Further inspired by the decision-making ability of deep reinforcement learning (DRL) in complex environments, we propose a DRL based trajectory design algorithm for multiple UAVs, namely DMTD, in which UAVs can explore both the optimal flight altitude and the potential UE distribution area in the iterative interactions with the environment, and then select the optimal flight trajectories to boost the network performance from multiple aspects. Extensive experimental results under different UE distributions have demonstrated that the proposed DMTD algorithm can find the optimal altitude to provide maximum coverage. Moreover, DMTD beats existing algorithms by providing high UE access rate, ensuring fair network service and increasing total transmitted data volume at the cost of a relatively low energy consumption. Especially in the scenes with dense and randomly distributed UEs, DMTD provides a UE access rate close to 0.9 and transmits 6 times of data volume than existing algorithms.
ISSN:	0090-6778 1558-0857
DOI:	10.1109/TCOMM.2024.3361534