Path planning for underwater gliders in time-varying ocean current using deep reinforcement learning

The objective of this paper is to solve the application research of underwater glider (UG) and UGs formation, it is aiming to solve the path planning of gliders in ocean current environment by deep deterministic policy gradient (DDPG). Gliders can be deployed individually or collectively to execute...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Ocean engineering 2022-10, Vol.262, p.112226, Article 112226
Hauptverfasser:	Lan, Wei, Jin, Xiang, Chang, Xin, Wang, Tianlin, Zhou, Han, Tian, Wei, Zhou, Lilei
Format:	Artikel
Sprache:	eng
Schlagworte:	Deep reinforcement learning Multi-agent systems Path planning Time-varying ocean Underwater glider
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The objective of this paper is to solve the application research of underwater glider (UG) and UGs formation, it is aiming to solve the path planning of gliders in ocean current environment by deep deterministic policy gradient (DDPG). Gliders can be deployed individually or collectively to execute ocean missions. Using the existing glider model and the interactions between gliders and environment, models close to the practical application of UGs are established. The deep reinforcement learning (DRL) based planning algorithm by integrating artificial intelligence, and solution to planning problem of UGs is provided. For a single UG planning, the designed RL algorithm can solve the compliance of UG motion constraints. The algorithm can calculate the appropriate path for the UGs formation, and change the shape of formation as necessary, which is useful for navigation in the environment of dense obstacles. With the same reward function, the improved DDPG outperforms the deep Q-network (DQN). Based on Tokyo Bay geography and unacquainted ocean, the developed algorithm is tested in ocean current environments. •Reinforcement learning algorithm for glider applications in ocean environment.•Path planning of glider or formation in known or unknown ocean currents.•UG and formation obstacle avoidance and navigation in the restricted conditions.•It has advantages over several other algorithms in ocean environment by tests.•The algorithm is tested in the self-built simulated actual operating environment.
ISSN:	0029-8018 1873-5258
DOI:	10.1016/j.oceaneng.2022.112226