Multidimensional Beam Optimization in Underwater Optical Wireless Communication Based on Deep Reinforcement Learning

Bibliographic Details
Published in:	IEEE Internet of Things Journal, 2024-09, Vol. 11 (17), p. 28623-28634
Main authors:	Shin, Huicheol; Baek, Seungjae; Song, Yujae
Format:	Article
Language:	English
Subjects:	Acoustic beams; Adaptive control; Adaptive optics; Algorithms; Alignment; Beam divergence (BD) angle; Beam orientation (BO) angle; Deep learning; Deep reinforcement learning; Laser beams; Machine learning; Ocean floor; Optical beams; Optical scattering; Optical sensors; Optical transmitters; Optical wireless; Sensors; Signal to noise ratio; Surface vehicles; Underwater communication; Underwater detectors; Underwater optical wireless communication (UOWC); Unmanned vehicles; Wireless communications

Description
Abstract:	In this work, we study learning-aided adaptive control of optical beam alignment to maintain a seamless connection with high communication performance in a point-to-point (P2P) underwater optical wireless communication (UOWC) system. To this end, we propose a two-step two-agent deep reinforcement learning (TSTA-DRL) algorithm that enables an underwater sensor (US) installed on the seabed to sequentially determine the beam orientation (BO) and beam divergence (BD) angles for transmitting its sensing data to an unmanned surface vehicle (USV) that may shake irregularly at the sea surface. Specifically, the proposed TSTA-DRL algorithm includes two DRL agents: a BO agent and a BD agent. The BO agent selects the BO angle that points the optical beam of the US toward the USV, thereby aligning the beam between the US and the USV. Then, given the BO angle determined by the BO agent, the BD agent chooses the BD angle to maximize the signal-to-noise ratio (SNR) while maintaining a seamless optical link between the two nodes. To make the proposed algorithm practically applicable, it is trained on USV movement data measured in the South Sea of Korea. Simulation results demonstrate that the proposed TSTA-DRL algorithm achieves the highest SNR among the compared existing algorithms while maintaining a stable UOWC link.
ISSN:	2327-4662
DOI:	10.1109/JIOT.2024.3404476
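
Illustrative note: the abstract describes a two-step decision in which a BO angle is chosen first and a BD angle is then chosen given that BO angle. The sketch below only illustrates that ordering and is not the authors' TSTA-DRL method. The learned agents are replaced by simple greedy rules, and the cone-coverage test, the 1/tan^2(BD/2) SNR proxy, the safety margin, and all function names are assumptions made for illustration.

```python
import math

def link_ok(bo_deg, bd_deg, usv_deg, margin_deg=0.0):
    """True if the USV direction lies inside the beam cone (assumed geometry)."""
    return abs(usv_deg - bo_deg) + margin_deg <= bd_deg / 2.0

def relative_snr(bd_deg):
    """Toy SNR proxy (assumption): power density scales as 1 / tan^2(BD/2)."""
    return 1.0 / math.tan(math.radians(bd_deg) / 2.0) ** 2

def choose_bo(bo_actions, usv_estimate_deg):
    """Step 1, greedy stand-in for the BO agent: orientation nearest the USV estimate."""
    return min(bo_actions, key=lambda bo: abs(bo - usv_estimate_deg))

def choose_bd(bd_actions, bo_deg, usv_estimate_deg, margin_deg):
    """Step 2, greedy stand-in for the BD agent: best SNR proxy that keeps the link."""
    feasible = [bd for bd in bd_actions
                if link_ok(bo_deg, bd, usv_estimate_deg, margin_deg)]
    if not feasible:
        # No divergence covers the USV: fall back to the widest beam.
        return max(bd_actions)
    return max(feasible, key=relative_snr)

# Hypothetical discrete action sets, in degrees.
bo_actions = [-10, -5, 0, 5, 10]
bd_actions = [2, 4, 8, 16]

bo = choose_bo(bo_actions, usv_estimate_deg=3.5)
bd = choose_bd(bd_actions, bo, usv_estimate_deg=3.5, margin_deg=1.0)
print(bo, bd)
```

The narrower the beam, the higher the proxy SNR but the smaller the tolerance to USV motion, which is the trade-off the BD step has to balance once the BO angle is fixed.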