An Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning

To address the challenges of obtaining network state information, flexibly forwarding data, and improving the communication quality of service (QoS) in wireless network transmission environments in response to dynamic changes in network topology, this paper introduces an intelligent routing algorith...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE access 2023, Vol.11, p.83322-83342
Hauptverfasser:	Li, Jinqiang, Ye, Miao, Huang, Linqiang, Deng, Xiaofang, Qiu, Hongbing, Wang, Yong, Jiang, Qiuxiang
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Artificial neural networks Communications traffic Computer architecture Convergence Data models Deep learning Deep reinforcement learning gradient clipping Heuristic algorithms Importance sampling intelligent routing Machine learning network situational awareness Network topologies Optimization Optimization methods Prediction algorithms Predictive models Quality of service architectures Routing Situational awareness software-defined wireless networking Source code Traffic information Wireless networks
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	To address the challenges of obtaining network state information, flexibly forwarding data, and improving the communication quality of service (QoS) in wireless network transmission environments in response to dynamic changes in network topology, this paper introduces an intelligent routing algorithm based on deep reinforcement learning (DRL) with network situational awareness under a software-defined wireless networking (SDWN) architecture. First, comprehensive network traffic information is collected under the SDWN architecture, and a graph convolutional network-gated recurrent unit (GCN-GRU) prediction mechanism is used to perceive future traffic trends. Second, a proximal policy optimization (PPO) DRL-based data forwarding mechanism is designed in the knowledge plane. The predicted network traffic matrix and topology information matrix are treated as the DRL environment, while next-hop adjacent nodes are treated as executable actions, and action selection policies are designed for different network conditions. To guide the learning and improvement of the DRL agent's routing strategy, reward functions of different forms are designed by utilizing network link information and different penalty mechanisms. Additionally, importance sampling steps and gradient clipping methods are employed during gradient updating to enhance the convergence speed and stability of the designed intelligent routing method. Experimental results show that this solution outperforms traditional routing methods in network throughput, delay, packet loss rate, and wireless node distance. Compared to value-function-based Dueling Deep Q-Network (DQN) routing, the convergence of the proposed method is significantly faster and more stable. Simultaneously, hardware storage consumption is reduced, and real-time routing decisions can be made using the current network state information. The source code can be accessed at https://github.com/GuetYe/DRL-PPONSA .
ISSN:	2169-3536 2169-3536
DOI:	10.1109/ACCESS.2023.3302178