Ultra-Wideband Radar-Based Two-Stream ConvNeXt-AFF Neural Network for Sign Language Gesture Recognition

In recent years, the miniaturization of ultra-wideband (UWB) radar devices has provided new opportunities for an increasing number of research applications. In the field of human gesture recognition, UWB radar has attracted significant research interest due to its unique advantages of high measureme...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE sensors journal 2024-01, Vol.24 (18), p.28960-28970
Hauptverfasser:	Yin, Yonggen, Zhang, Zhaoxia, Huo, Ze, Shen, Zhiyuan, Chen, Hongyang
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Artificial neural networks Attention mechanism ConvNeXt Feature extraction feature fusion Gesture recognition Neural networks Radar Radar detection Radar equipment Sensors Sign language sign language gesture recognition (SLGR) Spectrograms Target detection Transformers Ultra wideband technology ultra-wideband (UWB) radar Ultrawideband radar
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In recent years, the miniaturization of ultra-wideband (UWB) radar devices has provided new opportunities for an increasing number of research applications. In the field of human gesture recognition, UWB radar has attracted significant research interest due to its unique advantages of high measurement accuracy, strong interference rejection capabilities, and insensitivity to ambient illumination conditions during target detection. However, the current research relies on extracting gesture features from single-domain information, which does not fully utilize all the information available in the echo. To address this, we have developed a novel multidomain feature fusion model called two-stream ConvNeXt-AFF to improve the accuracy of sign language gesture identification. In our proposed model, multiscale features are first independently extracted from time-frequency spectrograms and range-Doppler signatures using dual convolutional neural network (CNN) streams. A attention feature fusion (AFF) module is then applied to integrate multimodal representations. To validate the model's efficacy, an experimental dataset containing 2000 samples of ten different sign language gestures was collected. Results demonstrate that the two-stream ConvNeXt-AFF network can effectively recognize sign language gestures with an accuracy of 99.86%, outperforming other traditional methods.
ISSN:	1530-437X 1558-1748
DOI:	10.1109/JSEN.2024.3431548