ConvAtt Network: A Low Parameter Approach For Sign Language Recognition

Despite recent advances in Large Language Models in text processing, Sign Language Recognition (SLR) remains an unresolved task. This is, in part, due to limitations in the available data. In this paper, we investigate combining 1D convolutions with transformer layers to capture local features and g...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of computer science and technology (La Plata) 2024-10, Vol.24 (2), p.e10
Hauptverfasser:	Rios, Gaston Gustavo, Dal Bianco, Pedro, Ronchetti, Franco, Quiroga, Facundo, Ponte Ahón, Santiago, Stanchi, Oscar, Hasperué, Waldo
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Data augmentation Deep Learning Large language models Parameters Regularization Sequence Classification Sign Language Recognition Unbalanced Data
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Despite recent advances in Large Language Models in text processing, Sign Language Recognition (SLR) remains an unresolved task. This is, in part, due to limitations in the available data. In this paper, we investigate combining 1D convolutions with transformer layers to capture local features and global interactions in a low-parameter SLR model. We experimented using multiple data augmentation and regularization techniques to categorize signs of the French Belgian Sign Language. We achieved a top-1 accuracy of 42.7% and a top-10 accuracy of 81.9% in 600 different signs. This model is competitive with the current state of the art while using a significantly lower number of parameters.
ISSN:	1666-6046 1666-6038
DOI:	10.24215/16666038.24.e10