Study on recognition and classification of English accents using deep learning algorithms
The recognition and classification of English accents have high practical value in areas such as security management and information retrieval. This study introduced two English accent features, filter bank (FBank) and Mel-frequency cepstral coefficient (MFCC), based on deep learning techniques. It...
Gespeichert in:
Veröffentlicht in: | Journal of intelligent systems 2023-12, Vol.32 (1), p.201-14 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The recognition and classification of English accents have high practical value in areas such as security management and information retrieval. This study introduced two English accent features, filter bank (FBank) and Mel-frequency cepstral coefficient (MFCC), based on deep learning techniques. It then combined convolutional neural network (CNN), gated recurrent unit, and an attention mechanism to design a 1D CNN-BiGRU-Attention model for English accent recognition and classification. Experimental tests were conducted on the VoxForge dataset. The results showed that compared to MFCC, FBank performed better in English accent recognition and classification, and 70FBank achieved the highest
1 value. Among the recurrent neural network, long short-term memory, and other models, the BiGRU model had the best performance. The average
1 value of the 1D CNN-BiGRU-attention model was the highest, reaching 85.52%, and all the
1 values were above 80% for different accents, indicating that the addition of the attention mechanism effectively improved the model’s recognition and classification effectiveness. The results prove the reliability of the method proposed in this article for English accent recognition and classification, making it suitable for practical application and promotion. |
---|---|
ISSN: | 2191-026X 0334-1860 2191-026X |
DOI: | 10.1515/jisys-2023-0174 |