A noise robust Arabic speech recognition system based on the echo state network

A major challenge in the field of automated speech recognition (ASR) lies in designing noise-resilient systems. These systems are crucial for real-world applications where high levels of noise tend to be present. We introduce a noise robust system based on a recently developed approach to training a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of the Acoustical Society of America 2014-04, Vol.135 (4_Supplement), p.2195-2195
Hauptverfasser: Alalshekmubarak, Abdulrahman, Smith, Leslie S.
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A major challenge in the field of automated speech recognition (ASR) lies in designing noise-resilient systems. These systems are crucial for real-world applications where high levels of noise tend to be present. We introduce a noise robust system based on a recently developed approach to training a recurrent neural network (RNN), namely, the echo state network (ESN). To evaluate the performance of the proposed system, we used our recently released public Arabic dataset that contains a total of about 10 000 examples of 20 isolated words spoken by 50 speakers. Different feature extraction methods considered in this study include mel-frequency cepstral coefficients (MFCCs), perceptual linear prediction (PLP) and RASTA- perceptual linear prediction. These extracted features were fed to the ESN and the result was compared with a baseline hidden Markov model (HMM), so that six models were compared in total. These models were trained on clean data and then tested on unseen data with different levels and types of noise. ESN models outperformed HMM models under almost all the feature extraction methods, noise levels, and noise types. The best performance was obtained by the model that combined RASTA-PLP with ESN.
ISSN:0001-4966
1520-8524
DOI:10.1121/1.4877154