Performance comparison of ASR classifiers for the development of an English CAPT system for Filipino students

Computer Assisted Pronunciation Training (CAPT) systems aim to provide immediate, individualized feedback to the user on the overall quality of the pronunciation made. In such systems, one must be able to extract features from a waveform and represent words in the vocabulary. This paper presents the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Obach, D. D., Cordel, M. O.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Feature extraction Hidden Markov models Mel frequency cepstral coefficient Speech Speech recognition Support vector machines Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Computer Assisted Pronunciation Training (CAPT) systems aim to provide immediate, individualized feedback to the user on the overall quality of the pronunciation made. In such systems, one must be able to extract features from a waveform and represent words in the vocabulary. This paper presents the performance of Hidden Markov Model (HMM), Support-Vector Machine (SVM) and Multilayer Perceptron (MLP) as automatic speech recognizers for the English digits spoken by Filipino speakers. Speech waveforms are translated into a set of feature vectors using Mel Frequency Cepstrum Coefficients (MFCC). The training set consists of speech samples recorded by native Filipinos who speak English. The HMM-trained model produced a recognition rate of 95.79% compared to 86.33% and 91.66% recognition rates of SVM and MLP, respectively 1 .
ISSN:	2159-3442 2159-3450
DOI:	10.1109/TENCON.2012.6412252