Automatic speech recognition system with pitch dependent features for Punjabi language on KALDI toolkit

In this paper the improvement in performance of automatic speech recognition (ASR) system is achieved with help of pitch dependent features and probability of voicing estimated features. The pitch dependent features are useful for tonal language ASR system. Punjabi language is highly tonal language...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied acoustics 2020-10, Vol.167, p.107386, Article 107386
Hauptverfasser: Guglani, Jyoti, Mishra, A.N.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper the improvement in performance of automatic speech recognition (ASR) system is achieved with help of pitch dependent features and probability of voicing estimated features. The pitch dependent features are useful for tonal language ASR system. Punjabi language is highly tonal language and hence here we are building ASR system for Punjabi language with pitch dependent features and probability of voicing estimated features. The word error rate of system gives the performance of system which drastically improves with pitch dependent features and probability of voicing estimated features. Comparison of Yin, SAcC, Fundamental Frequency Variation (FFV) and Kaldi pitch features of ASR system were done in terms of WER. The KALDI pitch tracker of Kaldi toolkit gives the best performance ASR system among other featured ASR systems. The performance of ASR system is evaluated for Punjabi language.
ISSN:0003-682X
1872-910X
DOI:10.1016/j.apacoust.2020.107386