Incorporating a novel confidence scoring method in a Persian spoken dialogue system

Reliability assessment of phonemes, syllabi, words, concepts or utterances has become the key feature of Automatic Speech Recognition (ASR) engines in order to make a decision to accept or reject a hypothesis. In this paper, we propose utterance-level confidence annotation based on combination of fe...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Sakhaee, E., Sameti, H., BabaAli, B.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Acoustics Feature extraction Kernel Logistics Reliability Speech recognition Support vector machines
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Reliability assessment of phonemes, syllabi, words, concepts or utterances has become the key feature of Automatic Speech Recognition (ASR) engines in order to make a decision to accept or reject a hypothesis. In this paper, we propose utterance-level confidence annotation based on combination of features extracted from multiple knowledge sources in Persian language. The experiment was conducted first to examine the performance of individual features, then to combine them using statistical data analysis and density estimation methods to assign a confidence score to utterances. Using the data collected from a Persian spoken dialogue system, we show that combining features from independent sources for confidence measure (CM) results in an improvement of about 5.5% mean error rate reduction in comparison to exploiting single features.
ISSN:	2326-0262 2326-0319