A Discriminative Segmental Speech Model and Its Application to Hungarian Number Recognition

This paper presents a stochastic segmental speech recogniser that models the a posteriori probabilities directly. The main issues concerning the system are segmental phoneme classification, utterance-level aggregation and the pruning of the search space. For phoneme classification, artificial neural...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Tóth, László, Kocsor, András, Kovács, Kornél
Format:	Buchkapitel
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Computer science control theory systems Exact sciences and technology Speech and sound recognition and synthesis. Linguistics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper presents a stochastic segmental speech recogniser that models the a posteriori probabilities directly. The main issues concerning the system are segmental phoneme classification, utterance-level aggregation and the pruning of the search space. For phoneme classification, artificial neural networks and support vector machines are applied. Phonemic segmentation and utterancelevel aggregation is performed with the aid of anti-phoneme modelling. At the phoneme level, the system convincingly outperforms the HMM system trained on the same corpus, while at the word level it attains the performance of the HMM system trained without embedded training.
ISSN:	0302-9743 1611-3349
DOI:	10.1007/3-540-45323-7_52