Universal background model based speech recognition

Bibliographic details
Main authors:	Povey, D., Chu, S.M., Varadarajan, B.
Format:	Conference proceedings
Language:	English
Subjects:	acoustic modeling; Automatic speech recognition; Broadcasting; Context modeling; Gaussian processes; Loudspeakers; Parameter estimation; Smoothing methods; Speaker recognition; Speech recognition; Statistics; UBM; universal background model

Description
Summary:	The universal background model (UBM) is an effective framework widely used in speaker recognition. But so far it has received little attention from the speech recognition field. In this work, we make a first attempt to apply the UBM to acoustic modeling in ASR. We propose a tree-based parameter estimation technique for UBMs, and describe a set of smoothing and pruning methods to facilitate learning. The proposed UBM approach is benchmarked on a state-of-the-art large-vocabulary continuous speech recognition platform on a broadcast transcription task. Preliminary experiments reported in this paper already show very exciting results.
ISSN:	1520-6149, 2379-190X
DOI:	10.1109/ICASSP.2008.4518671