Maximum Likelihood Linear Regression Adaptation for the Polynomial Segment Models
Speaker adaptation has long been applied to improve speech recognition performance of hidden Markov model (HMM)-based systems. Recently, the polynomial segment model (PSM) has been shown as a viable alternative that can significantly improve the performance of large vocabulary continuous speech reco...
Gespeichert in:
Veröffentlicht in: | IEEE signal processing letters 2006-10, Vol.13 (10), p.644-647 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Speaker adaptation has long been applied to improve speech recognition performance of hidden Markov model (HMM)-based systems. Recently, the polynomial segment model (PSM) has been shown as a viable alternative that can significantly improve the performance of large vocabulary continuous speech recognition (LVCSR). In this letter, we extend the widely used HMM-based maximum likelihood linear regression (MLLR) speaker adaptation technique to PSMs. PSM properties, such as using segment as a modeling unit, and a polynomial curve as model mean, are taken into account in deriving the PSM-based MLLR. Experiments show that PSM-based MLLR adaptation performs equally well as the HMM-based MLLR adaptation with about 19% relative improvement from the SI model. In addition, another 5% relative improvement can be obtained by combining the adapted PSMs and HMMs |
---|---|
ISSN: | 1070-9908 1558-2361 |
DOI: | 10.1109/LSP.2006.875351 |