Speaker recognition using syllable-based constraints for cepstral frame selection

We describe a new GMM-UBM speaker recognition system that uses standard cepstral features, but selects different frames of speech for different subsystems. Subsystems, or ldquoconstraintsrdquo, are based on syllable-level information and combined at the score level. Results on both the NIST 2006 and...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Bocklet, T., Shriberg, E.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Cepstral analysis cepstral features Data mining Feature extraction GMMs higher-level features Mel frequency cepstral coefficient MFCCs NIST Performance evaluation Speaker recognition Speech syllables System testing Telephony
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We describe a new GMM-UBM speaker recognition system that uses standard cepstral features, but selects different frames of speech for different subsystems. Subsystems, or ldquoconstraintsrdquo, are based on syllable-level information and combined at the score level. Results on both the NIST 2006 and 2008 test data sets for the English telephone train and test condition reveal that a set of eight constraints performs extremely well, resulting in better performance than other commonly-used cepstral models. Given the still largely-unexplored world of possible constraints and combinations, it is likely that the approach can be even further improved.
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2009.4960636