Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition

Multiple recent studies have shown that speaker recognition performance using frame-based cepstral features is improved by adding higher-level information, including prosodic and lexical features. This paper explores the important question of finding a good kernel for a system that models syllable-b...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ferrer, Luciana, Shriberg, Elizabeth, Kajarekar, Sachin, Sonmez, Kemal
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Cepstral analysis Feature extraction GMM Kernel Laboratories NIST Performance evaluation Performance gain Prosody Speaker recognition Speech Support vector machines SVM
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Multiple recent studies have shown that speaker recognition performance using frame-based cepstral features is improved by adding higher-level information, including prosodic and lexical features. This paper explores the important question of finding a good kernel for a system that models syllable-based prosodic features using support vector machines (SVMs). The system has been the best performing of our high-level systems in the last two NIST evaluations, and gives significant improvements when combined with cepstral-based systems. We introduce two new methods for transforming the syllable-level features into a single high-dimensional vector that can be well modeled by SVMs, resulting in significant gains in speaker recognition performance.
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2007.366892