Discriminative training of GMM for speaker identification

We describe a novel discriminative training procedure for a Gaussian mixture model (GMM) speaker identification system. The proposal is based on the segmental generalized probabilistic descent (GPD) algorithm formulated to estimate the GMM parameters. Two major innovations over similar formulations...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: del Alamo, C.M., Caminero Gil, F.J., dela Torre Munilla, C., Hernandez Gomez, L.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We describe a novel discriminative training procedure for a Gaussian mixture model (GMM) speaker identification system. The proposal is based on the segmental generalized probabilistic descent (GPD) algorithm formulated to estimate the GMM parameters. Two major innovations over similar formulations of segmental GPD training are proposed. (1) A misclassification measure based on an individual representation of competing speakers, that explicitly allows to take into account different learning strategies for correctly or incorrectly classified speakers. (2) An empirical loss function to control the training procedure convergence, with a likelihood-based selection of correctly or incorrectly classified competing speakers. A comparison between the proposed method and the traditional GPD algorithm is also presented.
ISSN:1520-6149
2379-190X
DOI:10.1109/ICASSP.1996.540297