Neural network models for extracting complementary speaker-specific information from residual phase

In this paper using neural network models we demonstrate the presence of complementary speaker-specific information in the residual phase as compared to the conventional spectral features. The spectral features mainly represent the speaker-specific vocal tract system features. The proposed LP residu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Kodukula, S.R.M., Mahadeva Prasanna, S.R., Yegnanarayana, B.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper using neural network models we demonstrate the presence of complementary speaker-specific information in the residual phase as compared to the conventional spectral features. The spectral features mainly represent the speaker-specific vocal tract system features. The proposed LP residual phase represents the speaker-specific excitation source information. Speaker recognition studies are conducted using NIST 2003 speaker recognition evaluation database. The speaker recognition system using only spectral features gives an equal error rate (EER) of 15.5% and using only LP residual phase information gives an EER of 22.0%. However, combining the evidences from LP residual phase and spectral features increases the performance to an EER of 13.5%. This result clearly demonstrates the complementary nature of speaker-specific information present in the LP residual phase.
DOI:10.1109/ICISIP.2005.1529489