Parameter discrimination analysis in speaker identification using self organizing map

This paper presents a comparison of the discrimination in representing the individual features of speakers between Mel Frequency Cepstrum Coefficients(MFCC) and Line Spectrum Pair Frequencies(LSP). We use Self Organizing Map of Kohonen(SOM) to explore the effectiveness of these two parameters. Becau...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Yue, Pan, Qixiu, Hu, Wenhu, Wu
Format: Buchkapitel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper presents a comparison of the discrimination in representing the individual features of speakers between Mel Frequency Cepstrum Coefficients(MFCC) and Line Spectrum Pair Frequencies(LSP). We use Self Organizing Map of Kohonen(SOM) to explore the effectiveness of these two parameters. Because SOM can keep the topological property of the feature space, it helps us to understand the difference directly through the senses. In the experiment, MFCC is derived from FFT and LSP is derived from LPC analysis. To reduce the computation complexity and improve the robustness, LSP parameters are vector quantized by a codebook like in speech coding and a distance weighting is incorporated. SOM is trained by 33 speakers and a codebook with 400 codes. For each speaker, the training utterance is 60 sec. long. The final result shows that these two speech parameters produce very similar feature maps for the same speaker in the general feature space. A correlation criterion gives further verification. Thus, LSP and MFCC coefficients may be considered to be equivalent in Euclidean distance meaning. At the end of the paper, neural networks VQ model method is adopted to compare the experiment validity of these two parameters in text independent speaker identification and both of them achieve satisfactory results.
ISSN:0302-9743
1611-3349
DOI:10.1007/BFb0016005