Gender Clustering and Classification Algorithms in Speech Processing: A Comprehensive Performance Analysis

In speech processing gender clustering and classification is the most outstanding and challenging task. In both gender clustering and classification, one the most vital processes carried out is the selection of features. In speech processing, pitch is the most often used feature for gender clusterin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of computer applications 2012-01, Vol.51 (20), p.9-17
Hauptverfasser: Gomathy, M, Meena, K, Subramaniam, K R
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In speech processing gender clustering and classification is the most outstanding and challenging task. In both gender clustering and classification, one the most vital processes carried out is the selection of features. In speech processing, pitch is the most often used feature for gender clustering and classification. It is essential to note that compared to a female speech the pitch value of a male speech is much different. Also, in terms of frequency there is a considerable dissimilarity between the male and female speech. In some situations, either the frequency of male is almost same as female or the frequency of female is same as male. It is difficult to find out the exact gender in such conditions. This paper focus on rectifying these practical obstacles by extracting three significant features, namely, energy entropy, zero crossing rate, and short time energy. Gender clustering is performed based on these features. However, by means of Euclidean distance, Mahalanobis distance, Manhattan distance & Bhattacharyya distance methods the clustering performance is analyzed. Using fuzzy logic, neural network, hybrid neuro-fuzzy, and support vector machine the gender classification is done. A benchmark dataset and real-time dataset is used for testing to make sure the reliability of the performance. The test results show the performance of various techniques and distance algorithms for different datasets
ISSN:0975-8887
0975-8887
DOI:10.5120/8156-1533