Auditory-model based robust feature selection for speech recognition

It is shown that robust dimension-reduction of a feature set for speech recognition can be based on a model of the human auditory system. Whereas conventional methods optimize classification performance, the proposed method exploits knowledge implicit in the auditory periphery, inheriting its robust...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	The Journal of the Acoustical Society of America 2010-02, Vol.127 (2), p.EL73-EL79
Hauptverfasser:	Koniaris, Christos, Kuropatwinski, Marcin, Kleijn, W. Bastiaan
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic signal processing Acoustic Stimulation Acoustics Algorithms Audition auditory model Auditory Perception Audologi Biological and medical sciences Discriminant Analysis Environment Exact sciences and technology feature selection Folkhälsomedicinska forskningsområden Fundamental and applied biological sciences. Psychology Fundamental areas of phenomenology (including applications) Humans Hörselvetenskap Linear Models MEDICIN MEDICINE Models, Neurological Noise Pattern Recognition, Physiological Perception Physics Psychology. Psychoanalysis. Psychiatry Psychology. Psychophysiology Public health medicine research areas Recognition (Psychology) sensitivity matrix Social medicine Socialmedicin Speech speech recognition Speech Recognition Software
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	It is shown that robust dimension-reduction of a feature set for speech recognition can be based on a model of the human auditory system. Whereas conventional methods optimize classification performance, the proposed method exploits knowledge implicit in the auditory periphery, inheriting its robustness. Features are selected to maximize the similarity of the Euclidean geometry of the feature domain and the perceptual domain. Recognition experiments using mel-frequency cepstral coefficients (MFCCs) confirm the effectiveness of the approach, which does not require labeled training data. For noisy data the method outperforms commonly used discriminant-analysis based dimension-reduction methods that rely on labeling. The results indicate that selecting MFCCs in their natural order results in subsets with good performance.
ISSN:	0001-4966 1520-8524 1520-8524
DOI:	10.1121/1.3284545