Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring

This paper presents a new robust feature extraction algorithm based on a modified approach to power bias subtraction combined with applying a threshold to the power spectral density. Power bias level is selected as a level above which the signal power distribution is sharpest. The sharpness is measu...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Chanwoo Kim, Stern, R M
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Arithmetic auditory threshold Feature extraction Hidden Markov models Natural languages physiological modeling Power distribution power flooring Power measurement Power system modeling Robust speech recognition Robustness sharpness of power distribution Speech recognition Working environment noise
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper presents a new robust feature extraction algorithm based on a modified approach to power bias subtraction combined with applying a threshold to the power spectral density. Power bias level is selected as a level above which the signal power distribution is sharpest. The sharpness is measured using the ratio of arithmetic mean to the geometric mean of medium-duration power. When subtracting this bias level, power flooring is applied to enhance robustness. These new ideas are employed to enhance our recently introduced feature extraction algorithm PNCC (Power Normalized Cepstral Coefficient). While simpler than our previous PNCC, experimental results show that this new PNCC is showing better performance than our previous implementation.
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2010.5495570