A spectro-temporal algorithm for pitch frequency estimation from noisy observations

A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-vari...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Shahnaz, C., Zhu, W.-P., Ahmad, M.O.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-variation of the underlying non-stationary noise is put forward to enhance speech prior to PF estimation. The de-noised speech is then utilized to propose a squared difference function of the Linear Prediction (LP) residual which is expected to reveal more prominent dips at integral multiples of the pitch period compared to that revealed by the LP residual. The dips at different pitch-harmonic locations are added and weighted by a periodicity dependent weighting factor for every possible pitch period thus yielding a weighted and harmonically summed temporal function which is globally minimized to extract the desired PF. Simulation results using the Keele database show the superior efficacy of the proposed method in the presence of a multi-talker babble noise relative to some of the existing methods.
ISSN:0271-4302
2158-1525
DOI:10.1109/ISCAS.2008.4541765