A spectro-temporal algorithm for pitch frequency estimation from noisy observations
A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-vari...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-variation of the underlying non-stationary noise is put forward to enhance speech prior to PF estimation. The de-noised speech is then utilized to propose a squared difference function of the Linear Prediction (LP) residual which is expected to reveal more prominent dips at integral multiples of the pitch period compared to that revealed by the LP residual. The dips at different pitch-harmonic locations are added and weighted by a periodicity dependent weighting factor for every possible pitch period thus yielding a weighted and harmonically summed temporal function which is globally minimized to extract the desired PF. Simulation results using the Keele database show the superior efficacy of the proposed method in the presence of a multi-talker babble noise relative to some of the existing methods. |
---|---|
ISSN: | 0271-4302 2158-1525 |
DOI: | 10.1109/ISCAS.2008.4541765 |