An application of the warped discrete Fourier transform in the perceptual speech enhancement

An application of the warped discrete Fourier transform (WDFT) in the perceptual speech enhancement is of interest. The WDFT allows nonuniform sampling the z-transform of finite length sequence. We focus on the perceptual warping which allocates frequency samples in good accordance with the Bark sca...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Speech communication 2006-08, Vol.48 (8), p.1024-1036
Hauptverfasser: Borowicz, A., Parfieniuk, M., Petrovsky, A.A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:An application of the warped discrete Fourier transform (WDFT) in the perceptual speech enhancement is of interest. The WDFT allows nonuniform sampling the z-transform of finite length sequence. We focus on the perceptual warping which allocates frequency samples in good accordance with the Bark scale. The WDFT can replace conventional DFT based analysis/synthesis block of spectral weighting method. In the case of the perceptual warping, there is a problem with signal reconstruction because the WDFT matrix is ill-conditioned. This paper addresses the problem of signal distortions generated in WDFT based synthesis block. Spectral characteristics of the reconstructed signal are analyzed and discussed in the context of the perceptual processing. A new extension of the WDFT intended to cancellation of the synthesis error is presented. The new method is also validated in practical speech enhancement system. The results show that the new algorithm outperforms pure WDFT based system.
ISSN:0167-6393
1872-7182
DOI:10.1016/j.specom.2006.01.004