Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model
With fewer feature dimensions, filter banks are often used in light-weight full-band speech enhancement models. In order to further enhance the coarse speech in the sub-band domain, it is necessary to apply a post-filtering for harmonic retrieval. The signal processing-based comb filters used in RNN...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | With fewer feature dimensions, filter banks are often used in light-weight
full-band speech enhancement models. In order to further enhance the coarse
speech in the sub-band domain, it is necessary to apply a post-filtering for
harmonic retrieval. The signal processing-based comb filters used in RNNoise
and PercepNet have limited performance and may cause speech quality degradation
due to inaccurate fundamental frequency estimation. To tackle this problem, we
propose a learnable comb filter to enhance harmonics. Based on the sub-band
model, we design a DNN-based fundamental frequency estimator to estimate the
discrete fundamental frequencies and a comb filter for harmonic enhancement,
which are trained via an end-to-end pattern. The experiments show the
advantages of our proposed method over PecepNet and DeepFilterNet. |
---|---|
DOI: | 10.48550/arxiv.2306.00812 |