An improved TF-GSC for dual-microphone interference suppression in the specific direction
The performance of the speech enhancement (SE) algorithm will decrease rapidly in the presence of interference, especially competing or interfering speech. In this article, an improved real-time implementation of the transfer function generalized sidelobe canceller(TF-GSC) method based on distribute...
Gespeichert in:
Veröffentlicht in: | Multimedia tools and applications 2024, Vol.83 (4), p.11769-11783 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The performance of the speech enhancement (SE) algorithm will decrease rapidly in the presence of interference, especially competing or interfering speech. In this article, an improved real-time implementation of the transfer function generalized sidelobe canceller(TF-GSC) method based on distributed dual-microphone is proposed for interference suppression in the specific direction. In our method, we first derive an improved TF-GSC method based on a primary microphone and a secondary microphone which is abbreviated as GSC-PS in the following. GSC-PS estimates the desired signal by the dual-microphone structure based on estimation of time delay of arrival and calculation of the transfer functions. After that, we propose a new adaptive interference canceller based on the multichannel speech presence probability (MC-SPP) and the output gate unit. The calculated MC-SPP is applied to the step size adjustment and cost function modification of the adaptive interference canceller, while the output gate is designed based on the normalized posterior signal-to-interference ratio difference, which is sensitive to the direction of signal sources. The simulation results show that the proposed GSC-PS algorithm outperforms the current mainstream single-channel and multi-channel SE algorithms in suppressing interference and causes less damage to the quality of the target speech. In addition, experimental results confirm the usability of the proposed algorithm in real world acoustic environment with multiple sources of noises. |
---|---|
ISSN: | 1380-7501 1573-7721 |
DOI: | 10.1007/s11042-023-15817-9 |