Model-based distributed node clustering and multi-speaker speech presence probability estimation in wireless acoustic sensor networks

The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of the Acoustical Society of America 2020-06, Vol.147 (6), p.4189-4201
Hauptverfasser: Zhao, Yingke, Nielsen, Jesper Kjær, Chen, Jingdong, Christensen, Mads Græsbøll
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.
ISSN:0001-4966
1520-8524
DOI:10.1121/10.0001449