Speech perception in noise with binary gains

For a given mixture of speech and noise, an ideal binary time-frequency mask is constructed by whether SNR within individual time-frequency units exceeds a local SNR criterion (LC). With linear filters, co-reducing mixture SNR and LC does not alter the ideal binary mask. Taking this manipulation to...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	The Journal of the Acoustical Society of America 2008-05, Vol.123 (5_Supplement), p.3066-3066
Hauptverfasser:	Wang, Deliang, Kjems, Ulrik, Pedersen, Michael S., Boldt, Jesper B., Lunner, Thomas
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	For a given mixture of speech and noise, an ideal binary time-frequency mask is constructed by whether SNR within individual time-frequency units exceeds a local SNR criterion (LC). With linear filters, co-reducing mixture SNR and LC does not alter the ideal binary mask. Taking this manipulation to the limit by setting both mixture SNR and LC to minus infinity produces an output that contains only noise with no target speech at all. This particular output corresponds to turning on or off the filtered noise according to a pattern prescribed by the ideal binary mask. Our study was designed to test on speech intelligibility of noise gated by the ideal binary mask obtained this way. It is observed that listeners achieve nearly perfect speech recognition from gated noise. Only sixteen filter channels and a frame rate of one hundred Hertz are sufficient for high intelligibility. The results show that, despite a dramatic reduction of speech information, a pattern of binary gains provides an adequate basis for speech perception in noise.
ISSN:	0001-4966 1520-8524
DOI:	10.1121/1.2932823