Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers

Humans are remarkably skilled at listening to one speaker out of an acoustic mixture of several speech sources. Two speakers are easily segregated, even without binaural cues, but the neural mechanisms underlying this ability are not well understood. One possibility is that early cortical processing...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	PLoS biology 2020-10, Vol.18 (10), p.e3000883-e3000883
Hauptverfasser:	Brodbeck, Christian, Jiao, Alex, Hong, L Elliot, Simon, Jonathan Z
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic noise Acoustic Stimulation Acoustics Adult Attention - physiology Auditory cortex Auditory Cortex - physiology Auditory masking Biology and Life Sciences Brain research Computer engineering Cortex (auditory) Engineering and Technology Female Humans Hypotheses Magnetoencephalography Male Medicine and Health Sciences Middle Aged Models, Biological Physical Sciences Physiological aspects Psychological research Representations Research and Analysis Methods Segregation process Social Sciences Speech Speech - physiology Speech perception Time Factors Young Adult
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Humans are remarkably skilled at listening to one speaker out of an acoustic mixture of several speech sources. Two speakers are easily segregated, even without binaural cues, but the neural mechanisms underlying this ability are not well understood. One possibility is that early cortical processing performs a spectrotemporal decomposition of the acoustic mixture, allowing the attended speech to be reconstructed via optimally weighted recombinations that discount spectrotemporal regions where sources heavily overlap. Using human magnetoencephalography (MEG) responses to a 2-talker mixture, we show evidence for an alternative possibility, in which early, active segregation occurs even for strongly spectrotemporally overlapping regions. Early (approximately 70-millisecond) responses to nonoverlapping spectrotemporal features are seen for both talkers. When competing talkers' spectrotemporal features mask each other, the individual representations persist, but they occur with an approximately 20-millisecond delay. This suggests that the auditory cortex recovers acoustic features that are masked in the mixture, even if they occurred in the ignored speech. The existence of such noise-robust cortical representations, of features present in attended as well as ignored speech, suggests an active cortical stream segregation process, which could explain a range of behavioral effects of ignored background speech.
ISSN:	1545-7885 1544-9173 1545-7885
DOI:	10.1371/journal.pbio.3000883