The Relevance of the Availability of Visual Speech Cues during Adaptation to Noise-Vocoded Speech


Bibliographic details
Published in:	Journal of Speech, Language, and Hearing Research, 2021-07, Vol.64 (7), p.2513-2528
Main authors:	Trotter, Antony S, Banks, Briony, Adank, Patti
Format:	Article
Language:	eng
Subjects:	Accuracy; Acoustics; Adaptation; Adjustment (to Environment); Attention; Auditory Perception; Cues; Foreign Countries; Linguistic research; Listening; Listening Comprehension; Mouth; Native Speakers; Noise; Nonverbal communication; Oral communication; Recognition (Psychology); Sentences; Speech; Speech Improvement; Speech perception; Vision; Visual Acuity; Visual perception; Visual Stimuli
Online access:	Full text

Description
Abstract:	Purpose: This study first aimed to establish whether viewing specific parts of the speaker's face (eyes or mouth), compared to viewing the whole face, affected adaptation to distorted noise-vocoded sentences. Second, this study aimed to replicate results on processing of distorted speech from lab-based experiments in an online setup. Method: We monitored recognition accuracy online while participants were listening to noise-vocoded sentences. We first established if participants were able to perceive and adapt to audiovisual four-band noise-vocoded sentences when the entire moving face was visible (AV Full). Four further groups were then tested: a group in which participants viewed the moving lower part of the speaker's face (AV Mouth), a group in which participants saw only the moving upper part of the face (AV Eyes), a group in which participants could not see the moving lower or upper face (AV Blocked), and a group in which participants saw an image of a still face (AV Still). Results: Participants repeated around 40% of the key words correctly and adapted during the experiment, but only when the moving mouth was visible. In contrast, performance was at floor level, and no adaptation took place, in conditions where the moving mouth was occluded. Conclusions: The results show the importance of being able to observe relevant visual speech information from the speaker's mouth region, but not the eyes/upper face region, when listening and adapting to distorted sentences online. Second, the results also demonstrate that it is feasible to run speech perception and adaptation studies online, but that not all findings reported for lab studies replicate.
ISSN:	1092-4388 1558-9102
DOI:	10.1044/2021_JSLHR-20-00575