Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency

Differences in fundamental frequency (F0) between voiced sounds are known to be a strong cue for stream segregation. However, speech consists of both voiced and unvoiced sounds, and less is known about whether and how the unvoiced portions are segregated. This study measured listeners' ability...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Hearing research 2017-02, Vol.344, p.235-243
Hauptverfasser:	David, Marion, Lavandier, Mathieu, Grimault, Nicolas, Oxenham, Andrew J.
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic Stimulation Acoustics Adolescent Adult Audiometry, Speech Auditory Threshold Cues Engineering Sciences Female Fundamental frequency Humans Male Middle Aged Speech Acoustics Speech Intelligibility Speech Perception Speech sounds Stream segregation Voice Quality Young Adult
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	243
container_issue
container_start_page	235
container_title	Hearing research
container_volume	344
creator	David, Marion Lavandier, Mathieu Grimault, Nicolas Oxenham, Andrew J.
description	Differences in fundamental frequency (F0) between voiced sounds are known to be a strong cue for stream segregation. However, speech consists of both voiced and unvoiced sounds, and less is known about whether and how the unvoiced portions are segregated. This study measured listeners' ability to integrate or segregate sequences of consonant-vowel tokens, comprising a voiceless fricative and a vowel, as a function of the F0 difference between interleaved sequences of tokens. A performance-based measure was used, in which listeners detected the presence of a repeated token either within one sequence or between the two sequences (measures of voluntary and obligatory streaming, respectively). The results showed a systematic increase of voluntary stream segregation as the F0 difference between the two interleaved sequences increased from 0 to 13 semitones, suggesting that F0 differences allowed listeners to segregate speech sounds, including the unvoiced portions. In contrast to the consistent effects of voluntary streaming, the trend towards obligatory stream segregation at large F0 differences failed to reach significance. Listeners were no longer able to perform the voluntary-streaming task reliably when the unvoiced portions were removed from the stimuli, suggesting that the unvoiced portions were used and correctly segregated in the original task. The results demonstrate that streaming based on F0 differences occurs for natural speech sounds, and that the unvoiced portions are correctly assigned to the corresponding voiced portions. •The stimuli used in the study consisted in an unvoiced fricative consonant and a voiced vowel (CV token).•Listeners could use a difference in F0 to segregate alternating CV tokens.•Evidence for both obligatory and voluntary stream segregation was found.•Listeners did not base their judgments on the vowel part only or the consonant part only.•Listeners were no longer able to perform the task without the fricative part of the stimuli.•Listeners were able to segregate the whole tokens based on ΔF0 despite the lack of F0 cues in the fricative part.
doi_str_mv	10.1016/j.heares.2016.11.016
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5239743</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S037859551630315X</els_id><sourcerecordid>1846721353</sourcerecordid><originalsourceid>FETCH-LOGICAL-c497t-9676e5c38963e4e31cd5db0606109b493047b52ed86877aa836f7e4bb9416fb43</originalsourceid><addsrcrecordid>eNp9kU1v1DAQhi3Uii6Ff4CqHMthgx1_xRekqioUaSUObc-WY092vUrixU5W2n9fh10K5cBpPOOZZz5ehD4SXBJMxOdtuQETIZVV9kpCymzeoAWpZb3ktSJnaIHp_FacX6B3KW0xJpyy6i26qKSqqKRqgcwD_JxgGL3pijRGMH2RYB1hbUYfhiK0xT54C64wgyum4eSkHYDdFClMg0tFY1KO5ew2u6bPtAxr4y-wPbxH563pEnw42Uv09PXu8fZ-ufrx7fvtzWppmZLjUgkpgFtaK0GBASXWcddggQXBqmGKYiYbXoGrRS2lMTUVrQTWNIoR0TaMXqIvR-5uanpwNo8RTad30fcmHnQwXr_-GfxGr8Ne84oqyWgGfDoCNv-U3d-s9BzLB1ZYVnhPcu71qVkMec006t4nC11nBghT0qRmQlaE8hnLjqk2hpQitC9sgvWspN7qo5J6VlITMjfKZVd_r_NS9Fu6P_tCPureQ9TJ-nxwcD6CHbUL_v8dngHGeLKN</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1846721353</pqid></control><display><type>article</type><title>Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><creator>David, Marion ; Lavandier, Mathieu ; Grimault, Nicolas ; Oxenham, Andrew J.</creator><creatorcontrib>David, Marion ; Lavandier, Mathieu ; Grimault, Nicolas ; Oxenham, Andrew J.</creatorcontrib><description>Differences in fundamental frequency (F0) between voiced sounds are known to be a strong cue for stream segregation. However, speech consists of both voiced and unvoiced sounds, and less is known about whether and how the unvoiced portions are segregated. This study measured listeners' ability to integrate or segregate sequences of consonant-vowel tokens, comprising a voiceless fricative and a vowel, as a function of the F0 difference between interleaved sequences of tokens. A performance-based measure was used, in which listeners detected the presence of a repeated token either within one sequence or between the two sequences (measures of voluntary and obligatory streaming, respectively). The results showed a systematic increase of voluntary stream segregation as the F0 difference between the two interleaved sequences increased from 0 to 13 semitones, suggesting that F0 differences allowed listeners to segregate speech sounds, including the unvoiced portions. In contrast to the consistent effects of voluntary streaming, the trend towards obligatory stream segregation at large F0 differences failed to reach significance. Listeners were no longer able to perform the voluntary-streaming task reliably when the unvoiced portions were removed from the stimuli, suggesting that the unvoiced portions were used and correctly segregated in the original task. The results demonstrate that streaming based on F0 differences occurs for natural speech sounds, and that the unvoiced portions are correctly assigned to the corresponding voiced portions. •The stimuli used in the study consisted in an unvoiced fricative consonant and a voiced vowel (CV token).•Listeners could use a difference in F0 to segregate alternating CV tokens.•Evidence for both obligatory and voluntary stream segregation was found.•Listeners did not base their judgments on the vowel part only or the consonant part only.•Listeners were no longer able to perform the task without the fricative part of the stimuli.•Listeners were able to segregate the whole tokens based on ΔF0 despite the lack of F0 cues in the fricative part.</description><identifier>ISSN: 0378-5955</identifier><identifier>EISSN: 1878-5891</identifier><identifier>DOI: 10.1016/j.heares.2016.11.016</identifier><identifier>PMID: 27923739</identifier><language>eng</language><publisher>Netherlands: Elsevier B.V</publisher><subject>Acoustic Stimulation ; Acoustics ; Adolescent ; Adult ; Audiometry, Speech ; Auditory Threshold ; Cues ; Engineering Sciences ; Female ; Fundamental frequency ; Humans ; Male ; Middle Aged ; Speech Acoustics ; Speech Intelligibility ; Speech Perception ; Speech sounds ; Stream segregation ; Voice Quality ; Young Adult</subject><ispartof>Hearing research, 2017-02, Vol.344, p.235-243</ispartof><rights>2016 Elsevier B.V.</rights><rights>Copyright © 2016 Elsevier B.V. All rights reserved.</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c497t-9676e5c38963e4e31cd5db0606109b493047b52ed86877aa836f7e4bb9416fb43</citedby><cites>FETCH-LOGICAL-c497t-9676e5c38963e4e31cd5db0606109b493047b52ed86877aa836f7e4bb9416fb43</cites><orcidid>0000-0003-3586-4426 ; 0009-0004-1339-8887</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.heares.2016.11.016$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,780,784,885,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27923739$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-01690720$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>David, Marion</creatorcontrib><creatorcontrib>Lavandier, Mathieu</creatorcontrib><creatorcontrib>Grimault, Nicolas</creatorcontrib><creatorcontrib>Oxenham, Andrew J.</creatorcontrib><title>Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency</title><title>Hearing research</title><addtitle>Hear Res</addtitle><description>Differences in fundamental frequency (F0) between voiced sounds are known to be a strong cue for stream segregation. However, speech consists of both voiced and unvoiced sounds, and less is known about whether and how the unvoiced portions are segregated. This study measured listeners' ability to integrate or segregate sequences of consonant-vowel tokens, comprising a voiceless fricative and a vowel, as a function of the F0 difference between interleaved sequences of tokens. A performance-based measure was used, in which listeners detected the presence of a repeated token either within one sequence or between the two sequences (measures of voluntary and obligatory streaming, respectively). The results showed a systematic increase of voluntary stream segregation as the F0 difference between the two interleaved sequences increased from 0 to 13 semitones, suggesting that F0 differences allowed listeners to segregate speech sounds, including the unvoiced portions. In contrast to the consistent effects of voluntary streaming, the trend towards obligatory stream segregation at large F0 differences failed to reach significance. Listeners were no longer able to perform the voluntary-streaming task reliably when the unvoiced portions were removed from the stimuli, suggesting that the unvoiced portions were used and correctly segregated in the original task. The results demonstrate that streaming based on F0 differences occurs for natural speech sounds, and that the unvoiced portions are correctly assigned to the corresponding voiced portions. •The stimuli used in the study consisted in an unvoiced fricative consonant and a voiced vowel (CV token).•Listeners could use a difference in F0 to segregate alternating CV tokens.•Evidence for both obligatory and voluntary stream segregation was found.•Listeners did not base their judgments on the vowel part only or the consonant part only.•Listeners were no longer able to perform the task without the fricative part of the stimuli.•Listeners were able to segregate the whole tokens based on ΔF0 despite the lack of F0 cues in the fricative part.</description><subject>Acoustic Stimulation</subject><subject>Acoustics</subject><subject>Adolescent</subject><subject>Adult</subject><subject>Audiometry, Speech</subject><subject>Auditory Threshold</subject><subject>Cues</subject><subject>Engineering Sciences</subject><subject>Female</subject><subject>Fundamental frequency</subject><subject>Humans</subject><subject>Male</subject><subject>Middle Aged</subject><subject>Speech Acoustics</subject><subject>Speech Intelligibility</subject><subject>Speech Perception</subject><subject>Speech sounds</subject><subject>Stream segregation</subject><subject>Voice Quality</subject><subject>Young Adult</subject><issn>0378-5955</issn><issn>1878-5891</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kU1v1DAQhi3Uii6Ff4CqHMthgx1_xRekqioUaSUObc-WY092vUrixU5W2n9fh10K5cBpPOOZZz5ehD4SXBJMxOdtuQETIZVV9kpCymzeoAWpZb3ktSJnaIHp_FacX6B3KW0xJpyy6i26qKSqqKRqgcwD_JxgGL3pijRGMH2RYB1hbUYfhiK0xT54C64wgyum4eSkHYDdFClMg0tFY1KO5ew2u6bPtAxr4y-wPbxH563pEnw42Uv09PXu8fZ-ufrx7fvtzWppmZLjUgkpgFtaK0GBASXWcddggQXBqmGKYiYbXoGrRS2lMTUVrQTWNIoR0TaMXqIvR-5uanpwNo8RTad30fcmHnQwXr_-GfxGr8Ne84oqyWgGfDoCNv-U3d-s9BzLB1ZYVnhPcu71qVkMec006t4nC11nBghT0qRmQlaE8hnLjqk2hpQitC9sgvWspN7qo5J6VlITMjfKZVd_r_NS9Fu6P_tCPureQ9TJ-nxwcD6CHbUL_v8dngHGeLKN</recordid><startdate>20170201</startdate><enddate>20170201</enddate><creator>David, Marion</creator><creator>Lavandier, Mathieu</creator><creator>Grimault, Nicolas</creator><creator>Oxenham, Andrew J.</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>1XC</scope><scope>VOOES</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-3586-4426</orcidid><orcidid>https://orcid.org/0009-0004-1339-8887</orcidid></search><sort><creationdate>20170201</creationdate><title>Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency</title><author>David, Marion ; Lavandier, Mathieu ; Grimault, Nicolas ; Oxenham, Andrew J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c497t-9676e5c38963e4e31cd5db0606109b493047b52ed86877aa836f7e4bb9416fb43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Acoustic Stimulation</topic><topic>Acoustics</topic><topic>Adolescent</topic><topic>Adult</topic><topic>Audiometry, Speech</topic><topic>Auditory Threshold</topic><topic>Cues</topic><topic>Engineering Sciences</topic><topic>Female</topic><topic>Fundamental frequency</topic><topic>Humans</topic><topic>Male</topic><topic>Middle Aged</topic><topic>Speech Acoustics</topic><topic>Speech Intelligibility</topic><topic>Speech Perception</topic><topic>Speech sounds</topic><topic>Stream segregation</topic><topic>Voice Quality</topic><topic>Young Adult</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>David, Marion</creatorcontrib><creatorcontrib>Lavandier, Mathieu</creatorcontrib><creatorcontrib>Grimault, Nicolas</creatorcontrib><creatorcontrib>Oxenham, Andrew J.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Hearing research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>David, Marion</au><au>Lavandier, Mathieu</au><au>Grimault, Nicolas</au><au>Oxenham, Andrew J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency</atitle><jtitle>Hearing research</jtitle><addtitle>Hear Res</addtitle><date>2017-02-01</date><risdate>2017</risdate><volume>344</volume><spage>235</spage><epage>243</epage><pages>235-243</pages><issn>0378-5955</issn><eissn>1878-5891</eissn><abstract>Differences in fundamental frequency (F0) between voiced sounds are known to be a strong cue for stream segregation. However, speech consists of both voiced and unvoiced sounds, and less is known about whether and how the unvoiced portions are segregated. This study measured listeners' ability to integrate or segregate sequences of consonant-vowel tokens, comprising a voiceless fricative and a vowel, as a function of the F0 difference between interleaved sequences of tokens. A performance-based measure was used, in which listeners detected the presence of a repeated token either within one sequence or between the two sequences (measures of voluntary and obligatory streaming, respectively). The results showed a systematic increase of voluntary stream segregation as the F0 difference between the two interleaved sequences increased from 0 to 13 semitones, suggesting that F0 differences allowed listeners to segregate speech sounds, including the unvoiced portions. In contrast to the consistent effects of voluntary streaming, the trend towards obligatory stream segregation at large F0 differences failed to reach significance. Listeners were no longer able to perform the voluntary-streaming task reliably when the unvoiced portions were removed from the stimuli, suggesting that the unvoiced portions were used and correctly segregated in the original task. The results demonstrate that streaming based on F0 differences occurs for natural speech sounds, and that the unvoiced portions are correctly assigned to the corresponding voiced portions. •The stimuli used in the study consisted in an unvoiced fricative consonant and a voiced vowel (CV token).•Listeners could use a difference in F0 to segregate alternating CV tokens.•Evidence for both obligatory and voluntary stream segregation was found.•Listeners did not base their judgments on the vowel part only or the consonant part only.•Listeners were no longer able to perform the task without the fricative part of the stimuli.•Listeners were able to segregate the whole tokens based on ΔF0 despite the lack of F0 cues in the fricative part.</abstract><cop>Netherlands</cop><pub>Elsevier B.V</pub><pmid>27923739</pmid><doi>10.1016/j.heares.2016.11.016</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0003-3586-4426</orcidid><orcidid>https://orcid.org/0009-0004-1339-8887</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0378-5955
ispartof	Hearing research, 2017-02, Vol.344, p.235-243
issn	0378-5955 1878-5891
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5239743
source	MEDLINE; Access via ScienceDirect (Elsevier)
subjects	Acoustic Stimulation Acoustics Adolescent Adult Audiometry, Speech Auditory Threshold Cues Engineering Sciences Female Fundamental frequency Humans Male Middle Aged Speech Acoustics Speech Intelligibility Speech Perception Speech sounds Stream segregation Voice Quality Young Adult
title	Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T13%3A08%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sequential%20stream%20segregation%20of%20voiced%20and%20unvoiced%20speech%20sounds%20based%20on%20fundamental%20frequency&rft.jtitle=Hearing%20research&rft.au=David,%20Marion&rft.date=2017-02-01&rft.volume=344&rft.spage=235&rft.epage=243&rft.pages=235-243&rft.issn=0378-5955&rft.eissn=1878-5891&rft_id=info:doi/10.1016/j.heares.2016.11.016&rft_dat=%3Cproquest_pubme%3E1846721353%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1846721353&rft_id=info:pmid/27923739&rft_els_id=S037859551630315X&rfr_iscdi=true