A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker

This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellati...

Bibliographic Details
Published in: The Journal of the Acoustical Society of America 2020-11, Vol.148 (5), p.3246-3254
Main Authors: Prud'homme, Luna; Lavandier, Mathieu; Best, Virginia
Format: Article
Language: eng
Subjects: Engineering Sciences; Psychological and Physiological Acoustics
Online Access: Full text
container_end_page 3254
container_issue 5
container_start_page 3246
container_title The Journal of the Acoustical Society of America
container_volume 148
creator Prud'homme, Luna
Lavandier, Mathieu
Best, Virginia
description This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.
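The description reports three fit statistics: the correlation between data and predictions, the mean absolute prediction error, and the largest absolute prediction error. As a minimal sketch of how such statistics could be computed, the following assumes a hypothetical helper `fit_metrics` and invented threshold values in dB; neither the function nor the numbers come from the article, and the article's own fitting procedure is not reproduced here.

```python
import math


def fit_metrics(measured, predicted):
    """Return (Pearson r, mean absolute error, largest absolute error).

    Hypothetical helper for illustration; not taken from the article.
    Both inputs are equal-length sequences of thresholds in dB.
    """
    if len(measured) != len(predicted) or len(measured) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(measured)
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    # Sums of centered products and squares for the Pearson correlation.
    cov = sum((m - mean_m) * (p - mean_p) for m, p in zip(measured, predicted))
    var_m = sum((m - mean_m) ** 2 for m in measured)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_m == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant input")
    r = cov / math.sqrt(var_m * var_p)
    abs_err = [abs(m - p) for m, p in zip(measured, predicted)]
    return r, sum(abs_err) / n, max(abs_err)


# Invented thresholds in dB, for illustration only (not data from the study).
measured = [-10.0, -8.0, -6.0, -4.0]
predicted = [-9.5, -8.5, -6.0, -4.0]
r, mean_abs_err, max_abs_err = fit_metrics(measured, predicted)
print(f"r = {r:.3f}, mean |error| = {mean_abs_err:.2f} dB, max |error| = {max_abs_err:.2f} dB")
```

A high correlation alone does not rule out a constant offset between predictions and data, which is presumably why the absolute errors are reported alongside it.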
doi_str_mv 10.1121/10.0002492
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1121_10_0002492</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2466294060</sourcerecordid><originalsourceid>FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</originalsourceid><addsrcrecordid>eNp9kU1vEzEQhi1ERdPChR-AfISiBX-t13tBiiqglSJxgbM167UTw-56sZ1I_fc4TYhKVXEaefz4GY1fhF5T8oFSRj-WSghhomXP0ILWjFSqZuI5WpQurUQr5Tm6SOlnOdaKty_QOedMUt6oBdJLvIE4hsmbysBk7DBA9mGqOki2x2Po7YBzwHO0vTcZp9las8F-yoX0a9_5wec7DGvwU8oYTjY8Qvpl40t05mBI9tWxXqIfXz5_v76pVt--3l4vV5URSuTKSVJTAANWdK5vVMM48NZJy13PwXJqRKesq21vjOENcaI10ChKW6a4pDW_RJ8O3nnbjYWyU44w6Dn6EeKdDuD1vzeT3-h12GlF2qahogjeHQSbR89uliu97xFOpBCE72hh3x6HxfB7a1PWo0_3XzfZsE2aCSlZK4gkBb06oCaGlKJ1Jzclep_evh7TK_Cbh0uc0L9xFeD9AUjG5_ucTswuxAcqPffuf_QTw_8AiSiykg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2466294060</pqid></control><display><type>article</type><title>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</title><source>AIP Journals Complete</source><source>Alma/SFX Local Collection</source><source>AIP Acoustical Society of America</source><creator>Prud'homme, Luna ; Lavandier, Mathieu ; Best, Virginia</creator><creatorcontrib>Prud'homme, Luna ; Lavandier, Mathieu ; Best, Virginia</creatorcontrib><description>This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. 
A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/10.0002492</identifier><identifier>PMID: 33261378</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><publisher>United States: Acoustical Society of America</publisher><subject>Engineering Sciences ; Psychological and Physiological Acoustics</subject><ispartof>The Journal of the Acoustical Society of America, 2020-11, Vol.148 (5), p.3246-3254</ispartof><rights>Acoustical Society of America</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><rights>2020 Acoustical Society of America. 
2020 Acoustical Society of America</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</citedby><cites>FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</cites><orcidid>0000-0002-5535-5736 ; 0000-0002-8195-6834 ; s0000000255355736 ; s0000000281956834</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/jasa/article-lookup/doi/10.1121/10.0002492$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>207,208,230,314,776,780,790,881,1559,4498,27901,27902,76126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33261378$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-03064403$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Prud'homme, Luna</creatorcontrib><creatorcontrib>Lavandier, Mathieu</creatorcontrib><creatorcontrib>Best, Virginia</creatorcontrib><title>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</title><title>The Journal of the Acoustical Society of America</title><addtitle>J Acoust Soc Am</addtitle><description>This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. 
The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.</description><subject>Engineering Sciences</subject><subject>Psychological and Physiological Acoustics</subject><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kU1vEzEQhi1ERdPChR-AfISiBX-t13tBiiqglSJxgbM167UTw-56sZ1I_fc4TYhKVXEaefz4GY1fhF5T8oFSRj-WSghhomXP0ILWjFSqZuI5WpQurUQr5Tm6SOlnOdaKty_QOedMUt6oBdJLvIE4hsmbysBk7DBA9mGqOki2x2Po7YBzwHO0vTcZp9las8F-yoX0a9_5wec7DGvwU8oYTjY8Qvpl40t05mBI9tWxXqIfXz5_v76pVt--3l4vV5URSuTKSVJTAANWdK5vVMM48NZJy13PwXJqRKesq21vjOENcaI10ChKW6a4pDW_RJ8O3nnbjYWyU44w6Dn6EeKdDuD1vzeT3-h12GlF2qahogjeHQSbR89uliu97xFOpBCE72hh3x6HxfB7a1PWo0_3XzfZsE2aCSlZK4gkBb06oCaGlKJ1Jzclep_evh7TK_Cbh0uc0L9xFeD9AUjG5_ucTswuxAcqPffuf_QTw_8AiSiykg</recordid><startdate>202011</startdate><enddate>202011</enddate><creator>Prud'homme, Luna</creator><creator>Lavandier, Mathieu</creator><creator>Best, Virginia</creator><general>Acoustical Society of 
America</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>1XC</scope><scope>VOOES</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-5535-5736</orcidid><orcidid>https://orcid.org/0000-0002-8195-6834</orcidid><orcidid>https://orcid.org/s0000000255355736</orcidid><orcidid>https://orcid.org/s0000000281956834</orcidid></search><sort><creationdate>202011</creationdate><title>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</title><author>Prud'homme, Luna ; Lavandier, Mathieu ; Best, Virginia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Engineering Sciences</topic><topic>Psychological and Physiological Acoustics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Prud'homme, Luna</creatorcontrib><creatorcontrib>Lavandier, Mathieu</creatorcontrib><creatorcontrib>Best, Virginia</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Prud'homme, Luna</au><au>Lavandier, Mathieu</au><au>Best, Virginia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><addtitle>J Acoust Soc 
Am</addtitle><date>2020-11</date><risdate>2020</risdate><volume>148</volume><issue>5</issue><spage>3246</spage><epage>3254</epage><pages>3246-3254</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.</abstract><cop>United States</cop><pub>Acoustical Society of America</pub><pmid>33261378</pmid><doi>10.1121/10.0002492</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0002-5535-5736</orcidid><orcidid>https://orcid.org/0000-0002-8195-6834</orcidid><orcidid>https://orcid.org/s0000000255355736</orcidid><orcidid>https://orcid.org/s0000000281956834</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2020-11, Vol.148 (5), p.3246-3254
issn 0001-4966
1520-8524
language eng
recordid cdi_crossref_primary_10_1121_10_0002492
source AIP Journals Complete; Alma/SFX Local Collection; AIP Acoustical Society of America
subjects Engineering Sciences
Psychological and Physiological Acoustics
title A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T11%3A21%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20harmonic-cancellation-based%20model%20to%20predict%20speech%20intelligibility%20against%20a%20harmonic%20masker&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Prud'homme,%20Luna&rft.date=2020-11&rft.volume=148&rft.issue=5&rft.spage=3246&rft.epage=3254&rft.pages=3246-3254&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/10.0002492&rft_dat=%3Cproquest_cross%3E2466294060%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2466294060&rft_id=info:pmid/33261378&rfr_iscdi=true