A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker
This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellati...
Gespeichert in:
Veröffentlicht in: | The Journal of the Acoustical Society of America 2020-11, Vol.148 (5), p.3246-3254 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 3254 |
---|---|
container_issue | 5 |
container_start_page | 3246 |
container_title | The Journal of the Acoustical Society of America |
container_volume | 148 |
creator | Prud'homme, Luna Lavandier, Mathieu Best, Virginia |
description | This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively. |
doi_str_mv | 10.1121/10.0002492 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1121_10_0002492</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2466294060</sourcerecordid><originalsourceid>FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</originalsourceid><addsrcrecordid>eNp9kU1vEzEQhi1ERdPChR-AfISiBX-t13tBiiqglSJxgbM167UTw-56sZ1I_fc4TYhKVXEaefz4GY1fhF5T8oFSRj-WSghhomXP0ILWjFSqZuI5WpQurUQr5Tm6SOlnOdaKty_QOedMUt6oBdJLvIE4hsmbysBk7DBA9mGqOki2x2Po7YBzwHO0vTcZp9las8F-yoX0a9_5wec7DGvwU8oYTjY8Qvpl40t05mBI9tWxXqIfXz5_v76pVt--3l4vV5URSuTKSVJTAANWdK5vVMM48NZJy13PwXJqRKesq21vjOENcaI10ChKW6a4pDW_RJ8O3nnbjYWyU44w6Dn6EeKdDuD1vzeT3-h12GlF2qahogjeHQSbR89uliu97xFOpBCE72hh3x6HxfB7a1PWo0_3XzfZsE2aCSlZK4gkBb06oCaGlKJ1Jzclep_evh7TK_Cbh0uc0L9xFeD9AUjG5_ucTswuxAcqPffuf_QTw_8AiSiykg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2466294060</pqid></control><display><type>article</type><title>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</title><source>AIP Journals Complete</source><source>Alma/SFX Local Collection</source><source>AIP Acoustical Society of America</source><creator>Prud'homme, Luna ; Lavandier, Mathieu ; Best, Virginia</creator><creatorcontrib>Prud'homme, Luna ; Lavandier, Mathieu ; Best, Virginia</creatorcontrib><description>This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/10.0002492</identifier><identifier>PMID: 33261378</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><publisher>United States: Acoustical Society of America</publisher><subject>Engineering Sciences ; Psychological and Physiological Acoustics</subject><ispartof>The Journal of the Acoustical Society of America, 2020-11, Vol.148 (5), p.3246-3254</ispartof><rights>Acoustical Society of America</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><rights>2020 Acoustical Society of America. 2020 Acoustical Society of America</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</citedby><cites>FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</cites><orcidid>0000-0002-5535-5736 ; 0000-0002-8195-6834 ; s0000000255355736 ; s0000000281956834</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/jasa/article-lookup/doi/10.1121/10.0002492$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>207,208,230,314,776,780,790,881,1559,4498,27901,27902,76126</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33261378$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-03064403$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Prud'homme, Luna</creatorcontrib><creatorcontrib>Lavandier, Mathieu</creatorcontrib><creatorcontrib>Best, Virginia</creatorcontrib><title>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</title><title>The Journal of the Acoustical Society of America</title><addtitle>J Acoust Soc Am</addtitle><description>This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.</description><subject>Engineering Sciences</subject><subject>Psychological and Physiological Acoustics</subject><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kU1vEzEQhi1ERdPChR-AfISiBX-t13tBiiqglSJxgbM167UTw-56sZ1I_fc4TYhKVXEaefz4GY1fhF5T8oFSRj-WSghhomXP0ILWjFSqZuI5WpQurUQr5Tm6SOlnOdaKty_QOedMUt6oBdJLvIE4hsmbysBk7DBA9mGqOki2x2Po7YBzwHO0vTcZp9las8F-yoX0a9_5wec7DGvwU8oYTjY8Qvpl40t05mBI9tWxXqIfXz5_v76pVt--3l4vV5URSuTKSVJTAANWdK5vVMM48NZJy13PwXJqRKesq21vjOENcaI10ChKW6a4pDW_RJ8O3nnbjYWyU44w6Dn6EeKdDuD1vzeT3-h12GlF2qahogjeHQSbR89uliu97xFOpBCE72hh3x6HxfB7a1PWo0_3XzfZsE2aCSlZK4gkBb06oCaGlKJ1Jzclep_evh7TK_Cbh0uc0L9xFeD9AUjG5_ucTswuxAcqPffuf_QTw_8AiSiykg</recordid><startdate>202011</startdate><enddate>202011</enddate><creator>Prud'homme, Luna</creator><creator>Lavandier, Mathieu</creator><creator>Best, Virginia</creator><general>Acoustical Society of America</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>1XC</scope><scope>VOOES</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-5535-5736</orcidid><orcidid>https://orcid.org/0000-0002-8195-6834</orcidid><orcidid>https://orcid.org/s0000000255355736</orcidid><orcidid>https://orcid.org/s0000000281956834</orcidid></search><sort><creationdate>202011</creationdate><title>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</title><author>Prud'homme, Luna ; Lavandier, Mathieu ; Best, Virginia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c484t-f6051aacae4bfd78723a39f6e3fd3ae31c4b8ef5edccc370f49ca781192836153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Engineering Sciences</topic><topic>Psychological and Physiological Acoustics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Prud'homme, Luna</creatorcontrib><creatorcontrib>Lavandier, Mathieu</creatorcontrib><creatorcontrib>Best, Virginia</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Prud'homme, Luna</au><au>Lavandier, Mathieu</au><au>Best, Virginia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><addtitle>J Acoust Soc Am</addtitle><date>2020-11</date><risdate>2020</risdate><volume>148</volume><issue>5</issue><spage>3246</spage><epage>3254</epage><pages>3246-3254</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>This work aims to predict speech intelligibility against harmonic maskers. Unlike noise maskers, harmonic maskers (including speech) have a harmonic structure that may allow for a release from masking based on fundamental frequency (F0). Mechanisms, such as spectral glimpsing and harmonic cancellation, have been proposed to explain F0 segregation, but their relative contributions and ability to predict behavioral data have not been explored. A speech intelligibility model was developed that includes both spectral glimpsing and harmonic cancellation. The model was used to fit the data of two experiments from Deroche, Culling, Chatterjee, and Limb [J. Acoust. Soc. Am. 135, 2873–2884 (2014)], in which speech reception thresholds were measured for stationary harmonic maskers varying in their F0 and degree of harmonicity. Key model parameters (jitter in the masker F0, shape of the cancellation filter, frequency limit for cancellation, and signal-to-noise ratio ceiling) were optimized by maximizing the correspondence between the predictions and data. The model was able to accurately describe the effects associated with varying the masker F0 and harmonicity. Across both experiments, the correlation between data and predictions was 0.99, and the mean and largest absolute prediction errors were lower than 0.5 and 1 dB, respectively.</abstract><cop>United States</cop><pub>Acoustical Society of America</pub><pmid>33261378</pmid><doi>10.1121/10.0002492</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0002-5535-5736</orcidid><orcidid>https://orcid.org/0000-0002-8195-6834</orcidid><orcidid>https://orcid.org/s0000000255355736</orcidid><orcidid>https://orcid.org/s0000000281956834</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0001-4966 |
ispartof | The Journal of the Acoustical Society of America, 2020-11, Vol.148 (5), p.3246-3254 |
issn | 0001-4966 1520-8524 |
language | eng |
recordid | cdi_crossref_primary_10_1121_10_0002492 |
source | AIP Journals Complete; Alma/SFX Local Collection; AIP Acoustical Society of America |
subjects | Engineering Sciences Psychological and Physiological Acoustics |
title | A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T11%3A21%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20harmonic-cancellation-based%20model%20to%20predict%20speech%20intelligibility%20against%20a%20harmonic%20masker&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Prud'homme,%20Luna&rft.date=2020-11&rft.volume=148&rft.issue=5&rft.spage=3246&rft.epage=3254&rft.pages=3246-3254&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/10.0002492&rft_dat=%3Cproquest_cross%3E2466294060%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2466294060&rft_id=info:pmid/33261378&rfr_iscdi=true |