Speech coherence index with real speech and reverberation

Speech Coherence Index (SCI) is a proposed method of estimating speech intelligibility in real time with program material. The coherence of the complex-valued transfer function is used to estimate the signal-to-noise ratio on a per-frequency basis. The transfer function is calculated using short tim...

Bibliographic details
Published in: The Journal of the Acoustical Society of America 2017-05, Vol.141 (5), p.3782-3782
Main authors: Szuts, Tobi A.; Schwenke, Roger W.
Format: Article
Language: eng
Online access: Full text
container_end_page 3782
container_issue 5
container_start_page 3782
container_title The Journal of the Acoustical Society of America
container_volume 141
creator Szuts, Tobi A.
Schwenke, Roger W.
description Speech Coherence Index (SCI) is a proposed method of estimating speech intelligibility in real time with program material. The coherence of the complex-valued transfer function is used to estimate the signal-to-noise ratio on a per-frequency basis. The transfer function is calculated using short time windows at high frequencies and longer time windows at low frequencies to mimic the multi-resolution nature of human hearing. SCI has been shown to produce results identical to those of the Speech Transmission Index (STI) in the case of pure noise interference. Like STI, SCI has been shown to decrease when a single reflection arrives at a longer latency or with greater magnitude. SCI always decreases monotonically with single-reflection latency, whereas STI varies up and down at extremely long latencies. For simulated reverberation, both STI and SCI have been shown to decrease with increasing reverberation time and reverberant level. However, SCI is more sensitive than STI to direct-to-reverberant level. This paper will compare SCI to STI under more realistic conditions, such as speech signals and real-world impulse responses. The effect of signal crest factor will also be examined.
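The abstract does not state how coherence is turned into a signal-to-noise ratio. A minimal sketch, assuming the textbook relation for a linear system with uncorrelated additive noise at the output (magnitude-squared coherence γ² = SNR / (1 + SNR), hence SNR = γ² / (1 − γ²)), might look as follows. The function names are hypothetical. The second function applies the STI-style clipping of SNR to ±15 dB and rescaling to the range 0 to 1; whether SCI uses the same mapping is an assumption, not something the record confirms.

```python
import math


def coherence_to_snr_db(msc: float) -> float:
    """Convert magnitude-squared coherence (0..1) at one frequency to SNR in dB.

    Assumption: linear system with uncorrelated additive output noise,
    so that msc = SNR / (1 + SNR). Not taken from the abstract.
    """
    if not 0.0 <= msc <= 1.0:
        raise ValueError("magnitude-squared coherence must lie in [0, 1]")
    if msc == 0.0:
        return -math.inf  # no coherent signal at all
    if msc == 1.0:
        return math.inf   # perfectly coherent, no noise
    return 10.0 * math.log10(msc / (1.0 - msc))


def snr_to_transmission_index(snr_db: float) -> float:
    """STI-style mapping: clip SNR to [-15, +15] dB, then rescale to [0, 1].

    Assumption: SCI is hypothesised here to reuse this STI convention.
    """
    clipped = max(-15.0, min(15.0, snr_db))
    return (clipped + 15.0) / 30.0


if __name__ == "__main__":
    for msc in (0.1, 0.5, 0.9, 0.99):
        snr = coherence_to_snr_db(msc)
        print(f"coherence={msc:.2f}  SNR={snr:6.2f} dB  "
              f"index={snr_to_transmission_index(snr):.3f}")
```

Under these assumptions a coherence of 0.5 corresponds to equal signal and noise power (0 dB), which maps to the midpoint of the index range.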
doi_str_mv 10.1121/1.4988322
format Article
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2017-05, Vol.141 (5), p.3782-3782
issn 0001-4966
1520-8524
language eng
recordid cdi_scitation_primary_10_1121_1_4988322
source AIP Journals Complete; Alma/SFX Local Collection; AIP Acoustical Society of America
title Speech coherence index with real speech and reverberation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T16%3A34%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-scitation_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Speech%20coherence%20index%20with%20real%20speech%20and%20reverberation&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Szuts,%20Tobi%20A.&rft.date=2017-05&rft.volume=141&rft.issue=5&rft.spage=3782&rft.epage=3782&rft.pages=3782-3782&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.4988322&rft_dat=%3Cscitation_cross%3Ejasa%3C/scitation_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true