Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System
Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perce...
Gespeichert in:
Veröffentlicht in: | IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2020, Vol.28, p.519-528 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 528 |
---|---|
container_issue | |
container_start_page | 519 |
container_title | IEEE/ACM transactions on audio, speech, and language processing |
container_volume | 28 |
creator | Ghasemzadeh, Hamzeh Arjmandi, Meisam K. |
description | Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures. |
doi_str_mv | 10.1109/TASLP.2019.2959222 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TASLP_2019_2959222</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8932600</ieee_id><sourcerecordid>2338695138</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</originalsourceid><addsrcrecordid>eNo9kFtPAjEQhRujiQT5A_rSxOfFXvZW3whR2QQFAz43ZbfFEnaLbVez_nqLiz7NTPKdmTMHgGuMxhgjdreerObLMUGYjQlLGCHkDAwIJSxiFMXnfz1h6BKMnNshhDDKGMviAfhemy9hK7g4eF23NXxtReO10qXw2jTQKLgU_t3szbaLiqZqS1nBF6OddPdw0sCi-ZTO6-0_XTTK2Lofn7VzAd90cNbWooGTttLe2A6uOudlfQUulNg7OTrVIXh7fFhPZ9F88VRMJ_OoDN_4qFI4FxUSRKDgmDBRMpVmaYrVZlOVcSZTRrI8JSWNURIQhamkTCSVZCiPqaBDcNvvPVjz0Qa7fGda24STnFCapyzBNA8U6anSGuesVPxgdS1sxzHix5j5b8z8GDM_xRxEN71ISyn_BTmjJEWI_gA7AXpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2338695138</pqid></control><display><type>article</type><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><source>IEEE Electronic Library (IEL)</source><creator>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</creator><creatorcontrib>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</creatorcontrib><description>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2019.2959222</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Acoustic analysis ; Acoustic measurements ; Acoustic noise ; Acoustics ; cepstral analysis ; Estimation ; instrumental assessment of voice ; Mel frequency cepstral coefficient ; Noise ; Noise measurement ; Parameter estimation ; Pathology ; Perception ; Quality ; Resonant frequencies ; Signal resolution ; Spectral sensitivity ; Vocal tract ; Voice ; voice disorder ; Wavelet analysis ; Wavelet transforms ; wavelet-based noise estimation</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.519-528</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</citedby><cites>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</cites><orcidid>0000-0001-5395-1908 ; 0000-0002-4368-9106</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8932600$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,4024,27923,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8932600$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ghasemzadeh, Hamzeh</creatorcontrib><creatorcontrib>Arjmandi, Meisam K.</creatorcontrib><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</description><subject>Acoustic analysis</subject><subject>Acoustic measurements</subject><subject>Acoustic noise</subject><subject>Acoustics</subject><subject>cepstral analysis</subject><subject>Estimation</subject><subject>instrumental assessment of voice</subject><subject>Mel frequency cepstral coefficient</subject><subject>Noise</subject><subject>Noise measurement</subject><subject>Parameter estimation</subject><subject>Pathology</subject><subject>Perception</subject><subject>Quality</subject><subject>Resonant frequencies</subject><subject>Signal resolution</subject><subject>Spectral sensitivity</subject><subject>Vocal tract</subject><subject>Voice</subject><subject>voice disorder</subject><subject>Wavelet analysis</subject><subject>Wavelet transforms</subject><subject>wavelet-based noise estimation</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kFtPAjEQhRujiQT5A_rSxOfFXvZW3whR2QQFAz43ZbfFEnaLbVez_nqLiz7NTPKdmTMHgGuMxhgjdreerObLMUGYjQlLGCHkDAwIJSxiFMXnfz1h6BKMnNshhDDKGMviAfhemy9hK7g4eF23NXxtReO10qXw2jTQKLgU_t3szbaLiqZqS1nBF6OddPdw0sCi-ZTO6-0_XTTK2Lofn7VzAd90cNbWooGTttLe2A6uOudlfQUulNg7OTrVIXh7fFhPZ9F88VRMJ_OoDN_4qFI4FxUSRKDgmDBRMpVmaYrVZlOVcSZTRrI8JSWNURIQhamkTCSVZCiPqaBDcNvvPVjz0Qa7fGda24STnFCapyzBNA8U6anSGuesVPxgdS1sxzHix5j5b8z8GDM_xRxEN71ISyn_BTmjJEWI_gA7AXpA</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Ghasemzadeh, Hamzeh</creator><creator>Arjmandi, Meisam K.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-5395-1908</orcidid><orcidid>https://orcid.org/0000-0002-4368-9106</orcidid></search><sort><creationdate>2020</creationdate><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><author>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustic analysis</topic><topic>Acoustic measurements</topic><topic>Acoustic noise</topic><topic>Acoustics</topic><topic>cepstral analysis</topic><topic>Estimation</topic><topic>instrumental assessment of voice</topic><topic>Mel frequency cepstral coefficient</topic><topic>Noise</topic><topic>Noise measurement</topic><topic>Parameter estimation</topic><topic>Pathology</topic><topic>Perception</topic><topic>Quality</topic><topic>Resonant frequencies</topic><topic>Signal resolution</topic><topic>Spectral sensitivity</topic><topic>Vocal tract</topic><topic>Voice</topic><topic>voice disorder</topic><topic>Wavelet analysis</topic><topic>Wavelet transforms</topic><topic>wavelet-based noise estimation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ghasemzadeh, Hamzeh</creatorcontrib><creatorcontrib>Arjmandi, Meisam K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghasemzadeh, Hamzeh</au><au>Arjmandi, Meisam K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle><stitle>TASLP</stitle><date>2020</date><risdate>2020</risdate><volume>28</volume><spage>519</spage><epage>528</epage><pages>519-528</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2019.2959222</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-5395-1908</orcidid><orcidid>https://orcid.org/0000-0002-4368-9106</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 2329-9290 |
ispartof | IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.519-528 |
issn | 2329-9290 2329-9304 |
language | eng |
recordid | cdi_crossref_primary_10_1109_TASLP_2019_2959222 |
source | IEEE Electronic Library (IEL) |
subjects | Acoustic analysis Acoustic measurements Acoustic noise Acoustics cepstral analysis Estimation instrumental assessment of voice Mel frequency cepstral coefficient Noise Noise measurement Parameter estimation Pathology Perception Quality Resonant frequencies Signal resolution Spectral sensitivity Vocal tract Voice voice disorder Wavelet analysis Wavelet transforms wavelet-based noise estimation |
title | Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T08%3A23%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Toward%20Optimum%20Quantification%20of%20Pathology-Induced%20Noises:%20An%20Investigation%20of%20Information%20Missed%20by%20Human%20Auditory%20System&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Ghasemzadeh,%20Hamzeh&rft.date=2020&rft.volume=28&rft.spage=519&rft.epage=528&rft.pages=519-528&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2019.2959222&rft_dat=%3Cproquest_RIE%3E2338695138%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2338695138&rft_id=info:pmid/&rft_ieee_id=8932600&rfr_iscdi=true |