Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System

Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perce...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2020, Vol.28, p.519-528
Hauptverfasser:	Ghasemzadeh, Hamzeh, Arjmandi, Meisam K.
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic analysis Acoustic measurements Acoustic noise Acoustics cepstral analysis Estimation instrumental assessment of voice Mel frequency cepstral coefficient Noise Noise measurement Parameter estimation Pathology Perception Quality Resonant frequencies Signal resolution Spectral sensitivity Vocal tract Voice voice disorder Wavelet analysis Wavelet transforms wavelet-based noise estimation
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	528
container_issue
container_start_page	519
container_title	IEEE/ACM transactions on audio, speech, and language processing
container_volume	28
creator	Ghasemzadeh, Hamzeh Arjmandi, Meisam K.
description	Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.
doi_str_mv	10.1109/TASLP.2019.2959222
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TASLP_2019_2959222</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8932600</ieee_id><sourcerecordid>2338695138</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</originalsourceid><addsrcrecordid>eNo9kFtPAjEQhRujiQT5A_rSxOfFXvZW3whR2QQFAz43ZbfFEnaLbVez_nqLiz7NTPKdmTMHgGuMxhgjdreerObLMUGYjQlLGCHkDAwIJSxiFMXnfz1h6BKMnNshhDDKGMviAfhemy9hK7g4eF23NXxtReO10qXw2jTQKLgU_t3szbaLiqZqS1nBF6OddPdw0sCi-ZTO6-0_XTTK2Lofn7VzAd90cNbWooGTttLe2A6uOudlfQUulNg7OTrVIXh7fFhPZ9F88VRMJ_OoDN_4qFI4FxUSRKDgmDBRMpVmaYrVZlOVcSZTRrI8JSWNURIQhamkTCSVZCiPqaBDcNvvPVjz0Qa7fGda24STnFCapyzBNA8U6anSGuesVPxgdS1sxzHix5j5b8z8GDM_xRxEN71ISyn_BTmjJEWI_gA7AXpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2338695138</pqid></control><display><type>article</type><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><source>IEEE Electronic Library (IEL)</source><creator>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</creator><creatorcontrib>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</creatorcontrib><description>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2019.2959222</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Acoustic analysis ; Acoustic measurements ; Acoustic noise ; Acoustics ; cepstral analysis ; Estimation ; instrumental assessment of voice ; Mel frequency cepstral coefficient ; Noise ; Noise measurement ; Parameter estimation ; Pathology ; Perception ; Quality ; Resonant frequencies ; Signal resolution ; Spectral sensitivity ; Vocal tract ; Voice ; voice disorder ; Wavelet analysis ; Wavelet transforms ; wavelet-based noise estimation</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.519-528</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</citedby><cites>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</cites><orcidid>0000-0001-5395-1908 ; 0000-0002-4368-9106</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8932600$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,4024,27923,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8932600$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ghasemzadeh, Hamzeh</creatorcontrib><creatorcontrib>Arjmandi, Meisam K.</creatorcontrib><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</description><subject>Acoustic analysis</subject><subject>Acoustic measurements</subject><subject>Acoustic noise</subject><subject>Acoustics</subject><subject>cepstral analysis</subject><subject>Estimation</subject><subject>instrumental assessment of voice</subject><subject>Mel frequency cepstral coefficient</subject><subject>Noise</subject><subject>Noise measurement</subject><subject>Parameter estimation</subject><subject>Pathology</subject><subject>Perception</subject><subject>Quality</subject><subject>Resonant frequencies</subject><subject>Signal resolution</subject><subject>Spectral sensitivity</subject><subject>Vocal tract</subject><subject>Voice</subject><subject>voice disorder</subject><subject>Wavelet analysis</subject><subject>Wavelet transforms</subject><subject>wavelet-based noise estimation</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kFtPAjEQhRujiQT5A_rSxOfFXvZW3whR2QQFAz43ZbfFEnaLbVez_nqLiz7NTPKdmTMHgGuMxhgjdreerObLMUGYjQlLGCHkDAwIJSxiFMXnfz1h6BKMnNshhDDKGMviAfhemy9hK7g4eF23NXxtReO10qXw2jTQKLgU_t3szbaLiqZqS1nBF6OddPdw0sCi-ZTO6-0_XTTK2Lofn7VzAd90cNbWooGTttLe2A6uOudlfQUulNg7OTrVIXh7fFhPZ9F88VRMJ_OoDN_4qFI4FxUSRKDgmDBRMpVmaYrVZlOVcSZTRrI8JSWNURIQhamkTCSVZCiPqaBDcNvvPVjz0Qa7fGda24STnFCapyzBNA8U6anSGuesVPxgdS1sxzHix5j5b8z8GDM_xRxEN71ISyn_BTmjJEWI_gA7AXpA</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Ghasemzadeh, Hamzeh</creator><creator>Arjmandi, Meisam K.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-5395-1908</orcidid><orcidid>https://orcid.org/0000-0002-4368-9106</orcidid></search><sort><creationdate>2020</creationdate><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><author>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustic analysis</topic><topic>Acoustic measurements</topic><topic>Acoustic noise</topic><topic>Acoustics</topic><topic>cepstral analysis</topic><topic>Estimation</topic><topic>instrumental assessment of voice</topic><topic>Mel frequency cepstral coefficient</topic><topic>Noise</topic><topic>Noise measurement</topic><topic>Parameter estimation</topic><topic>Pathology</topic><topic>Perception</topic><topic>Quality</topic><topic>Resonant frequencies</topic><topic>Signal resolution</topic><topic>Spectral sensitivity</topic><topic>Vocal tract</topic><topic>Voice</topic><topic>voice disorder</topic><topic>Wavelet analysis</topic><topic>Wavelet transforms</topic><topic>wavelet-based noise estimation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ghasemzadeh, Hamzeh</creatorcontrib><creatorcontrib>Arjmandi, Meisam K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghasemzadeh, Hamzeh</au><au>Arjmandi, Meisam K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle><stitle>TASLP</stitle><date>2020</date><risdate>2020</risdate><volume>28</volume><spage>519</spage><epage>528</epage><pages>519-528</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2019.2959222</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-5395-1908</orcidid><orcidid>https://orcid.org/0000-0002-4368-9106</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 2329-9290
ispartof	IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.519-528
issn	2329-9290 2329-9304
language	eng
recordid	cdi_crossref_primary_10_1109_TASLP_2019_2959222
source	IEEE Electronic Library (IEL)
subjects	Acoustic analysis Acoustic measurements Acoustic noise Acoustics cepstral analysis Estimation instrumental assessment of voice Mel frequency cepstral coefficient Noise Noise measurement Parameter estimation Pathology Perception Quality Resonant frequencies Signal resolution Spectral sensitivity Vocal tract Voice voice disorder Wavelet analysis Wavelet transforms wavelet-based noise estimation
title	Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T08%3A23%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Toward%20Optimum%20Quantification%20of%20Pathology-Induced%20Noises:%20An%20Investigation%20of%20Information%20Missed%20by%20Human%20Auditory%20System&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Ghasemzadeh,%20Hamzeh&rft.date=2020&rft.volume=28&rft.spage=519&rft.epage=528&rft.pages=519-528&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2019.2959222&rft_dat=%3Cproquest_RIE%3E2338695138%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2338695138&rft_id=info:pmid/&rft_ieee_id=8932600&rfr_iscdi=true