Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System

Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perce...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2020, Vol.28, p.519-528
Hauptverfasser: Ghasemzadeh, Hamzeh, Arjmandi, Meisam K.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 528
container_issue
container_start_page 519
container_title IEEE/ACM transactions on audio, speech, and language processing
container_volume 28
creator Ghasemzadeh, Hamzeh
Arjmandi, Meisam K.
description Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.
doi_str_mv 10.1109/TASLP.2019.2959222
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TASLP_2019_2959222</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8932600</ieee_id><sourcerecordid>2338695138</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</originalsourceid><addsrcrecordid>eNo9kFtPAjEQhRujiQT5A_rSxOfFXvZW3whR2QQFAz43ZbfFEnaLbVez_nqLiz7NTPKdmTMHgGuMxhgjdreerObLMUGYjQlLGCHkDAwIJSxiFMXnfz1h6BKMnNshhDDKGMviAfhemy9hK7g4eF23NXxtReO10qXw2jTQKLgU_t3szbaLiqZqS1nBF6OddPdw0sCi-ZTO6-0_XTTK2Lofn7VzAd90cNbWooGTttLe2A6uOudlfQUulNg7OTrVIXh7fFhPZ9F88VRMJ_OoDN_4qFI4FxUSRKDgmDBRMpVmaYrVZlOVcSZTRrI8JSWNURIQhamkTCSVZCiPqaBDcNvvPVjz0Qa7fGda24STnFCapyzBNA8U6anSGuesVPxgdS1sxzHix5j5b8z8GDM_xRxEN71ISyn_BTmjJEWI_gA7AXpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2338695138</pqid></control><display><type>article</type><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><source>IEEE Electronic Library (IEL)</source><creator>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</creator><creatorcontrib>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</creatorcontrib><description>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2019.2959222</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Acoustic analysis ; Acoustic measurements ; Acoustic noise ; Acoustics ; cepstral analysis ; Estimation ; instrumental assessment of voice ; Mel frequency cepstral coefficient ; Noise ; Noise measurement ; Parameter estimation ; Pathology ; Perception ; Quality ; Resonant frequencies ; Signal resolution ; Spectral sensitivity ; Vocal tract ; Voice ; voice disorder ; Wavelet analysis ; Wavelet transforms ; wavelet-based noise estimation</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.519-528</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</citedby><cites>FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</cites><orcidid>0000-0001-5395-1908 ; 0000-0002-4368-9106</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8932600$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,4024,27923,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8932600$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ghasemzadeh, Hamzeh</creatorcontrib><creatorcontrib>Arjmandi, Meisam K.</creatorcontrib><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</description><subject>Acoustic analysis</subject><subject>Acoustic measurements</subject><subject>Acoustic noise</subject><subject>Acoustics</subject><subject>cepstral analysis</subject><subject>Estimation</subject><subject>instrumental assessment of voice</subject><subject>Mel frequency cepstral coefficient</subject><subject>Noise</subject><subject>Noise measurement</subject><subject>Parameter estimation</subject><subject>Pathology</subject><subject>Perception</subject><subject>Quality</subject><subject>Resonant frequencies</subject><subject>Signal resolution</subject><subject>Spectral sensitivity</subject><subject>Vocal tract</subject><subject>Voice</subject><subject>voice disorder</subject><subject>Wavelet analysis</subject><subject>Wavelet transforms</subject><subject>wavelet-based noise estimation</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kFtPAjEQhRujiQT5A_rSxOfFXvZW3whR2QQFAz43ZbfFEnaLbVez_nqLiz7NTPKdmTMHgGuMxhgjdreerObLMUGYjQlLGCHkDAwIJSxiFMXnfz1h6BKMnNshhDDKGMviAfhemy9hK7g4eF23NXxtReO10qXw2jTQKLgU_t3szbaLiqZqS1nBF6OddPdw0sCi-ZTO6-0_XTTK2Lofn7VzAd90cNbWooGTttLe2A6uOudlfQUulNg7OTrVIXh7fFhPZ9F88VRMJ_OoDN_4qFI4FxUSRKDgmDBRMpVmaYrVZlOVcSZTRrI8JSWNURIQhamkTCSVZCiPqaBDcNvvPVjz0Qa7fGda24STnFCapyzBNA8U6anSGuesVPxgdS1sxzHix5j5b8z8GDM_xRxEN71ISyn_BTmjJEWI_gA7AXpA</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Ghasemzadeh, Hamzeh</creator><creator>Arjmandi, Meisam K.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-5395-1908</orcidid><orcidid>https://orcid.org/0000-0002-4368-9106</orcidid></search><sort><creationdate>2020</creationdate><title>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</title><author>Ghasemzadeh, Hamzeh ; Arjmandi, Meisam K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-df18ad0a2a097429ac9f67661fbbdc47e6927862c3405097f13e39a5de90843a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustic analysis</topic><topic>Acoustic measurements</topic><topic>Acoustic noise</topic><topic>Acoustics</topic><topic>cepstral analysis</topic><topic>Estimation</topic><topic>instrumental assessment of voice</topic><topic>Mel frequency cepstral coefficient</topic><topic>Noise</topic><topic>Noise measurement</topic><topic>Parameter estimation</topic><topic>Pathology</topic><topic>Perception</topic><topic>Quality</topic><topic>Resonant frequencies</topic><topic>Signal resolution</topic><topic>Spectral sensitivity</topic><topic>Vocal tract</topic><topic>Voice</topic><topic>voice disorder</topic><topic>Wavelet analysis</topic><topic>Wavelet transforms</topic><topic>wavelet-based noise estimation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ghasemzadeh, Hamzeh</creatorcontrib><creatorcontrib>Arjmandi, Meisam K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghasemzadeh, Hamzeh</au><au>Arjmandi, Meisam K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle><stitle>TASLP</stitle><date>2020</date><risdate>2020</risdate><volume>28</volume><spage>519</spage><epage>528</epage><pages>519-528</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Clinical diagnosis of voice disorder and evaluation of therapy outcome heavily rely on accurate quantification of voice quality, which is closely tied to the physiology and function of the laryngeal mechanism. Considering the evaluation methodology of the voice, two main categories of auditory-perceptual assessment and acoustic analysis can be identified. This article presents a new approach for acoustic analysis of voice quality, which brings several advantages to the field. The proposed approach is non-parametric in the sense that it does not require the estimation of the fundamental frequency or spectral response of the vocal tract. This reduces the computational complexity of the measurement and reduces the possible errors due to inaccurate estimation of those parameters. Additionally, the method does not make any assumption about the phonetic context and hence has the potential to be applied to connected speech. The proposed method benefits from the multiresolution structure of the wavelet analysis for estimating the noisy component of a voice in the spectro-temporal domain. The informativeness of the estimated noise for voice quality distinction is examined based on different noise-quantification approaches. It is shown that deviation from the model of the human auditory system (HAS) leads to performance improvement. Through several analyses, it is argued that using models of HAS for quantification of the noise leads to significant loss of information relevant to voice quality. Findings from this article suggest that perception-based measures of voice quality are highly restricted in capturing important aspects of acoustic that could assist with voice quality distinctions. This characteristic is inherent to HAS and cannot be alleviated, highlighting a significant limitation of perception-based measures.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2019.2959222</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-5395-1908</orcidid><orcidid>https://orcid.org/0000-0002-4368-9106</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2329-9290
ispartof IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.519-528
issn 2329-9290
2329-9304
language eng
recordid cdi_crossref_primary_10_1109_TASLP_2019_2959222
source IEEE Electronic Library (IEL)
subjects Acoustic analysis
Acoustic measurements
Acoustic noise
Acoustics
cepstral analysis
Estimation
instrumental assessment of voice
Mel frequency cepstral coefficient
Noise
Noise measurement
Parameter estimation
Pathology
Perception
Quality
Resonant frequencies
Signal resolution
Spectral sensitivity
Vocal tract
Voice
voice disorder
Wavelet analysis
Wavelet transforms
wavelet-based noise estimation
title Toward Optimum Quantification of Pathology-Induced Noises: An Investigation of Information Missed by Human Auditory System
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T08%3A23%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Toward%20Optimum%20Quantification%20of%20Pathology-Induced%20Noises:%20An%20Investigation%20of%20Information%20Missed%20by%20Human%20Auditory%20System&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Ghasemzadeh,%20Hamzeh&rft.date=2020&rft.volume=28&rft.spage=519&rft.epage=528&rft.pages=519-528&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2019.2959222&rft_dat=%3Cproquest_RIE%3E2338695138%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2338695138&rft_id=info:pmid/&rft_ieee_id=8932600&rfr_iscdi=true