An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution

The purpose of this paper is to introduce an iterative nonlinear weighted method based on the variation in spectral energy distribution present in a voice signal to differentiate between four voice types: type 1 voice signals are nearly periodic, type 2 voice signals have strong modulations and subh...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of voice 2019-09, Vol.33 (5), p.591-602
Hauptverfasser: Liu, Boquan, Polce, Evan, Jiang, Jack
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 602
container_issue 5
container_start_page 591
container_title Journal of voice
container_volume 33
creator Liu, Boquan
Polce, Evan
Jiang, Jack
description The purpose of this paper is to introduce an iterative nonlinear weighted method based on the variation in spectral energy distribution present in a voice signal to differentiate between four voice types: type 1 voice signals are nearly periodic, type 2 voice signals have strong modulations and subharmonics, type 3 signals are chaotic, and type 4 signals are dominated by stochastic noise. A total of 135 voice signal samples of the sustained vowel /a/ were obtained from the Disordered Voice Database and then individually categorized into the appropriate voice types based on the classification system described in Sprecher et al (2010). Voice samples were analyzed using the nonlinear methods of spectrum convergence ratio, rate of divergence, and nonlinear energy difference ratio (NEDR) to investigate classifier efficacy. An iterative nonlinear weighted method based on the derivative of instantaneous frequency and Fourier transformations is applied to calculate spectral energy distributions. The distribution is then used to calculate the NEDR to classify voice signal types. Statistical analysis revealed that NEDR effectively differentiated between all four voice types (P 
doi_str_mv 10.1016/j.jvoice.2018.02.011
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2042749091</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0892199717305702</els_id><sourcerecordid>2042749091</sourcerecordid><originalsourceid>FETCH-LOGICAL-c362t-5852e849891c25f0c61dc95dd9fcdcdf51bd3066fa03a8f277cff9345f4338c73</originalsourceid><addsrcrecordid>eNp9kMtLAzEQxoMoWh__gUiOXnbNYx_JRai1PkCooPbgJaTJpGRpd2uyLfS_N6Xq0dMMM983H_ND6JKSnBJa3TR5s-m8gZwRKnLCckLpARpQUfOsKIU4RAMiJMuolPUJOo2xIYSwtD1GJ0zWopS8GqDPYYsnswZM7zeAX3XQS-gh4L7Do4WO0bstnu5i8Juft3oR8Z2OYHHX4qkOXvc-db7F4xbCfIvvfeyDn61343N05JIBLn7qGfp4GL-PnrKXyePzaPiSGV6xPitFyUAUUkhqWOmIqag1srRWOmONdSWdWU6qymnCtXCsro1zkhelKzgXpuZn6Hp_dxW6rzXEXi19NLBY6Ba6dVSMFKwuJJE0SYu91IQuxgBOrYJf6rBVlKgdVdWoPVW1o6oIU4lqsl39JKxnS7B_pl-MSXC7F0D6c-MhqGg8tAasDwmtsp3_P-Eb9MSKuA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2042749091</pqid></control><display><type>article</type><title>An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals</source><creator>Liu, Boquan ; Polce, Evan ; Jiang, Jack</creator><creatorcontrib>Liu, Boquan ; Polce, Evan ; Jiang, Jack</creatorcontrib><description>The purpose of this paper is to introduce an iterative nonlinear weighted method based on the variation in spectral energy distribution present in a voice signal to differentiate between four voice types: type 1 voice signals are nearly periodic, type 2 voice signals have strong modulations and subharmonics, type 3 signals are chaotic, and type 4 signals are dominated by stochastic noise. A total of 135 voice signal samples of the sustained vowel /a/ were obtained from the Disordered Voice Database and then individually categorized into the appropriate voice types based on the classification system described in Sprecher et al (2010). Voice samples were analyzed using the nonlinear methods of spectrum convergence ratio, rate of divergence, and nonlinear energy difference ratio (NEDR) to investigate classifier efficacy. An iterative nonlinear weighted method based on the derivative of instantaneous frequency and Fourier transformations is applied to calculate spectral energy distributions. The distribution is then used to calculate the NEDR to classify voice signal types. Statistical analysis revealed that NEDR effectively differentiated between all four voice types (P &lt; 0.001). Subsequent multiclass receiver operating characteristic analysis demonstrated that NEDR (area under the curve [95% CI] = 0.99 [0.96–1.0]) possessed the greatest classification accuracy relative to spectrum convergence ratio and rate of divergence. NEDR was shown to be an effective metric for objective differentiation between all four voice signal types. NEDR calculations occurred approximately instantaneously, constituting a substantial improvement over the tedious computational time required for calculation of previous nonlinear parameters. This metric could assist clinicians in the diagnosis of voice disorders and monitor the efficacy of treatment through observation of voice acoustical improvement over time.</description><identifier>ISSN: 0892-1997</identifier><identifier>EISSN: 1873-4588</identifier><identifier>DOI: 10.1016/j.jvoice.2018.02.011</identifier><identifier>PMID: 29785936</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Acoustics ; Adolescent ; Adult ; Aged ; Aged, 80 and over ; Chaos ; Databases, Factual ; Derivative of instantaneous frequency ; Female ; Humans ; Male ; Middle Aged ; Nonlinear Dynamics ; Nonlinear energy difference ratio ; Nonlinear weighted ; Phonation ; Signal Processing, Computer-Assisted ; Sound Spectrography ; Speech Acoustics ; Speech Production Measurement ; Voice Quality ; Voice signal classification ; Young Adult</subject><ispartof>Journal of voice, 2019-09, Vol.33 (5), p.591-602</ispartof><rights>2018 The Voice Foundation</rights><rights>Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c362t-5852e849891c25f0c61dc95dd9fcdcdf51bd3066fa03a8f277cff9345f4338c73</citedby><cites>FETCH-LOGICAL-c362t-5852e849891c25f0c61dc95dd9fcdcdf51bd3066fa03a8f277cff9345f4338c73</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0892199717305702$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29785936$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Liu, Boquan</creatorcontrib><creatorcontrib>Polce, Evan</creatorcontrib><creatorcontrib>Jiang, Jack</creatorcontrib><title>An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution</title><title>Journal of voice</title><addtitle>J Voice</addtitle><description>The purpose of this paper is to introduce an iterative nonlinear weighted method based on the variation in spectral energy distribution present in a voice signal to differentiate between four voice types: type 1 voice signals are nearly periodic, type 2 voice signals have strong modulations and subharmonics, type 3 signals are chaotic, and type 4 signals are dominated by stochastic noise. A total of 135 voice signal samples of the sustained vowel /a/ were obtained from the Disordered Voice Database and then individually categorized into the appropriate voice types based on the classification system described in Sprecher et al (2010). Voice samples were analyzed using the nonlinear methods of spectrum convergence ratio, rate of divergence, and nonlinear energy difference ratio (NEDR) to investigate classifier efficacy. An iterative nonlinear weighted method based on the derivative of instantaneous frequency and Fourier transformations is applied to calculate spectral energy distributions. The distribution is then used to calculate the NEDR to classify voice signal types. Statistical analysis revealed that NEDR effectively differentiated between all four voice types (P &lt; 0.001). Subsequent multiclass receiver operating characteristic analysis demonstrated that NEDR (area under the curve [95% CI] = 0.99 [0.96–1.0]) possessed the greatest classification accuracy relative to spectrum convergence ratio and rate of divergence. NEDR was shown to be an effective metric for objective differentiation between all four voice signal types. NEDR calculations occurred approximately instantaneously, constituting a substantial improvement over the tedious computational time required for calculation of previous nonlinear parameters. This metric could assist clinicians in the diagnosis of voice disorders and monitor the efficacy of treatment through observation of voice acoustical improvement over time.</description><subject>Acoustics</subject><subject>Adolescent</subject><subject>Adult</subject><subject>Aged</subject><subject>Aged, 80 and over</subject><subject>Chaos</subject><subject>Databases, Factual</subject><subject>Derivative of instantaneous frequency</subject><subject>Female</subject><subject>Humans</subject><subject>Male</subject><subject>Middle Aged</subject><subject>Nonlinear Dynamics</subject><subject>Nonlinear energy difference ratio</subject><subject>Nonlinear weighted</subject><subject>Phonation</subject><subject>Signal Processing, Computer-Assisted</subject><subject>Sound Spectrography</subject><subject>Speech Acoustics</subject><subject>Speech Production Measurement</subject><subject>Voice Quality</subject><subject>Voice signal classification</subject><subject>Young Adult</subject><issn>0892-1997</issn><issn>1873-4588</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kMtLAzEQxoMoWh__gUiOXnbNYx_JRai1PkCooPbgJaTJpGRpd2uyLfS_N6Xq0dMMM983H_ND6JKSnBJa3TR5s-m8gZwRKnLCckLpARpQUfOsKIU4RAMiJMuolPUJOo2xIYSwtD1GJ0zWopS8GqDPYYsnswZM7zeAX3XQS-gh4L7Do4WO0bstnu5i8Juft3oR8Z2OYHHX4qkOXvc-db7F4xbCfIvvfeyDn61343N05JIBLn7qGfp4GL-PnrKXyePzaPiSGV6xPitFyUAUUkhqWOmIqag1srRWOmONdSWdWU6qymnCtXCsro1zkhelKzgXpuZn6Hp_dxW6rzXEXi19NLBY6Ba6dVSMFKwuJJE0SYu91IQuxgBOrYJf6rBVlKgdVdWoPVW1o6oIU4lqsl39JKxnS7B_pl-MSXC7F0D6c-MhqGg8tAasDwmtsp3_P-Eb9MSKuA</recordid><startdate>201909</startdate><enddate>201909</enddate><creator>Liu, Boquan</creator><creator>Polce, Evan</creator><creator>Jiang, Jack</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201909</creationdate><title>An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution</title><author>Liu, Boquan ; Polce, Evan ; Jiang, Jack</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c362t-5852e849891c25f0c61dc95dd9fcdcdf51bd3066fa03a8f277cff9345f4338c73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Acoustics</topic><topic>Adolescent</topic><topic>Adult</topic><topic>Aged</topic><topic>Aged, 80 and over</topic><topic>Chaos</topic><topic>Databases, Factual</topic><topic>Derivative of instantaneous frequency</topic><topic>Female</topic><topic>Humans</topic><topic>Male</topic><topic>Middle Aged</topic><topic>Nonlinear Dynamics</topic><topic>Nonlinear energy difference ratio</topic><topic>Nonlinear weighted</topic><topic>Phonation</topic><topic>Signal Processing, Computer-Assisted</topic><topic>Sound Spectrography</topic><topic>Speech Acoustics</topic><topic>Speech Production Measurement</topic><topic>Voice Quality</topic><topic>Voice signal classification</topic><topic>Young Adult</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Liu, Boquan</creatorcontrib><creatorcontrib>Polce, Evan</creatorcontrib><creatorcontrib>Jiang, Jack</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of voice</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Liu, Boquan</au><au>Polce, Evan</au><au>Jiang, Jack</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution</atitle><jtitle>Journal of voice</jtitle><addtitle>J Voice</addtitle><date>2019-09</date><risdate>2019</risdate><volume>33</volume><issue>5</issue><spage>591</spage><epage>602</epage><pages>591-602</pages><issn>0892-1997</issn><eissn>1873-4588</eissn><abstract>The purpose of this paper is to introduce an iterative nonlinear weighted method based on the variation in spectral energy distribution present in a voice signal to differentiate between four voice types: type 1 voice signals are nearly periodic, type 2 voice signals have strong modulations and subharmonics, type 3 signals are chaotic, and type 4 signals are dominated by stochastic noise. A total of 135 voice signal samples of the sustained vowel /a/ were obtained from the Disordered Voice Database and then individually categorized into the appropriate voice types based on the classification system described in Sprecher et al (2010). Voice samples were analyzed using the nonlinear methods of spectrum convergence ratio, rate of divergence, and nonlinear energy difference ratio (NEDR) to investigate classifier efficacy. An iterative nonlinear weighted method based on the derivative of instantaneous frequency and Fourier transformations is applied to calculate spectral energy distributions. The distribution is then used to calculate the NEDR to classify voice signal types. Statistical analysis revealed that NEDR effectively differentiated between all four voice types (P &lt; 0.001). Subsequent multiclass receiver operating characteristic analysis demonstrated that NEDR (area under the curve [95% CI] = 0.99 [0.96–1.0]) possessed the greatest classification accuracy relative to spectrum convergence ratio and rate of divergence. NEDR was shown to be an effective metric for objective differentiation between all four voice signal types. NEDR calculations occurred approximately instantaneously, constituting a substantial improvement over the tedious computational time required for calculation of previous nonlinear parameters. This metric could assist clinicians in the diagnosis of voice disorders and monitor the efficacy of treatment through observation of voice acoustical improvement over time.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>29785936</pmid><doi>10.1016/j.jvoice.2018.02.011</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0892-1997
ispartof Journal of voice, 2019-09, Vol.33 (5), p.591-602
issn 0892-1997
1873-4588
language eng
recordid cdi_proquest_miscellaneous_2042749091
source MEDLINE; Elsevier ScienceDirect Journals
subjects Acoustics
Adolescent
Adult
Aged
Aged, 80 and over
Chaos
Databases, Factual
Derivative of instantaneous frequency
Female
Humans
Male
Middle Aged
Nonlinear Dynamics
Nonlinear energy difference ratio
Nonlinear weighted
Phonation
Signal Processing, Computer-Assisted
Sound Spectrography
Speech Acoustics
Speech Production Measurement
Voice Quality
Voice signal classification
Young Adult
title An Objective Parameter to Classify Voice Signals Based on Variation in Energy Distribution
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-16T11%3A42%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20Objective%20Parameter%20to%20Classify%20Voice%20Signals%20Based%20on%20Variation%20in%20Energy%20Distribution&rft.jtitle=Journal%20of%20voice&rft.au=Liu,%20Boquan&rft.date=2019-09&rft.volume=33&rft.issue=5&rft.spage=591&rft.epage=602&rft.pages=591-602&rft.issn=0892-1997&rft.eissn=1873-4588&rft_id=info:doi/10.1016/j.jvoice.2018.02.011&rft_dat=%3Cproquest_cross%3E2042749091%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2042749091&rft_id=info:pmid/29785936&rft_els_id=S0892199717305702&rfr_iscdi=true