Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties

This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in s...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of voice 2020-05, Vol.34 (3), p.305-319
Hauptverfasser: Kitayama, Itsuki, Hosokawa, Kiyohito, Iwahashi, Toshihiko, Iwahashi, Mio, Iwaki, Shinobu, Kato, Chieri, Yoshida, Misao, Umatani, Masanori, Matsushiro, Naoki, Ogawa, Makoto, Inohara, Hidenori
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 319
container_issue 3
container_start_page 305
container_title Journal of voice
container_volume 34
creator Kitayama, Itsuki
Hosokawa, Kiyohito
Iwahashi, Toshihiko
Iwahashi, Mio
Iwaki, Shinobu
Kato, Chieri
Yoshida, Misao
Umatani, Masanori
Matsushiro, Naoki
Ogawa, Makoto
Inohara, Hidenori
description This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels. This was a retrospective study. Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels. Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels. The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.
doi_str_mv 10.1016/j.jvoice.2018.09.021
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2129534329</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0892199718302455</els_id><sourcerecordid>2129534329</sourcerecordid><originalsourceid>FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</originalsourceid><addsrcrecordid>eNp9kE1P3DAQhi3UChboP0CVjz2QdPyRD1-Q0La0K1EVqR9Xy3Em4G0Sp7YXlX-PV0t77GkO8z7zah5CLhiUDFj9fltuH72zWHJgbQmqBM6OyIq1jShk1bavyApaxQumVHNCTmPcAgDP22NyIkC0irVqRZbNnDAk_JPoTxOc6dzo0hP1A_02eZ8esKdrXGIKZqR3aH7Ru-AnN-Ns8ZJ-wfTg-0iTp2s_p-BHukmX1Mx9npF-cOZ-9jE5u6eWXOMwnpPXgxkjvnmZZ-THzcfv68_F7ddPm_X1bWFFzVPBK7DGGNk1pq5NL2xlWlFJMUA3WNkA7wcQNZOCAZdcKtuZzDW247VC1VXijLw73F2C_73DmPTkosVxNDP6XdSccVUJKbjKUXmI2uBjDDjoJbjJhCfNQO9d660-uNZ71xqUzq4z9valYddN2P-D_srNgatDAPOfjw6DjtbtzfUuoE269-7_Dc_yUpJo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2129534329</pqid></control><display><type>article</type><title>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><creator>Kitayama, Itsuki ; Hosokawa, Kiyohito ; Iwahashi, Toshihiko ; Iwahashi, Mio ; Iwaki, Shinobu ; Kato, Chieri ; Yoshida, Misao ; Umatani, Masanori ; Matsushiro, Naoki ; Ogawa, Makoto ; Inohara, Hidenori</creator><creatorcontrib>Kitayama, Itsuki ; Hosokawa, Kiyohito ; Iwahashi, Toshihiko ; Iwahashi, Mio ; Iwaki, Shinobu ; Kato, Chieri ; Yoshida, Misao ; Umatani, Masanori ; Matsushiro, Naoki ; Ogawa, Makoto ; Inohara, Hidenori</creatorcontrib><description>This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels. This was a retrospective study. Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels. Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels. The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.</description><identifier>ISSN: 0892-1997</identifier><identifier>EISSN: 1873-4588</identifier><identifier>DOI: 10.1016/j.jvoice.2018.09.021</identifier><identifier>PMID: 30389189</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Acoustics ; Adult ; Aged ; Cepstral peak prominence ; Concurrent validity ; Continuous speech ; Diagnostic accuracy ; Dysphonia - diagnosis ; Dysphonia - physiopathology ; Female ; Hoarseness - diagnosis ; Hoarseness - physiopathology ; Humans ; Intertext variability ; Male ; Middle Aged ; Predictive Value of Tests ; Retrospective Studies ; Sound Spectrography ; Speech Acoustics ; Speech Production Measurement ; Sustained vowel ; Voice Quality</subject><ispartof>Journal of voice, 2020-05, Vol.34 (3), p.305-319</ispartof><rights>2018 The Voice Foundation</rights><rights>Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</citedby><cites>FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</cites><orcidid>0000-0002-3991-9626 ; 0000-0002-1628-3856 ; 0000-0002-7074-1280</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jvoice.2018.09.021$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30389189$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kitayama, Itsuki</creatorcontrib><creatorcontrib>Hosokawa, Kiyohito</creatorcontrib><creatorcontrib>Iwahashi, Toshihiko</creatorcontrib><creatorcontrib>Iwahashi, Mio</creatorcontrib><creatorcontrib>Iwaki, Shinobu</creatorcontrib><creatorcontrib>Kato, Chieri</creatorcontrib><creatorcontrib>Yoshida, Misao</creatorcontrib><creatorcontrib>Umatani, Masanori</creatorcontrib><creatorcontrib>Matsushiro, Naoki</creatorcontrib><creatorcontrib>Ogawa, Makoto</creatorcontrib><creatorcontrib>Inohara, Hidenori</creatorcontrib><title>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</title><title>Journal of voice</title><addtitle>J Voice</addtitle><description>This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels. This was a retrospective study. Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels. Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels. The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.</description><subject>Acoustics</subject><subject>Adult</subject><subject>Aged</subject><subject>Cepstral peak prominence</subject><subject>Concurrent validity</subject><subject>Continuous speech</subject><subject>Diagnostic accuracy</subject><subject>Dysphonia - diagnosis</subject><subject>Dysphonia - physiopathology</subject><subject>Female</subject><subject>Hoarseness - diagnosis</subject><subject>Hoarseness - physiopathology</subject><subject>Humans</subject><subject>Intertext variability</subject><subject>Male</subject><subject>Middle Aged</subject><subject>Predictive Value of Tests</subject><subject>Retrospective Studies</subject><subject>Sound Spectrography</subject><subject>Speech Acoustics</subject><subject>Speech Production Measurement</subject><subject>Sustained vowel</subject><subject>Voice Quality</subject><issn>0892-1997</issn><issn>1873-4588</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kE1P3DAQhi3UChboP0CVjz2QdPyRD1-Q0La0K1EVqR9Xy3Em4G0Sp7YXlX-PV0t77GkO8z7zah5CLhiUDFj9fltuH72zWHJgbQmqBM6OyIq1jShk1bavyApaxQumVHNCTmPcAgDP22NyIkC0irVqRZbNnDAk_JPoTxOc6dzo0hP1A_02eZ8esKdrXGIKZqR3aH7Ru-AnN-Ns8ZJ-wfTg-0iTp2s_p-BHukmX1Mx9npF-cOZ-9jE5u6eWXOMwnpPXgxkjvnmZZ-THzcfv68_F7ddPm_X1bWFFzVPBK7DGGNk1pq5NL2xlWlFJMUA3WNkA7wcQNZOCAZdcKtuZzDW247VC1VXijLw73F2C_73DmPTkosVxNDP6XdSccVUJKbjKUXmI2uBjDDjoJbjJhCfNQO9d660-uNZ71xqUzq4z9valYddN2P-D_srNgatDAPOfjw6DjtbtzfUuoE269-7_Dc_yUpJo</recordid><startdate>202005</startdate><enddate>202005</enddate><creator>Kitayama, Itsuki</creator><creator>Hosokawa, Kiyohito</creator><creator>Iwahashi, Toshihiko</creator><creator>Iwahashi, Mio</creator><creator>Iwaki, Shinobu</creator><creator>Kato, Chieri</creator><creator>Yoshida, Misao</creator><creator>Umatani, Masanori</creator><creator>Matsushiro, Naoki</creator><creator>Ogawa, Makoto</creator><creator>Inohara, Hidenori</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-3991-9626</orcidid><orcidid>https://orcid.org/0000-0002-1628-3856</orcidid><orcidid>https://orcid.org/0000-0002-7074-1280</orcidid></search><sort><creationdate>202005</creationdate><title>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</title><author>Kitayama, Itsuki ; Hosokawa, Kiyohito ; Iwahashi, Toshihiko ; Iwahashi, Mio ; Iwaki, Shinobu ; Kato, Chieri ; Yoshida, Misao ; Umatani, Masanori ; Matsushiro, Naoki ; Ogawa, Makoto ; Inohara, Hidenori</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustics</topic><topic>Adult</topic><topic>Aged</topic><topic>Cepstral peak prominence</topic><topic>Concurrent validity</topic><topic>Continuous speech</topic><topic>Diagnostic accuracy</topic><topic>Dysphonia - diagnosis</topic><topic>Dysphonia - physiopathology</topic><topic>Female</topic><topic>Hoarseness - diagnosis</topic><topic>Hoarseness - physiopathology</topic><topic>Humans</topic><topic>Intertext variability</topic><topic>Male</topic><topic>Middle Aged</topic><topic>Predictive Value of Tests</topic><topic>Retrospective Studies</topic><topic>Sound Spectrography</topic><topic>Speech Acoustics</topic><topic>Speech Production Measurement</topic><topic>Sustained vowel</topic><topic>Voice Quality</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kitayama, Itsuki</creatorcontrib><creatorcontrib>Hosokawa, Kiyohito</creatorcontrib><creatorcontrib>Iwahashi, Toshihiko</creatorcontrib><creatorcontrib>Iwahashi, Mio</creatorcontrib><creatorcontrib>Iwaki, Shinobu</creatorcontrib><creatorcontrib>Kato, Chieri</creatorcontrib><creatorcontrib>Yoshida, Misao</creatorcontrib><creatorcontrib>Umatani, Masanori</creatorcontrib><creatorcontrib>Matsushiro, Naoki</creatorcontrib><creatorcontrib>Ogawa, Makoto</creatorcontrib><creatorcontrib>Inohara, Hidenori</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of voice</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kitayama, Itsuki</au><au>Hosokawa, Kiyohito</au><au>Iwahashi, Toshihiko</au><au>Iwahashi, Mio</au><au>Iwaki, Shinobu</au><au>Kato, Chieri</au><au>Yoshida, Misao</au><au>Umatani, Masanori</au><au>Matsushiro, Naoki</au><au>Ogawa, Makoto</au><au>Inohara, Hidenori</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</atitle><jtitle>Journal of voice</jtitle><addtitle>J Voice</addtitle><date>2020-05</date><risdate>2020</risdate><volume>34</volume><issue>3</issue><spage>305</spage><epage>319</epage><pages>305-319</pages><issn>0892-1997</issn><eissn>1873-4588</eissn><abstract>This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels. This was a retrospective study. Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels. Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels. The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>30389189</pmid><doi>10.1016/j.jvoice.2018.09.021</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-3991-9626</orcidid><orcidid>https://orcid.org/0000-0002-1628-3856</orcidid><orcidid>https://orcid.org/0000-0002-7074-1280</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0892-1997
ispartof Journal of voice, 2020-05, Vol.34 (3), p.305-319
issn 0892-1997
1873-4588
language eng
recordid cdi_proquest_miscellaneous_2129534329
source MEDLINE; Access via ScienceDirect (Elsevier)
subjects Acoustics
Adult
Aged
Cepstral peak prominence
Concurrent validity
Continuous speech
Diagnostic accuracy
Dysphonia - diagnosis
Dysphonia - physiopathology
Female
Hoarseness - diagnosis
Hoarseness - physiopathology
Humans
Intertext variability
Male
Middle Aged
Predictive Value of Tests
Retrospective Studies
Sound Spectrography
Speech Acoustics
Speech Production Measurement
Sustained vowel
Voice Quality
title Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T09%3A45%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Intertext%20Variability%20of%20Smoothed%20Cepstral%20Peak%20Prominence,%20Methods%20to%20Control%20It,%20and%20Its%20Diagnostic%20Properties&rft.jtitle=Journal%20of%20voice&rft.au=Kitayama,%20Itsuki&rft.date=2020-05&rft.volume=34&rft.issue=3&rft.spage=305&rft.epage=319&rft.pages=305-319&rft.issn=0892-1997&rft.eissn=1873-4588&rft_id=info:doi/10.1016/j.jvoice.2018.09.021&rft_dat=%3Cproquest_cross%3E2129534329%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2129534329&rft_id=info:pmid/30389189&rft_els_id=S0892199718302455&rfr_iscdi=true