Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties
This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in s...
Gespeichert in:
Veröffentlicht in: | Journal of voice 2020-05, Vol.34 (3), p.305-319 |
---|---|
Hauptverfasser: | , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 319 |
---|---|
container_issue | 3 |
container_start_page | 305 |
container_title | Journal of voice |
container_volume | 34 |
creator | Kitayama, Itsuki Hosokawa, Kiyohito Iwahashi, Toshihiko Iwahashi, Mio Iwaki, Shinobu Kato, Chieri Yoshida, Misao Umatani, Masanori Matsushiro, Naoki Ogawa, Makoto Inohara, Hidenori |
description | This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels.
This was a retrospective study.
Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels.
Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels.
The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values. |
doi_str_mv | 10.1016/j.jvoice.2018.09.021 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2129534329</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0892199718302455</els_id><sourcerecordid>2129534329</sourcerecordid><originalsourceid>FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</originalsourceid><addsrcrecordid>eNp9kE1P3DAQhi3UChboP0CVjz2QdPyRD1-Q0La0K1EVqR9Xy3Em4G0Sp7YXlX-PV0t77GkO8z7zah5CLhiUDFj9fltuH72zWHJgbQmqBM6OyIq1jShk1bavyApaxQumVHNCTmPcAgDP22NyIkC0irVqRZbNnDAk_JPoTxOc6dzo0hP1A_02eZ8esKdrXGIKZqR3aH7Ru-AnN-Ns8ZJ-wfTg-0iTp2s_p-BHukmX1Mx9npF-cOZ-9jE5u6eWXOMwnpPXgxkjvnmZZ-THzcfv68_F7ddPm_X1bWFFzVPBK7DGGNk1pq5NL2xlWlFJMUA3WNkA7wcQNZOCAZdcKtuZzDW247VC1VXijLw73F2C_73DmPTkosVxNDP6XdSccVUJKbjKUXmI2uBjDDjoJbjJhCfNQO9d660-uNZ71xqUzq4z9valYddN2P-D_srNgatDAPOfjw6DjtbtzfUuoE269-7_Dc_yUpJo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2129534329</pqid></control><display><type>article</type><title>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><creator>Kitayama, Itsuki ; Hosokawa, Kiyohito ; Iwahashi, Toshihiko ; Iwahashi, Mio ; Iwaki, Shinobu ; Kato, Chieri ; Yoshida, Misao ; Umatani, Masanori ; Matsushiro, Naoki ; Ogawa, Makoto ; Inohara, Hidenori</creator><creatorcontrib>Kitayama, Itsuki ; Hosokawa, Kiyohito ; Iwahashi, Toshihiko ; Iwahashi, Mio ; Iwaki, Shinobu ; Kato, Chieri ; Yoshida, Misao ; Umatani, Masanori ; Matsushiro, Naoki ; Ogawa, Makoto ; Inohara, Hidenori</creatorcontrib><description>This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels.
This was a retrospective study.
Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels.
Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels.
The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.</description><identifier>ISSN: 0892-1997</identifier><identifier>EISSN: 1873-4588</identifier><identifier>DOI: 10.1016/j.jvoice.2018.09.021</identifier><identifier>PMID: 30389189</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Acoustics ; Adult ; Aged ; Cepstral peak prominence ; Concurrent validity ; Continuous speech ; Diagnostic accuracy ; Dysphonia - diagnosis ; Dysphonia - physiopathology ; Female ; Hoarseness - diagnosis ; Hoarseness - physiopathology ; Humans ; Intertext variability ; Male ; Middle Aged ; Predictive Value of Tests ; Retrospective Studies ; Sound Spectrography ; Speech Acoustics ; Speech Production Measurement ; Sustained vowel ; Voice Quality</subject><ispartof>Journal of voice, 2020-05, Vol.34 (3), p.305-319</ispartof><rights>2018 The Voice Foundation</rights><rights>Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</citedby><cites>FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</cites><orcidid>0000-0002-3991-9626 ; 0000-0002-1628-3856 ; 0000-0002-7074-1280</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jvoice.2018.09.021$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30389189$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kitayama, Itsuki</creatorcontrib><creatorcontrib>Hosokawa, Kiyohito</creatorcontrib><creatorcontrib>Iwahashi, Toshihiko</creatorcontrib><creatorcontrib>Iwahashi, Mio</creatorcontrib><creatorcontrib>Iwaki, Shinobu</creatorcontrib><creatorcontrib>Kato, Chieri</creatorcontrib><creatorcontrib>Yoshida, Misao</creatorcontrib><creatorcontrib>Umatani, Masanori</creatorcontrib><creatorcontrib>Matsushiro, Naoki</creatorcontrib><creatorcontrib>Ogawa, Makoto</creatorcontrib><creatorcontrib>Inohara, Hidenori</creatorcontrib><title>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</title><title>Journal of voice</title><addtitle>J Voice</addtitle><description>This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels.
This was a retrospective study.
Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels.
Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels.
The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.</description><subject>Acoustics</subject><subject>Adult</subject><subject>Aged</subject><subject>Cepstral peak prominence</subject><subject>Concurrent validity</subject><subject>Continuous speech</subject><subject>Diagnostic accuracy</subject><subject>Dysphonia - diagnosis</subject><subject>Dysphonia - physiopathology</subject><subject>Female</subject><subject>Hoarseness - diagnosis</subject><subject>Hoarseness - physiopathology</subject><subject>Humans</subject><subject>Intertext variability</subject><subject>Male</subject><subject>Middle Aged</subject><subject>Predictive Value of Tests</subject><subject>Retrospective Studies</subject><subject>Sound Spectrography</subject><subject>Speech Acoustics</subject><subject>Speech Production Measurement</subject><subject>Sustained vowel</subject><subject>Voice Quality</subject><issn>0892-1997</issn><issn>1873-4588</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kE1P3DAQhi3UChboP0CVjz2QdPyRD1-Q0La0K1EVqR9Xy3Em4G0Sp7YXlX-PV0t77GkO8z7zah5CLhiUDFj9fltuH72zWHJgbQmqBM6OyIq1jShk1bavyApaxQumVHNCTmPcAgDP22NyIkC0irVqRZbNnDAk_JPoTxOc6dzo0hP1A_02eZ8esKdrXGIKZqR3aH7Ru-AnN-Ns8ZJ-wfTg-0iTp2s_p-BHukmX1Mx9npF-cOZ-9jE5u6eWXOMwnpPXgxkjvnmZZ-THzcfv68_F7ddPm_X1bWFFzVPBK7DGGNk1pq5NL2xlWlFJMUA3WNkA7wcQNZOCAZdcKtuZzDW247VC1VXijLw73F2C_73DmPTkosVxNDP6XdSccVUJKbjKUXmI2uBjDDjoJbjJhCfNQO9d660-uNZ71xqUzq4z9valYddN2P-D_srNgatDAPOfjw6DjtbtzfUuoE269-7_Dc_yUpJo</recordid><startdate>202005</startdate><enddate>202005</enddate><creator>Kitayama, Itsuki</creator><creator>Hosokawa, Kiyohito</creator><creator>Iwahashi, Toshihiko</creator><creator>Iwahashi, Mio</creator><creator>Iwaki, Shinobu</creator><creator>Kato, Chieri</creator><creator>Yoshida, Misao</creator><creator>Umatani, Masanori</creator><creator>Matsushiro, Naoki</creator><creator>Ogawa, Makoto</creator><creator>Inohara, Hidenori</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-3991-9626</orcidid><orcidid>https://orcid.org/0000-0002-1628-3856</orcidid><orcidid>https://orcid.org/0000-0002-7074-1280</orcidid></search><sort><creationdate>202005</creationdate><title>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</title><author>Kitayama, Itsuki ; Hosokawa, Kiyohito ; Iwahashi, Toshihiko ; Iwahashi, Mio ; Iwaki, Shinobu ; Kato, Chieri ; Yoshida, Misao ; Umatani, Masanori ; Matsushiro, Naoki ; Ogawa, Makoto ; Inohara, Hidenori</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c362t-250caaa4b7a66ad3c5a83543f0bfc4702df0361431024249cba3627cb269e9b53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustics</topic><topic>Adult</topic><topic>Aged</topic><topic>Cepstral peak prominence</topic><topic>Concurrent validity</topic><topic>Continuous speech</topic><topic>Diagnostic accuracy</topic><topic>Dysphonia - diagnosis</topic><topic>Dysphonia - physiopathology</topic><topic>Female</topic><topic>Hoarseness - diagnosis</topic><topic>Hoarseness - physiopathology</topic><topic>Humans</topic><topic>Intertext variability</topic><topic>Male</topic><topic>Middle Aged</topic><topic>Predictive Value of Tests</topic><topic>Retrospective Studies</topic><topic>Sound Spectrography</topic><topic>Speech Acoustics</topic><topic>Speech Production Measurement</topic><topic>Sustained vowel</topic><topic>Voice Quality</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kitayama, Itsuki</creatorcontrib><creatorcontrib>Hosokawa, Kiyohito</creatorcontrib><creatorcontrib>Iwahashi, Toshihiko</creatorcontrib><creatorcontrib>Iwahashi, Mio</creatorcontrib><creatorcontrib>Iwaki, Shinobu</creatorcontrib><creatorcontrib>Kato, Chieri</creatorcontrib><creatorcontrib>Yoshida, Misao</creatorcontrib><creatorcontrib>Umatani, Masanori</creatorcontrib><creatorcontrib>Matsushiro, Naoki</creatorcontrib><creatorcontrib>Ogawa, Makoto</creatorcontrib><creatorcontrib>Inohara, Hidenori</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of voice</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kitayama, Itsuki</au><au>Hosokawa, Kiyohito</au><au>Iwahashi, Toshihiko</au><au>Iwahashi, Mio</au><au>Iwaki, Shinobu</au><au>Kato, Chieri</au><au>Yoshida, Misao</au><au>Umatani, Masanori</au><au>Matsushiro, Naoki</au><au>Ogawa, Makoto</au><au>Inohara, Hidenori</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties</atitle><jtitle>Journal of voice</jtitle><addtitle>J Voice</addtitle><date>2020-05</date><risdate>2020</risdate><volume>34</volume><issue>3</issue><spage>305</spage><epage>319</epage><pages>305-319</pages><issn>0892-1997</issn><eissn>1873-4588</eissn><abstract>This study aimed to estimate the intertext variability of smoothed cepstral peak prominence (CPPS), examine whether sound-processing techniques improved its variability and diagnostic capability, and evaluate the degree of intertext variability in detail with reference to the CPPS variabilities in sustained vowels.
This was a retrospective study.
Text readings of 58 Japanese syllables were recorded from 210 speakers with different diagnoses and varying degrees of dysphonia, and were divided into six passages. Applying the sound-processing techniques to those passages, we prepared three sample types: (1) nonprocessed, (2) only-loud, and (3) only-voiced samples. The intertext CPPS variability and diagnostic properties were compared across the passages and sample types. For detailed analysis, we subsequently extracted 63 normophonic speakers who maintained constant quality in their vowel utterances to evaluate the degree of intertext CPPS variability in relation to the variabilities between repeated identical vowels and across different vowels.
Although several combinations of passages showed moderate-to-large CPPS variabilities, those variabilities were decreased by either technique, especially the deletion of silent segments, which resulted in the best diagnostic accuracy. The degree of intertext CPPS variability for the only-voiced samples was comparable to that of the CPPS variabilities in sustained vowels.
The sound-processing technique removing silent segments should be applied to enhance the diagnostic properties of CPPS. The additional technique of deleting unvoiced segments is worth adopting if clinicians and researchers seek to attenuate the influence of text differences in calculating CPPS values.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>30389189</pmid><doi>10.1016/j.jvoice.2018.09.021</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-3991-9626</orcidid><orcidid>https://orcid.org/0000-0002-1628-3856</orcidid><orcidid>https://orcid.org/0000-0002-7074-1280</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0892-1997 |
ispartof | Journal of voice, 2020-05, Vol.34 (3), p.305-319 |
issn | 0892-1997 1873-4588 |
language | eng |
recordid | cdi_proquest_miscellaneous_2129534329 |
source | MEDLINE; Access via ScienceDirect (Elsevier) |
subjects | Acoustics Adult Aged Cepstral peak prominence Concurrent validity Continuous speech Diagnostic accuracy Dysphonia - diagnosis Dysphonia - physiopathology Female Hoarseness - diagnosis Hoarseness - physiopathology Humans Intertext variability Male Middle Aged Predictive Value of Tests Retrospective Studies Sound Spectrography Speech Acoustics Speech Production Measurement Sustained vowel Voice Quality |
title | Intertext Variability of Smoothed Cepstral Peak Prominence, Methods to Control It, and Its Diagnostic Properties |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T09%3A45%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Intertext%20Variability%20of%20Smoothed%20Cepstral%20Peak%20Prominence,%20Methods%20to%20Control%20It,%20and%20Its%20Diagnostic%20Properties&rft.jtitle=Journal%20of%20voice&rft.au=Kitayama,%20Itsuki&rft.date=2020-05&rft.volume=34&rft.issue=3&rft.spage=305&rft.epage=319&rft.pages=305-319&rft.issn=0892-1997&rft.eissn=1873-4588&rft_id=info:doi/10.1016/j.jvoice.2018.09.021&rft_dat=%3Cproquest_cross%3E2129534329%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2129534329&rft_id=info:pmid/30389189&rft_els_id=S0892199718302455&rfr_iscdi=true |