Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech
In this paper, we investigate the use of jitter estimation over short time intervals (short-term jitter) for voice pathology detection in the case of running or continuous speech. Short-term jitter estimations are provided by the spectral jitter estimator (SJE), which is based on a mathematical desc...
Gespeichert in:
Veröffentlicht in: | Folia phoniatrica et logopaedica 2009-01, Vol.61 (3), p.153-170 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 170 |
---|---|
container_issue | 3 |
container_start_page | 153 |
container_title | Folia phoniatrica et logopaedica |
container_volume | 61 |
creator | Vasilakis, Miltiadis Stylianou, Yannis |
description | In this paper, we investigate the use of jitter estimation over short time intervals (short-term jitter) for voice pathology detection in the case of running or continuous speech. Short-term jitter estimations are provided by the spectral jitter estimator (SJE), which is based on a mathematical description of the jitter phenomenon. The SJE has been shown to be robust against errors in pitch period estimations, which makes it a good candidate for measuring jitter in continuous speech. On two large databases of sustained vowel recordings from healthy and pathological voices, we suggest a threshold for the SJE for pathology detection based on cross-database validation. Applying that to a database of continuous speech (reading text) from normophonic and dysphonic speakers, a second threshold and new features are suggested for monitoring jitter in continuous speech. Detection performance of the suggested thresholds and features was evaluated using receiver operating characteristic curves and their discriminative efficiency between healthy and pathological voices was judged using the area under the curve index. In terms of area under the curve, the suggested features for reading text provide a discrimination score of about 95%, while the second threshold provides a classification rate of 87.8%. Furthermore, estimated short-term jitter values from reading text were found to confirm the studies showing a decrease of jitter with increasing fundamental frequencies, and the more frequent presence of high jitter values in the case of pathological voices as time increases. |
doi_str_mv | 10.1159/000219951 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_67442240</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1782159231</sourcerecordid><originalsourceid>FETCH-LOGICAL-c366t-e63ba892200db4e9a48d31238737230afa196a714bdfa99b598c05d11a80262c3</originalsourceid><addsrcrecordid>eNpd0U1v1DAQBmCXFtFSeuCOKquHSj2keMZxbB_pd1GlIrZwjZxkdjdlN15s59B_j1e7KoKTLc0zr-wZxj6COAdQ9rMQAsFaBTvsyGojjVDKgkH7hh1AiVBYq3H3nxrgXq7ltkLrCvfZ-xif1zFo8B3bB6s0KCUO2OSn71vi31ya-4WfvfArStSm3g_8wkXqOOXbZO5DKp4oLPnXPiUK_DqmfunWLPJ-4N_HYeiHGZ-siNr5B_Z26haRjrbnIftxc_10eVc8PN7eX355KFpZVamgSjbOWEQhuqYk60rTSUBptNQohZs6sJXTUDbd1FnbKGtaoToAZwRW2MpDdrrJXQX_e6SY6mUfW1os3EB-jHWlyxKxFBme_Aef_RiG_LYaJea5GmkzOtugNvgYA03rVch_DC81iHq9hvp1DdkebwPHZkndX7mdawafNuCXCzMKr2Db_wfXwYQD</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>232951839</pqid></control><display><type>article</type><title>Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech</title><source>Karger Journals</source><source>MEDLINE</source><source>Alma/SFX Local Collection</source><creator>Vasilakis, Miltiadis ; Stylianou, Yannis</creator><creatorcontrib>Vasilakis, Miltiadis ; Stylianou, Yannis</creatorcontrib><description>In this paper, we investigate the use of jitter estimation over short time intervals (short-term jitter) for voice pathology detection in the case of running or continuous speech. Short-term jitter estimations are provided by the spectral jitter estimator (SJE), which is based on a mathematical description of the jitter phenomenon. The SJE has been shown to be robust against errors in pitch period estimations, which makes it a good candidate for measuring jitter in continuous speech. On two large databases of sustained vowel recordings from healthy and pathological voices, we suggest a threshold for the SJE for pathology detection based on cross-database validation. Applying that to a database of continuous speech (reading text) from normophonic and dysphonic speakers, a second threshold and new features are suggested for monitoring jitter in continuous speech. Detection performance of the suggested thresholds and features was evaluated using receiver operating characteristic curves and their discriminative efficiency between healthy and pathological voices was judged using the area under the curve index. In terms of area under the curve, the suggested features for reading text provide a discrimination score of about 95%, while the second threshold provides a classification rate of 87.8%. Furthermore, estimated short-term jitter values from reading text were found to confirm the studies showing a decrease of jitter with increasing fundamental frequencies, and the more frequent presence of high jitter values in the case of pathological voices as time increases.</description><identifier>ISSN: 1021-7762</identifier><identifier>ISBN: 9783805591812</identifier><identifier>ISBN: 3805591810</identifier><identifier>EISSN: 1421-9972</identifier><identifier>EISBN: 9783805591829</identifier><identifier>EISBN: 3805591829</identifier><identifier>DOI: 10.1159/000219951</identifier><identifier>PMID: 19571550</identifier><language>eng</language><publisher>Basel, Switzerland: S. Karger AG</publisher><subject>Algorithms ; Area Under Curve ; Databases, Factual ; Dysphonia - diagnosis ; Humans ; Phonetics ; Reading ; ROC Curve ; Speech ; Speech Acoustics ; Speech Production Measurement - methods ; Time Factors ; Voice Disorders - diagnosis</subject><ispartof>Folia phoniatrica et logopaedica, 2009-01, Vol.61 (3), p.153-170</ispartof><rights>2009 S. Karger AG, Basel</rights><rights>Copyright 2009 S. Karger AG, Basel.</rights><rights>Copyright (c) 2009 S. Karger AG, Basel</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c366t-e63ba892200db4e9a48d31238737230afa196a714bdfa99b598c05d11a80262c3</citedby><cites>FETCH-LOGICAL-c366t-e63ba892200db4e9a48d31238737230afa196a714bdfa99b598c05d11a80262c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,2423,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/19571550$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Vasilakis, Miltiadis</creatorcontrib><creatorcontrib>Stylianou, Yannis</creatorcontrib><title>Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech</title><title>Folia phoniatrica et logopaedica</title><addtitle>Folia Phoniatr Logop</addtitle><description>In this paper, we investigate the use of jitter estimation over short time intervals (short-term jitter) for voice pathology detection in the case of running or continuous speech. Short-term jitter estimations are provided by the spectral jitter estimator (SJE), which is based on a mathematical description of the jitter phenomenon. The SJE has been shown to be robust against errors in pitch period estimations, which makes it a good candidate for measuring jitter in continuous speech. On two large databases of sustained vowel recordings from healthy and pathological voices, we suggest a threshold for the SJE for pathology detection based on cross-database validation. Applying that to a database of continuous speech (reading text) from normophonic and dysphonic speakers, a second threshold and new features are suggested for monitoring jitter in continuous speech. Detection performance of the suggested thresholds and features was evaluated using receiver operating characteristic curves and their discriminative efficiency between healthy and pathological voices was judged using the area under the curve index. In terms of area under the curve, the suggested features for reading text provide a discrimination score of about 95%, while the second threshold provides a classification rate of 87.8%. Furthermore, estimated short-term jitter values from reading text were found to confirm the studies showing a decrease of jitter with increasing fundamental frequencies, and the more frequent presence of high jitter values in the case of pathological voices as time increases.</description><subject>Algorithms</subject><subject>Area Under Curve</subject><subject>Databases, Factual</subject><subject>Dysphonia - diagnosis</subject><subject>Humans</subject><subject>Phonetics</subject><subject>Reading</subject><subject>ROC Curve</subject><subject>Speech</subject><subject>Speech Acoustics</subject><subject>Speech Production Measurement - methods</subject><subject>Time Factors</subject><subject>Voice Disorders - diagnosis</subject><issn>1021-7762</issn><issn>1421-9972</issn><isbn>9783805591812</isbn><isbn>3805591810</isbn><isbn>9783805591829</isbn><isbn>3805591829</isbn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNpd0U1v1DAQBmCXFtFSeuCOKquHSj2keMZxbB_pd1GlIrZwjZxkdjdlN15s59B_j1e7KoKTLc0zr-wZxj6COAdQ9rMQAsFaBTvsyGojjVDKgkH7hh1AiVBYq3H3nxrgXq7ltkLrCvfZ-xif1zFo8B3bB6s0KCUO2OSn71vi31ya-4WfvfArStSm3g_8wkXqOOXbZO5DKp4oLPnXPiUK_DqmfunWLPJ-4N_HYeiHGZ-siNr5B_Z26haRjrbnIftxc_10eVc8PN7eX355KFpZVamgSjbOWEQhuqYk60rTSUBptNQohZs6sJXTUDbd1FnbKGtaoToAZwRW2MpDdrrJXQX_e6SY6mUfW1os3EB-jHWlyxKxFBme_Aef_RiG_LYaJea5GmkzOtugNvgYA03rVch_DC81iHq9hvp1DdkebwPHZkndX7mdawafNuCXCzMKr2Db_wfXwYQD</recordid><startdate>20090101</startdate><enddate>20090101</enddate><creator>Vasilakis, Miltiadis</creator><creator>Stylianou, Yannis</creator><general>S. Karger AG</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7RV</scope><scope>7T9</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>88G</scope><scope>88I</scope><scope>8AF</scope><scope>8AO</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>CPGLG</scope><scope>CRLPW</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB0</scope><scope>M0S</scope><scope>M1P</scope><scope>M2M</scope><scope>M2P</scope><scope>NAPCQ</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PSYQQ</scope><scope>Q9U</scope><scope>S0X</scope><scope>7X8</scope></search><sort><creationdate>20090101</creationdate><title>Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech</title><author>Vasilakis, Miltiadis ; Stylianou, Yannis</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c366t-e63ba892200db4e9a48d31238737230afa196a714bdfa99b598c05d11a80262c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Algorithms</topic><topic>Area Under Curve</topic><topic>Databases, Factual</topic><topic>Dysphonia - diagnosis</topic><topic>Humans</topic><topic>Phonetics</topic><topic>Reading</topic><topic>ROC Curve</topic><topic>Speech</topic><topic>Speech Acoustics</topic><topic>Speech Production Measurement - methods</topic><topic>Time Factors</topic><topic>Voice Disorders - diagnosis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vasilakis, Miltiadis</creatorcontrib><creatorcontrib>Stylianou, Yannis</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Nursing & Allied Health Database</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Psychology Database (Alumni)</collection><collection>Science Database (Alumni Edition)</collection><collection>STEM Database</collection><collection>ProQuest Pharma Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Linguistics Collection</collection><collection>Linguistics Database</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>ProQuest Psychology</collection><collection>Science Database</collection><collection>Nursing & Allied Health Premium</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest One Psychology</collection><collection>ProQuest Central Basic</collection><collection>SIRS Editorial</collection><collection>MEDLINE - Academic</collection><jtitle>Folia phoniatrica et logopaedica</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vasilakis, Miltiadis</au><au>Stylianou, Yannis</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech</atitle><jtitle>Folia phoniatrica et logopaedica</jtitle><addtitle>Folia Phoniatr Logop</addtitle><date>2009-01-01</date><risdate>2009</risdate><volume>61</volume><issue>3</issue><spage>153</spage><epage>170</epage><pages>153-170</pages><issn>1021-7762</issn><eissn>1421-9972</eissn><isbn>9783805591812</isbn><isbn>3805591810</isbn><eisbn>9783805591829</eisbn><eisbn>3805591829</eisbn><abstract>In this paper, we investigate the use of jitter estimation over short time intervals (short-term jitter) for voice pathology detection in the case of running or continuous speech. Short-term jitter estimations are provided by the spectral jitter estimator (SJE), which is based on a mathematical description of the jitter phenomenon. The SJE has been shown to be robust against errors in pitch period estimations, which makes it a good candidate for measuring jitter in continuous speech. On two large databases of sustained vowel recordings from healthy and pathological voices, we suggest a threshold for the SJE for pathology detection based on cross-database validation. Applying that to a database of continuous speech (reading text) from normophonic and dysphonic speakers, a second threshold and new features are suggested for monitoring jitter in continuous speech. Detection performance of the suggested thresholds and features was evaluated using receiver operating characteristic curves and their discriminative efficiency between healthy and pathological voices was judged using the area under the curve index. In terms of area under the curve, the suggested features for reading text provide a discrimination score of about 95%, while the second threshold provides a classification rate of 87.8%. Furthermore, estimated short-term jitter values from reading text were found to confirm the studies showing a decrease of jitter with increasing fundamental frequencies, and the more frequent presence of high jitter values in the case of pathological voices as time increases.</abstract><cop>Basel, Switzerland</cop><pub>S. Karger AG</pub><pmid>19571550</pmid><doi>10.1159/000219951</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1021-7762 |
ispartof | Folia phoniatrica et logopaedica, 2009-01, Vol.61 (3), p.153-170 |
issn | 1021-7762 1421-9972 |
language | eng |
recordid | cdi_proquest_miscellaneous_67442240 |
source | Karger Journals; MEDLINE; Alma/SFX Local Collection |
subjects | Algorithms Area Under Curve Databases, Factual Dysphonia - diagnosis Humans Phonetics Reading ROC Curve Speech Speech Acoustics Speech Production Measurement - methods Time Factors Voice Disorders - diagnosis |
title | Voice Pathology Detection Based eon Short-Term Jitter Estimations in Running Speech |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T08%3A14%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Voice%20Pathology%20Detection%20Based%20eon%20Short-Term%20Jitter%20Estimations%20in%20Running%20Speech&rft.jtitle=Folia%20phoniatrica%20et%20logopaedica&rft.au=Vasilakis,%20Miltiadis&rft.date=2009-01-01&rft.volume=61&rft.issue=3&rft.spage=153&rft.epage=170&rft.pages=153-170&rft.issn=1021-7762&rft.eissn=1421-9972&rft.isbn=9783805591812&rft.isbn_list=3805591810&rft_id=info:doi/10.1159/000219951&rft_dat=%3Cproquest_cross%3E1782159231%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783805591829&rft.eisbn_list=3805591829&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=232951839&rft_id=info:pmid/19571550&rfr_iscdi=true |