Analyzing fricative confusions in healthy and pathological speech using modified S-transform

Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of speech technology 2024, Vol.27 (4), p.977-985
Hauptverfasser: Roopa, S., Karjigi, Veena, Chandrashekar, H. M.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 985
container_issue 4
container_start_page 977
container_title International journal of speech technology
container_volume 27
creator Roopa, S.
Karjigi, Veena
Chandrashekar, H. M.
description Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.
doi_str_mv 10.1007/s10772-024-10139-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3145726445</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3145726445</sourcerecordid><originalsourceid>FETCH-LOGICAL-c115z-68d97bd10afea638e7d78b0e0c2c0f9cc4049d3072aee43ae3296147849f972c3</originalsourceid><addsrcrecordid>eNp9kMtKxDAUhoMoOF5ewFXAdfXkMk2zHAZvMOBC3QkhkybTDp2kJh1h-vRGK7hzdQ6H7__hfAhdEbghAOI2ERCCFkB5QYAwWYxHaEbm-VQRAsd5ZxUpKCflKTpLaQsAUkg6Q-8Lr7vD2PoNdrE1emg_LTbBu31qg0-49bixuhuaA9a-xr0emtCFTSY7nHprTYMzmdO7ULeutTV-KYaofXIh7i7QidNdspe_8xy93d-9Lh-L1fPD03KxKgwh87Eoq1qKdU1AO6tLVllRi2oNFgw14KQxHLisGQiqreVMW0ZlSbiouHRSUMPO0fXU28fwsbdpUNuwj_mxpBjh2ULJ-TxTdKJMDClF61Qf252OB0VAfVtUk0WVLaofi2rMITaFUob9xsa_6n9SX938duY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3145726445</pqid></control><display><type>article</type><title>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</title><source>SpringerLink</source><creator>Roopa, S. ; Karjigi, Veena ; Chandrashekar, H. M.</creator><creatorcontrib>Roopa, S. ; Karjigi, Veena ; Chandrashekar, H. M.</creatorcontrib><description>Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.</description><identifier>ISSN: 1381-2416</identifier><identifier>EISSN: 1572-8110</identifier><identifier>DOI: 10.1007/s10772-024-10139-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Air flow ; Artificial Intelligence ; Classification ; Constrictions ; Engineering ; Fricatives ; Parameter modification ; Place of articulation ; Signal,Image and Speech Processing ; Social Sciences ; Spectrograms ; Speech sounds ; Transformations (mathematics) ; Vocal tract</subject><ispartof>International journal of speech technology, 2024, Vol.27 (4), p.977-985</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>Copyright Springer Nature B.V. 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c115z-68d97bd10afea638e7d78b0e0c2c0f9cc4049d3072aee43ae3296147849f972c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10772-024-10139-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10772-024-10139-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Roopa, S.</creatorcontrib><creatorcontrib>Karjigi, Veena</creatorcontrib><creatorcontrib>Chandrashekar, H. M.</creatorcontrib><title>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</title><title>International journal of speech technology</title><addtitle>Int J Speech Technol</addtitle><description>Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.</description><subject>Air flow</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Constrictions</subject><subject>Engineering</subject><subject>Fricatives</subject><subject>Parameter modification</subject><subject>Place of articulation</subject><subject>Signal,Image and Speech Processing</subject><subject>Social Sciences</subject><subject>Spectrograms</subject><subject>Speech sounds</subject><subject>Transformations (mathematics)</subject><subject>Vocal tract</subject><issn>1381-2416</issn><issn>1572-8110</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kMtKxDAUhoMoOF5ewFXAdfXkMk2zHAZvMOBC3QkhkybTDp2kJh1h-vRGK7hzdQ6H7__hfAhdEbghAOI2ERCCFkB5QYAwWYxHaEbm-VQRAsd5ZxUpKCflKTpLaQsAUkg6Q-8Lr7vD2PoNdrE1emg_LTbBu31qg0-49bixuhuaA9a-xr0emtCFTSY7nHprTYMzmdO7ULeutTV-KYaofXIh7i7QidNdspe_8xy93d-9Lh-L1fPD03KxKgwh87Eoq1qKdU1AO6tLVllRi2oNFgw14KQxHLisGQiqreVMW0ZlSbiouHRSUMPO0fXU28fwsbdpUNuwj_mxpBjh2ULJ-TxTdKJMDClF61Qf252OB0VAfVtUk0WVLaofi2rMITaFUob9xsa_6n9SX938duY</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Roopa, S.</creator><creator>Karjigi, Veena</creator><creator>Chandrashekar, H. M.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>2024</creationdate><title>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</title><author>Roopa, S. ; Karjigi, Veena ; Chandrashekar, H. M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c115z-68d97bd10afea638e7d78b0e0c2c0f9cc4049d3072aee43ae3296147849f972c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Air flow</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Constrictions</topic><topic>Engineering</topic><topic>Fricatives</topic><topic>Parameter modification</topic><topic>Place of articulation</topic><topic>Signal,Image and Speech Processing</topic><topic>Social Sciences</topic><topic>Spectrograms</topic><topic>Speech sounds</topic><topic>Transformations (mathematics)</topic><topic>Vocal tract</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Roopa, S.</creatorcontrib><creatorcontrib>Karjigi, Veena</creatorcontrib><creatorcontrib>Chandrashekar, H. M.</creatorcontrib><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>International journal of speech technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Roopa, S.</au><au>Karjigi, Veena</au><au>Chandrashekar, H. M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</atitle><jtitle>International journal of speech technology</jtitle><stitle>Int J Speech Technol</stitle><date>2024</date><risdate>2024</risdate><volume>27</volume><issue>4</issue><spage>977</spage><epage>985</epage><pages>977-985</pages><issn>1381-2416</issn><eissn>1572-8110</eissn><abstract>Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10772-024-10139-z</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1381-2416
ispartof International journal of speech technology, 2024, Vol.27 (4), p.977-985
issn 1381-2416
1572-8110
language eng
recordid cdi_proquest_journals_3145726445
source SpringerLink
subjects Air flow
Artificial Intelligence
Classification
Constrictions
Engineering
Fricatives
Parameter modification
Place of articulation
Signal,Image and Speech Processing
Social Sciences
Spectrograms
Speech sounds
Transformations (mathematics)
Vocal tract
title Analyzing fricative confusions in healthy and pathological speech using modified S-transform
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T20%3A15%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Analyzing%20fricative%20confusions%20in%20healthy%20and%20pathological%20speech%20using%20modified%20S-transform&rft.jtitle=International%20journal%20of%20speech%20technology&rft.au=Roopa,%20S.&rft.date=2024&rft.volume=27&rft.issue=4&rft.spage=977&rft.epage=985&rft.pages=977-985&rft.issn=1381-2416&rft.eissn=1572-8110&rft_id=info:doi/10.1007/s10772-024-10139-z&rft_dat=%3Cproquest_cross%3E3145726445%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3145726445&rft_id=info:pmid/&rfr_iscdi=true