Analyzing fricative confusions in healthy and pathological speech using modified S-transform
Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place o...
Gespeichert in:
Veröffentlicht in: | International journal of speech technology 2024, Vol.27 (4), p.977-985 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 985 |
---|---|
container_issue | 4 |
container_start_page | 977 |
container_title | International journal of speech technology |
container_volume | 27 |
creator | Roopa, S. Karjigi, Veena Chandrashekar, H. M. |
description | Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively. |
doi_str_mv | 10.1007/s10772-024-10139-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3145726445</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3145726445</sourcerecordid><originalsourceid>FETCH-LOGICAL-c115z-68d97bd10afea638e7d78b0e0c2c0f9cc4049d3072aee43ae3296147849f972c3</originalsourceid><addsrcrecordid>eNp9kMtKxDAUhoMoOF5ewFXAdfXkMk2zHAZvMOBC3QkhkybTDp2kJh1h-vRGK7hzdQ6H7__hfAhdEbghAOI2ERCCFkB5QYAwWYxHaEbm-VQRAsd5ZxUpKCflKTpLaQsAUkg6Q-8Lr7vD2PoNdrE1emg_LTbBu31qg0-49bixuhuaA9a-xr0emtCFTSY7nHprTYMzmdO7ULeutTV-KYaofXIh7i7QidNdspe_8xy93d-9Lh-L1fPD03KxKgwh87Eoq1qKdU1AO6tLVllRi2oNFgw14KQxHLisGQiqreVMW0ZlSbiouHRSUMPO0fXU28fwsbdpUNuwj_mxpBjh2ULJ-TxTdKJMDClF61Qf252OB0VAfVtUk0WVLaofi2rMITaFUob9xsa_6n9SX938duY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3145726445</pqid></control><display><type>article</type><title>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</title><source>SpringerLink</source><creator>Roopa, S. ; Karjigi, Veena ; Chandrashekar, H. M.</creator><creatorcontrib>Roopa, S. ; Karjigi, Veena ; Chandrashekar, H. M.</creatorcontrib><description>Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.</description><identifier>ISSN: 1381-2416</identifier><identifier>EISSN: 1572-8110</identifier><identifier>DOI: 10.1007/s10772-024-10139-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Air flow ; Artificial Intelligence ; Classification ; Constrictions ; Engineering ; Fricatives ; Parameter modification ; Place of articulation ; Signal,Image and Speech Processing ; Social Sciences ; Spectrograms ; Speech sounds ; Transformations (mathematics) ; Vocal tract</subject><ispartof>International journal of speech technology, 2024, Vol.27 (4), p.977-985</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>Copyright Springer Nature B.V. 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c115z-68d97bd10afea638e7d78b0e0c2c0f9cc4049d3072aee43ae3296147849f972c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10772-024-10139-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10772-024-10139-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Roopa, S.</creatorcontrib><creatorcontrib>Karjigi, Veena</creatorcontrib><creatorcontrib>Chandrashekar, H. M.</creatorcontrib><title>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</title><title>International journal of speech technology</title><addtitle>Int J Speech Technol</addtitle><description>Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.</description><subject>Air flow</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Constrictions</subject><subject>Engineering</subject><subject>Fricatives</subject><subject>Parameter modification</subject><subject>Place of articulation</subject><subject>Signal,Image and Speech Processing</subject><subject>Social Sciences</subject><subject>Spectrograms</subject><subject>Speech sounds</subject><subject>Transformations (mathematics)</subject><subject>Vocal tract</subject><issn>1381-2416</issn><issn>1572-8110</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kMtKxDAUhoMoOF5ewFXAdfXkMk2zHAZvMOBC3QkhkybTDp2kJh1h-vRGK7hzdQ6H7__hfAhdEbghAOI2ERCCFkB5QYAwWYxHaEbm-VQRAsd5ZxUpKCflKTpLaQsAUkg6Q-8Lr7vD2PoNdrE1emg_LTbBu31qg0-49bixuhuaA9a-xr0emtCFTSY7nHprTYMzmdO7ULeutTV-KYaofXIh7i7QidNdspe_8xy93d-9Lh-L1fPD03KxKgwh87Eoq1qKdU1AO6tLVllRi2oNFgw14KQxHLisGQiqreVMW0ZlSbiouHRSUMPO0fXU28fwsbdpUNuwj_mxpBjh2ULJ-TxTdKJMDClF61Qf252OB0VAfVtUk0WVLaofi2rMITaFUob9xsa_6n9SX938duY</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Roopa, S.</creator><creator>Karjigi, Veena</creator><creator>Chandrashekar, H. M.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>2024</creationdate><title>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</title><author>Roopa, S. ; Karjigi, Veena ; Chandrashekar, H. M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c115z-68d97bd10afea638e7d78b0e0c2c0f9cc4049d3072aee43ae3296147849f972c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Air flow</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Constrictions</topic><topic>Engineering</topic><topic>Fricatives</topic><topic>Parameter modification</topic><topic>Place of articulation</topic><topic>Signal,Image and Speech Processing</topic><topic>Social Sciences</topic><topic>Spectrograms</topic><topic>Speech sounds</topic><topic>Transformations (mathematics)</topic><topic>Vocal tract</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Roopa, S.</creatorcontrib><creatorcontrib>Karjigi, Veena</creatorcontrib><creatorcontrib>Chandrashekar, H. M.</creatorcontrib><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>International journal of speech technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Roopa, S.</au><au>Karjigi, Veena</au><au>Chandrashekar, H. M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Analyzing fricative confusions in healthy and pathological speech using modified S-transform</atitle><jtitle>International journal of speech technology</jtitle><stitle>Int J Speech Technol</stitle><date>2024</date><risdate>2024</risdate><volume>27</volume><issue>4</issue><spage>977</spage><epage>985</epage><pages>977-985</pages><issn>1381-2416</issn><eissn>1572-8110</eissn><abstract>Fricatives are a class of speech sounds that are produced when air passes through a partial constriction in the vocal tract resulting in a turbulent airflow with prominent energy in the high-frequency region. Place of constriction decides the resonances resulting in fricatives that differ in place of articulation. The present study considers three classes of fricatives namely dental, alveolar and post-alveolar. To distinguish the fricatives based on place of articulation, it is important to have a signal representation with good frequency resolution at high frequencies. The standard S-transform exhibits the varying resolution with an uncontrolled window width and exhibits good frequency resolution at low-frequencies and good time resolution at high-frequencies. Modified S-transform introduces two adjustable parameters to control the width of the Gaussian window and provides better frequency resolution at high frequencies than S-transform and suitable for classification of fricatives based on place of articulation. The classification of fricatives in normal and pathological speech is attempted by using S-transform and modified S-transform spectrograms. Experimental results show that the use of modified S-transform provides higher fricative classification accuracy of 93.4% and 50% compared to 91.7% and 44.54% by using S-transform for normal and pathological speech respectively.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10772-024-10139-z</doi><tpages>9</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1381-2416 |
ispartof | International journal of speech technology, 2024, Vol.27 (4), p.977-985 |
issn | 1381-2416 1572-8110 |
language | eng |
recordid | cdi_proquest_journals_3145726445 |
source | SpringerLink |
subjects | Air flow Artificial Intelligence Classification Constrictions Engineering Fricatives Parameter modification Place of articulation Signal,Image and Speech Processing Social Sciences Spectrograms Speech sounds Transformations (mathematics) Vocal tract |
title | Analyzing fricative confusions in healthy and pathological speech using modified S-transform |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T20%3A15%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Analyzing%20fricative%20confusions%20in%20healthy%20and%20pathological%20speech%20using%20modified%20S-transform&rft.jtitle=International%20journal%20of%20speech%20technology&rft.au=Roopa,%20S.&rft.date=2024&rft.volume=27&rft.issue=4&rft.spage=977&rft.epage=985&rft.pages=977-985&rft.issn=1381-2416&rft.eissn=1572-8110&rft_id=info:doi/10.1007/s10772-024-10139-z&rft_dat=%3Cproquest_cross%3E3145726445%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3145726445&rft_id=info:pmid/&rfr_iscdi=true |