Automatic segmentation of continuous speech using minimum phase group delay functions

In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and c...

Full description

Bibliographic details
Published in: Speech communication 2004-04, Vol.42 (3), p.429-446
Main authors: Kamakshi Prasad, V, Nagarajan, T, Murthy, Hema A
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 446
container_issue 3
container_start_page 429
container_title Speech communication
container_volume 42
creator Kamakshi Prasad, V
Nagarajan, T
Murthy, Hema A
description In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.
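The description above can be illustrated with a minimal sketch. It is not the authors' implementation: this record gives only the abstract, so the inversion of the energy contour, the exponent `gamma`, the cepstral window fraction `window_frac`, and the function names `short_term_energy` and `group_delay_boundaries` are all assumptions made for illustration. The sketch treats the short-term energy contour as if it were a magnitude spectrum, derives a causal (minimum-phase-like) sequence from it by an inverse DFT, and takes positive peaks of that sequence's group delay as candidate segment boundaries.

```python
import numpy as np


def short_term_energy(x, frame_len, hop):
    """Sum of squared samples per frame (a positive function of frame index)."""
    x = np.asarray(x, dtype=float)
    n_frames = 1 + (len(x) - frame_len) // hop
    return np.array(
        [np.sum(x[i * hop : i * hop + frame_len] ** 2) for i in range(n_frames)]
    )


def group_delay_boundaries(energy, gamma=0.01, window_frac=0.1):
    """Return (group_delay, peak_indices) for an energy contour.

    gamma and window_frac are hypothetical tuning parameters, not values
    taken from the paper.
    """
    energy = np.asarray(energy, dtype=float)
    n = len(energy)
    if n < 3:
        raise ValueError("need at least 3 energy frames")

    # Normalise and invert so that energy valleys become peaks (assumption).
    e = energy / (energy.max() + 1e-12) + 1e-8
    inv = 1.0 / e ** gamma

    # Mirror the contour so it has the even symmetry of a real signal's
    # magnitude spectrum; its inverse DFT is then real up to rounding.
    sym = np.concatenate([inv, inv[-2:0:-1]])
    cep = np.fft.ifft(sym).real

    # Keep only the first nc causal samples, tapered by a half Hann window.
    nc = max(2, int(window_frac * n))
    causal = cep[:nc] * np.hanning(2 * nc)[nc:]

    # Group delay via the standard identity
    # tau = (Xr*Yr + Xi*Yi) / |X|^2, where Y is the DFT of k * x[k].
    m = len(sym)
    X = np.fft.fft(causal, m)
    Y = np.fft.fft(np.arange(nc) * causal, m)
    gd = (X.real * Y.real + X.imag * Y.imag) / (np.abs(X) ** 2 + 1e-12)
    gd = gd[:n]

    # Positive local maxima are taken as candidate boundaries (frame indices).
    peaks = [
        k
        for k in range(1, n - 1)
        if gd[k] > 0 and gd[k] > gd[k - 1] and gd[k] >= gd[k + 1]
    ]
    return gd, peaks
```

A call such as `group_delay_boundaries(short_term_energy(signal, 400, 160))` returns frame indices; multiplying by the hop size converts them to sample positions. A smaller `window_frac` smooths the group delay and should yield fewer candidate boundaries, so in practice it would have to be tuned against labelled data such as the TIMIT and TIDIGITS databases mentioned above.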
doi_str_mv 10.1016/j.specom.2003.12.002
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_85580892</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639303001444</els_id><sourcerecordid>85332415</sourcerecordid><originalsourceid>FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</originalsourceid><addsrcrecordid>eNqNkE1LxDAQhoMouH78Aw-56K01SZNuehEW8QsEL3oOaTLZzdImNWkF_71dV_AmnmYOz_vO8CB0QUlJCa2vt2UewMS-ZIRUJWUlIewALahcsmJJJTtEixlbFnXVVMfoJOctIYRLyRbobTWNsdejNzjDuocwznsMODpsYhh9mOKU8VwPZoOn7MMa9z74furxsNEZ8DrFacAWOv2J3RTMLp3P0JHTXYbzn3mK3u7vXm8fi-eXh6fb1XNheM3GwrQSQGhTE2k0d1w6K3jdtBqapmXgrLaW89YKQqTjrbaNJlbWHJhYEsGa6hRd7XuHFN8nyKPqfTbQdTrA_LeSQkgiG_YPsKoYp2IG-R40KeacwKkh-V6nT0WJ2slWW7WXrXayFWVqlj3HLn_6dTa6c0kH4_NvVuzUf_9xs-dgtvLhIalsPAQD1icwo7LR_33oC4igmXY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>85332415</pqid></control><display><type>article</type><title>Automatic segmentation of continuous speech using minimum phase group delay functions</title><source>Access via ScienceDirect (Elsevier)</source><creator>Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A</creator><creatorcontrib>Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A</creatorcontrib><description>In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. 
The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2003.12.002</identifier><identifier>CODEN: SCOMDH</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Applied sciences ; Exact sciences and technology ; Information, signal and communications theory ; Minimum phase group delay functions ; Root cepstrum ; Signal and communications theory ; Signal processing ; Signal representation. Spectral analysis ; Signal, noise ; Speech processing ; Speech segmentation ; Telecommunications and information theory</subject><ispartof>Speech communication, 2004-04, Vol.42 (3), p.429-446</ispartof><rights>2003 Elsevier B.V.</rights><rights>2004 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</citedby><cites>FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.specom.2003.12.002$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=15639392$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Kamakshi Prasad, V</creatorcontrib><creatorcontrib>Nagarajan, T</creatorcontrib><creatorcontrib>Murthy, Hema A</creatorcontrib><title>Automatic segmentation of continuous speech using minimum phase group delay functions</title><title>Speech 
communication</title><description>In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.</description><subject>Applied sciences</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Minimum phase group delay functions</subject><subject>Root cepstrum</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal representation. 
Spectral analysis</subject><subject>Signal, noise</subject><subject>Speech processing</subject><subject>Speech segmentation</subject><subject>Telecommunications and information theory</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNqNkE1LxDAQhoMouH78Aw-56K01SZNuehEW8QsEL3oOaTLZzdImNWkF_71dV_AmnmYOz_vO8CB0QUlJCa2vt2UewMS-ZIRUJWUlIewALahcsmJJJTtEixlbFnXVVMfoJOctIYRLyRbobTWNsdejNzjDuocwznsMODpsYhh9mOKU8VwPZoOn7MMa9z74furxsNEZ8DrFacAWOv2J3RTMLp3P0JHTXYbzn3mK3u7vXm8fi-eXh6fb1XNheM3GwrQSQGhTE2k0d1w6K3jdtBqapmXgrLaW89YKQqTjrbaNJlbWHJhYEsGa6hRd7XuHFN8nyKPqfTbQdTrA_LeSQkgiG_YPsKoYp2IG-R40KeacwKkh-V6nT0WJ2slWW7WXrXayFWVqlj3HLn_6dTa6c0kH4_NvVuzUf_9xs-dgtvLhIalsPAQD1icwo7LR_33oC4igmXY</recordid><startdate>20040401</startdate><enddate>20040401</enddate><creator>Kamakshi Prasad, V</creator><creator>Nagarajan, T</creator><creator>Murthy, Hema A</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8BM</scope><scope>7T9</scope></search><sort><creationdate>20040401</creationdate><title>Automatic segmentation of continuous speech using minimum phase group delay functions</title><author>Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Minimum phase group delay functions</topic><topic>Root cepstrum</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal representation. 
Spectral analysis</topic><topic>Signal, noise</topic><topic>Speech processing</topic><topic>Speech segmentation</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kamakshi Prasad, V</creatorcontrib><creatorcontrib>Nagarajan, T</creatorcontrib><creatorcontrib>Murthy, Hema A</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>ComDisDome</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kamakshi Prasad, V</au><au>Nagarajan, T</au><au>Murthy, Hema A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic segmentation of continuous speech using minimum phase group delay functions</atitle><jtitle>Speech communication</jtitle><date>2004-04-01</date><risdate>2004</risdate><volume>42</volume><issue>3</issue><spage>429</spage><epage>446</epage><pages>429-446</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><coden>SCOMDH</coden><abstract>In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. 
In addition to true segments, an overall 5% insertions and deletions have also been observed.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2003.12.002</doi><tpages>18</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0167-6393
ispartof Speech communication, 2004-04, Vol.42 (3), p.429-446
issn 0167-6393
1872-7182
language eng
recordid cdi_proquest_miscellaneous_85580892
source Access via ScienceDirect (Elsevier)
subjects Applied sciences
Exact sciences and technology
Information, signal and communications theory
Minimum phase group delay functions
Root cepstrum
Signal and communications theory
Signal processing
Signal representation. Spectral analysis
Signal, noise
Speech processing
Speech segmentation
Telecommunications and information theory
title Automatic segmentation of continuous speech using minimum phase group delay functions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T16%3A28%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20segmentation%20of%20continuous%20speech%20using%20minimum%20phase%20group%20delay%20functions&rft.jtitle=Speech%20communication&rft.au=Kamakshi%20Prasad,%20V&rft.date=2004-04-01&rft.volume=42&rft.issue=3&rft.spage=429&rft.epage=446&rft.pages=429-446&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2003.12.002&rft_dat=%3Cproquest_cross%3E85332415%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=85332415&rft_id=info:pmid/&rft_els_id=S0167639303001444&rfr_iscdi=true