Automatic segmentation of continuous speech using minimum phase group delay functions

In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and c...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Speech communication 2004-04, Vol.42 (3), p.429-446
Hauptverfasser:	Kamakshi Prasad, V, Nagarajan, T, Murthy, Hema A
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Exact sciences and technology Information, signal and communications theory Minimum phase group delay functions Root cepstrum Signal and communications theory Signal processing Signal representation. Spectral analysis Signal, noise Speech processing Speech segmentation Telecommunications and information theory
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	446
container_issue	3
container_start_page	429
container_title	Speech communication
container_volume	42
creator	Kamakshi Prasad, V Nagarajan, T Murthy, Hema A
description	In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.
doi_str_mv	10.1016/j.specom.2003.12.002
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_85580892</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639303001444</els_id><sourcerecordid>85332415</sourcerecordid><originalsourceid>FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</originalsourceid><addsrcrecordid>eNqNkE1LxDAQhoMouH78Aw-56K01SZNuehEW8QsEL3oOaTLZzdImNWkF_71dV_AmnmYOz_vO8CB0QUlJCa2vt2UewMS-ZIRUJWUlIewALahcsmJJJTtEixlbFnXVVMfoJOctIYRLyRbobTWNsdejNzjDuocwznsMODpsYhh9mOKU8VwPZoOn7MMa9z74furxsNEZ8DrFacAWOv2J3RTMLp3P0JHTXYbzn3mK3u7vXm8fi-eXh6fb1XNheM3GwrQSQGhTE2k0d1w6K3jdtBqapmXgrLaW89YKQqTjrbaNJlbWHJhYEsGa6hRd7XuHFN8nyKPqfTbQdTrA_LeSQkgiG_YPsKoYp2IG-R40KeacwKkh-V6nT0WJ2slWW7WXrXayFWVqlj3HLn_6dTa6c0kH4_NvVuzUf_9xs-dgtvLhIalsPAQD1icwo7LR_33oC4igmXY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>85332415</pqid></control><display><type>article</type><title>Automatic segmentation of continuous speech using minimum phase group delay functions</title><source>Access via ScienceDirect (Elsevier)</source><creator>Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A</creator><creatorcontrib>Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A</creatorcontrib><description>In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2003.12.002</identifier><identifier>CODEN: SCOMDH</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Applied sciences ; Exact sciences and technology ; Information, signal and communications theory ; Minimum phase group delay functions ; Root cepstrum ; Signal and communications theory ; Signal processing ; Signal representation. Spectral analysis ; Signal, noise ; Speech processing ; Speech segmentation ; Telecommunications and information theory</subject><ispartof>Speech communication, 2004-04, Vol.42 (3), p.429-446</ispartof><rights>2003 Elsevier B.V.</rights><rights>2004 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</citedby><cites>FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.specom.2003.12.002$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=15639392$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Kamakshi Prasad, V</creatorcontrib><creatorcontrib>Nagarajan, T</creatorcontrib><creatorcontrib>Murthy, Hema A</creatorcontrib><title>Automatic segmentation of continuous speech using minimum phase group delay functions</title><title>Speech communication</title><description>In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.</description><subject>Applied sciences</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Minimum phase group delay functions</subject><subject>Root cepstrum</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal representation. Spectral analysis</subject><subject>Signal, noise</subject><subject>Speech processing</subject><subject>Speech segmentation</subject><subject>Telecommunications and information theory</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNqNkE1LxDAQhoMouH78Aw-56K01SZNuehEW8QsEL3oOaTLZzdImNWkF_71dV_AmnmYOz_vO8CB0QUlJCa2vt2UewMS-ZIRUJWUlIewALahcsmJJJTtEixlbFnXVVMfoJOctIYRLyRbobTWNsdejNzjDuocwznsMODpsYhh9mOKU8VwPZoOn7MMa9z74furxsNEZ8DrFacAWOv2J3RTMLp3P0JHTXYbzn3mK3u7vXm8fi-eXh6fb1XNheM3GwrQSQGhTE2k0d1w6K3jdtBqapmXgrLaW89YKQqTjrbaNJlbWHJhYEsGa6hRd7XuHFN8nyKPqfTbQdTrA_LeSQkgiG_YPsKoYp2IG-R40KeacwKkh-V6nT0WJ2slWW7WXrXayFWVqlj3HLn_6dTa6c0kH4_NvVuzUf_9xs-dgtvLhIalsPAQD1icwo7LR_33oC4igmXY</recordid><startdate>20040401</startdate><enddate>20040401</enddate><creator>Kamakshi Prasad, V</creator><creator>Nagarajan, T</creator><creator>Murthy, Hema A</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8BM</scope><scope>7T9</scope></search><sort><creationdate>20040401</creationdate><title>Automatic segmentation of continuous speech using minimum phase group delay functions</title><author>Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c462t-cb8ee5ac608ca4f48fd5469bae99b2efdadd44bd5008f4bad9a0d864e25705293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Minimum phase group delay functions</topic><topic>Root cepstrum</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal representation. Spectral analysis</topic><topic>Signal, noise</topic><topic>Speech processing</topic><topic>Speech segmentation</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kamakshi Prasad, V</creatorcontrib><creatorcontrib>Nagarajan, T</creatorcontrib><creatorcontrib>Murthy, Hema A</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>ComDisDome</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kamakshi Prasad, V</au><au>Nagarajan, T</au><au>Murthy, Hema A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic segmentation of continuous speech using minimum phase group delay functions</atitle><jtitle>Speech communication</jtitle><date>2004-04-01</date><risdate>2004</risdate><volume>42</volume><issue>3</issue><spage>429</spage><epage>446</epage><pages>429-446</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><coden>SCOMDH</coden><abstract>In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. In this paper, we employ an algorithm, based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments have been carried out on TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions have also been observed.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2003.12.002</doi><tpages>18</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0167-6393
ispartof	Speech communication, 2004-04, Vol.42 (3), p.429-446
issn	0167-6393 1872-7182
language	eng
recordid	cdi_proquest_miscellaneous_85580892
source	Access via ScienceDirect (Elsevier)
subjects	Applied sciences Exact sciences and technology Information, signal and communications theory Minimum phase group delay functions Root cepstrum Signal and communications theory Signal processing Signal representation. Spectral analysis Signal, noise Speech processing Speech segmentation Telecommunications and information theory
title	Automatic segmentation of continuous speech using minimum phase group delay functions
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T16%3A28%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20segmentation%20of%20continuous%20speech%20using%20minimum%20phase%20group%20delay%20functions&rft.jtitle=Speech%20communication&rft.au=Kamakshi%20Prasad,%20V&rft.date=2004-04-01&rft.volume=42&rft.issue=3&rft.spage=429&rft.epage=446&rft.pages=429-446&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2003.12.002&rft_dat=%3Cproquest_cross%3E85332415%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=85332415&rft_id=info:pmid/&rft_els_id=S0167639303001444&rfr_iscdi=true