Automatic segmentation of continuous speech using minimum phase group delay functions
In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm for segmentation is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and c...
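The idea summarized above, treating the short-term energy contour as if it were a magnitude spectrum and reading segment boundaries off a group delay function, can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function names, the inversion of the energy contour (so that energy valleys become peaks), the symmetric extension, the exponent `gamma`, and the `window_fraction` parameter are all assumptions introduced here for the example.

```python
import numpy as np


def short_term_energy(signal, frame_len=400, hop=160):
    """Sum of squared samples per frame (frame_len and hop are illustrative)."""
    signal = np.asarray(signal, dtype=float)
    starts = range(0, len(signal) - frame_len + 1, hop)
    return np.array([np.sum(signal[s:s + frame_len] ** 2) for s in starts])


def group_delay_of_energy(energy, window_fraction=0.1, gamma=1.0, eps=1e-8):
    """Group delay of a causal sequence derived from the energy contour.

    Assumed steps (hypothetical sketch, not taken from the record):
      1. normalise and invert the energy so that valleys become peaks;
      2. mirror it so it resembles the magnitude spectrum of a real signal;
      3. inverse FFT and keep a short causal part (a cepstrum-like lifter);
      4. compute the group delay of that causal sequence.
    """
    e = np.asarray(energy, dtype=float)
    e = e / max(e.max(), eps) + eps
    inverted = 1.0 / e ** gamma
    # Even-symmetric extension of length 2N - 2 (needs at least 3 frames).
    symmetric = np.concatenate([inverted, inverted[-2:0:-1]])
    sequence = np.fft.ifft(symmetric).real
    keep = max(2, int(window_fraction * len(e)))
    causal = sequence[:keep]
    nfft = len(symmetric)
    # Group delay via the standard identity using FFTs of x[n] and n * x[n].
    x = np.fft.fft(causal, nfft)
    y = np.fft.fft(np.arange(keep) * causal, nfft)
    tau = (x.real * y.real + x.imag * y.imag) / (np.abs(x) ** 2 + eps)
    return tau[:len(e)]


def candidate_boundaries(tau):
    """Frame indices of positive local maxima of the group delay function."""
    i = np.arange(1, len(tau) - 1)
    is_peak = (tau[i] > tau[i - 1]) & (tau[i] >= tau[i + 1]) & (tau[i] > 0)
    return i[is_peak]
```

Calling `candidate_boundaries(group_delay_of_energy(short_term_energy(x)))` yields frame indices; multiplying by the hop size converts them to sample positions. In this sketch a smaller `window_fraction` would be expected to smooth the group delay function and give fewer, coarser segments.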
Published in: | Speech communication 2004-04, Vol.42 (3), p.429-446 |
---|---|
Main authors: | Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 446 |
---|---|
container_issue | 3 |
container_start_page | 429 |
container_title | Speech communication |
container_volume | 42 |
creator | Kamakshi Prasad, V ; Nagarajan, T ; Murthy, Hema A |
description | In this paper, we present a new algorithm to automatically segment a continuous speech signal into syllable-like segments. The algorithm is based on processing the short-term energy function of the continuous speech signal. The short-term energy function is a positive function and can therefore be processed in a manner similar to that of the magnitude spectrum. We employ an algorithm based on group delay processing of the magnitude spectrum to determine segment boundaries in the speech signal. The experiments were carried out on the TIMIT and TIDIGITS databases. The error in segment boundary is ⩽20% of syllable duration for 70% of the syllables. In addition to true segments, an overall 5% insertions and deletions were also observed. |
doi_str_mv | 10.1016/j.specom.2003.12.002 |
format | Article |
fullrecord | recordid: TN_cdi_proquest_miscellaneous_85580892 ; els_id: S0167639303001444 ; pqid: 85332415 ; publisher: Amsterdam: Elsevier B.V ; identifier: ISSN 0167-6393, EISSN 1872-7182, DOI 10.1016/j.specom.2003.12.002, CODEN SCOMDH ; rights: 2003 Elsevier B.V., 2004 INIST-CNRS ; peer_reviewed ; tpages: 18 ; collection: Pascal-Francis, CrossRef, ComDisDome, Linguistics and Language Behavior Abstracts (LLBA) ; backlink: http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=15639392 (View record in Pascal Francis) |
fulltext | fulltext |
identifier | ISSN: 0167-6393 |
ispartof | Speech communication, 2004-04, Vol.42 (3), p.429-446 |
issn | 0167-6393 (ISSN) ; 1872-7182 (EISSN) |
language | eng |
recordid | cdi_proquest_miscellaneous_85580892 |
source | Access via ScienceDirect (Elsevier) |
subjects | Applied sciences ; Exact sciences and technology ; Information, signal and communications theory ; Minimum phase group delay functions ; Root cepstrum ; Signal and communications theory ; Signal processing ; Signal representation. Spectral analysis ; Signal, noise ; Speech processing ; Speech segmentation ; Telecommunications and information theory |
title | Automatic segmentation of continuous speech using minimum phase group delay functions |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T16%3A28%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20segmentation%20of%20continuous%20speech%20using%20minimum%20phase%20group%20delay%20functions&rft.jtitle=Speech%20communication&rft.au=Kamakshi%20Prasad,%20V&rft.date=2004-04-01&rft.volume=42&rft.issue=3&rft.spage=429&rft.epage=446&rft.pages=429-446&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2003.12.002&rft_dat=%3Cproquest_cross%3E85332415%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=85332415&rft_id=info:pmid/&rft_els_id=S0167639303001444&rfr_iscdi=true |
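The accuracy figure quoted in the description (a boundary error of at most 20% of syllable duration for 70% of the syllables) can be illustrated with a small scoring sketch. The matching rule used here, which compares the end boundary of each reference syllable with the nearest detected boundary, and the function name are assumptions made for illustration; the record does not state how detected and reference boundaries were aligned. The boundary times in the usage lines are made-up values, not data from the paper.

```python
def fraction_within_tolerance(reference, detected, tolerance=0.2):
    """Share of reference syllables whose end boundary is matched closely.

    reference: boundary times delimiting the reference syllables
               (consecutive pairs form one syllable each).
    detected:  boundary times produced by a segmenter.
    A syllable counts as a hit when the nearest detected boundary lies
    within tolerance * syllable_duration of the syllable's end boundary.
    """
    reference = sorted(reference)
    detected = sorted(detected)
    if len(reference) < 2 or not detected:
        return 0.0
    hits = 0
    for start, end in zip(reference[:-1], reference[1:]):
        duration = end - start
        error = min(abs(d - end) for d in detected)
        if error <= tolerance * duration:
            hits += 1
    return hits / (len(reference) - 1)


# Hypothetical boundaries in seconds: three reference syllables.
ref = [0.0, 0.2, 0.5, 0.9]
hyp = [0.21, 0.58, 0.9]
score = fraction_within_tolerance(ref, hyp)  # 2 of the 3 syllables are hits
```

Insertions and deletions, which the description reports separately, are not counted by this sketch; they would need an explicit one-to-one alignment between detected and reference boundaries.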