Optimization of a temporal decomposition model of speech
A dynamic-programming-based optimization strategy for a modified temporal decomposition (TD) model of speech is presented. In previous work with the SBEL-TD algorithm, event localization was based on a spectral stability criterion. Although this approach gave reasonably good results (sp...
container_end_page | 474 vol.1 |
---|---|
container_issue | |
container_start_page | 471 |
container_title | |
container_volume | 1 |
creator | Athaudage, C.N. ; Bradley, A.B. ; Lech, M. |
description | A dynamic-programming-based optimization strategy for a modified temporal decomposition (TD) model of speech is presented. In previous work with the SBEL-TD algorithm, event localization was based on a spectral stability criterion. Although this approach gave reasonably good results (spectral distortion of about 1.5 dB), there was no assurance that the event locations were optimal. In the present work, the event localization task is optimized using dynamic programming. An overlapping buffering technique is also proposed to ensure smooth transitions between consecutive speech parameter blocks. Simulation results show that improved TD model accuracy, measured as spectral distortion (0.9 dB), can be achieved. The optimized algorithm also adds a new degree of freedom to the analysis paradigm, the TD resolution (events/sec), which can be used effectively to control the TD model accuracy. |
doi_str_mv | 10.1109/ISSPA.1999.818214 |
format | Conference Proceeding |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 9781864354515 |
ispartof | ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359), 1999, Vol.1, p.471-474 vol.1 |
issn | |
language | eng |
recordid | cdi_ieee_primary_818214 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Algorithm design and analysis ; Bit rate ; Dynamic programming ; Lapping ; Signal processing ; Speech analysis ; Speech coding ; Stability |
title | Optimization of a temporal decomposition model of speech |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T11%3A42%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Optimization%20of%20a%20temporal%20decomposition%20model%20of%20speech&rft.btitle=ISSPA%20'99.%20Proceedings%20of%20the%20Fifth%20International%20Symposium%20on%20Signal%20Processing%20and%20its%20Applications%20(IEEE%20Cat.%20No.99EX359)&rft.au=Athaudage,%20C.N.&rft.date=1999&rft.volume=1&rft.spage=471&rft.epage=474%20vol.1&rft.pages=471-474%20vol.1&rft.isbn=9781864354515&rft.isbn_list=1864354518&rft_id=info:doi/10.1109/ISSPA.1999.818214&rft_dat=%3Cieee_6IE%3E818214%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=818214&rfr_iscdi=true |
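
The record gives no algorithmic detail beyond the abstract, so the following is only a minimal sketch of how dynamic programming can place a fixed number of events optimally. It is not the authors' method. The cost function (squared error of linear interpolation between the parameter frames at consecutive event locations) is a stand-in assumption, and the names `segment_cost` and `locate_events` are hypothetical.

```python
import numpy as np


def segment_cost(Y, a, b):
    """Squared error when frames a..b are replaced by a straight line
    from Y[a] to Y[b]. Assumed stand-in cost, not the paper's TD model."""
    n = b - a
    w = np.arange(n + 1)[:, None] / n          # interpolation weights 0..1
    approx = (1.0 - w) * Y[a] + w * Y[b]
    return float(np.sum((Y[a:b + 1] - approx) ** 2))


def locate_events(Y, num_events):
    """Choose num_events frame indices (first and last frame included)
    that minimize the summed segment cost, by dynamic programming.

    Y has shape (frames, parameters)."""
    N = len(Y)
    if not 2 <= num_events <= N:
        raise ValueError("num_events must be between 2 and the frame count")
    # best[k, j]: minimal cost of frames 0..j using k+1 events, the last at j
    best = np.full((num_events, N), np.inf)
    back = np.zeros((num_events, N), dtype=int)
    best[0, 0] = 0.0
    for k in range(1, num_events):
        for j in range(k, N):
            for i in range(k - 1, j):
                if not np.isfinite(best[k - 1, i]):
                    continue
                c = best[k - 1, i] + segment_cost(Y, i, j)
                if c < best[k, j]:
                    best[k, j] = c
                    back[k, j] = i
    # Walk the back-pointers from the last frame to recover the events.
    events = [N - 1]
    for k in range(num_events - 1, 0, -1):
        events.append(int(back[k, events[-1]]))
    return events[::-1], float(best[num_events - 1, N - 1])
```

In this sketch the number of events per block plays the role of the TD resolution mentioned in the abstract: for a block of a given duration, `num_events` would be roughly the resolution in events/sec times the block length in seconds, and raising it lowers the residual error at the price of more events to encode.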