Speech concatenation and synthesis using an overlap-add sinusoidal model
In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both...
Main authors: | Macon, M.W. ; Clements, M.A. |
---|---|
Format: | Conference proceeding |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | 364 |
---|---|
container_issue | |
container_start_page | 361 |
container_title | |
container_volume | 1 |
creator | Macon, M.W. ; Clements, M.A. |
description | In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented. |
doi_str_mv | 10.1109/ICASSP.1996.541107 |
format | Conference Proceeding |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, 1996, Vol. 1, p. 361-364 |
issn | 1520-6149 ; 2379-190X |
language | eng |
recordid | cdi_ieee_primary_541107 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Algorithm design and analysis ; Fast Fourier transforms ; Frequency ; Multiple signal classification ; Performance analysis ; Signal analysis ; Signal synthesis ; Smoothing methods ; Speech analysis ; Speech synthesis |
title | Speech concatenation and synthesis using an overlap-add sinusoidal model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T05%3A10%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Speech%20concatenation%20and%20synthesis%20using%20an%20overlap-add%20sinusoidal%20model&rft.btitle=1996%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing%20Conference%20Proceedings&rft.au=Macon,%20M.W.&rft.date=1996&rft.volume=1&rft.spage=361&rft.epage=364%20vol.%201&rft.pages=361-364%20vol.%201&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780331921&rft.isbn_list=0780331923&rft_id=info:doi/10.1109/ICASSP.1996.541107&rft_dat=%3Cieee_6IE%3E541107%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=541107&rfr_iscdi=true |