Speech concatenation and synthesis using an overlap-add sinusoidal model

In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Macon, M.W., Clements, M.A.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 364 vol. 1
container_issue
container_start_page 361
container_title
container_volume 1
creator Macon, M.W.
Clements, M.A.
description In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.
doi_str_mv 10.1109/ICASSP.1996.541107
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_541107</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>541107</ieee_id><sourcerecordid>541107</sourcerecordid><originalsourceid>FETCH-LOGICAL-i104t-5724478814934276361579069752441d4c4dc0fef86541dd1ffdc0cafc910c463</originalsourceid><addsrcrecordid>eNotUNlqwzAQFD2gJvUP5Ek_YFer03osoW0KgRbcQt-C0NGoOLKxnEL-voJ0GVhmBpbZQWgNpAUg-uF189j37y1oLVvBi6SuUEWZ0g1o8nWNaq06UsAYaAo3qAJBSSOB6ztU5_xDynAhaMcrtO0n7-0B2zFZs_hkljgmbJLD-ZyWg88x41OO6btoePz182Cmxrhix3TKY3RmwMfR-eEe3QYzZF__7xX6fH762Gyb3dtLCbxrIhC-NEJRzlXXlTCMUyWZBKE0kVqJYoDjljtLgg-dLK85ByEUbk2wGojlkq3Q-nI3eu_30xyPZj7vLzWwP40STpQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Speech concatenation and synthesis using an overlap-add sinusoidal model</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Macon, M.W. ; Clements, M.A.</creator><creatorcontrib>Macon, M.W. ; Clements, M.A.</creatorcontrib><description>In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780331921</identifier><identifier>ISBN: 0780331923</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.1996.541107</identifier><language>eng</language><publisher>IEEE</publisher><subject>Algorithm design and analysis ; Fast Fourier transforms ; Frequency ; Multiple signal classification ; Performance analysis ; Signal analysis ; Signal synthesis ; Smoothing methods ; Speech analysis ; Speech synthesis</subject><ispartof>1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, 1996, Vol.1, p.361-364 vol. 1</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/541107$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/541107$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Macon, M.W.</creatorcontrib><creatorcontrib>Clements, M.A.</creatorcontrib><title>Speech concatenation and synthesis using an overlap-add sinusoidal model</title><title>1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings</title><addtitle>ICASSP</addtitle><description>In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.</description><subject>Algorithm design and analysis</subject><subject>Fast Fourier transforms</subject><subject>Frequency</subject><subject>Multiple signal classification</subject><subject>Performance analysis</subject><subject>Signal analysis</subject><subject>Signal synthesis</subject><subject>Smoothing methods</subject><subject>Speech analysis</subject><subject>Speech synthesis</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780331921</isbn><isbn>0780331923</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1996</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotUNlqwzAQFD2gJvUP5Ek_YFer03osoW0KgRbcQt-C0NGoOLKxnEL-voJ0GVhmBpbZQWgNpAUg-uF189j37y1oLVvBi6SuUEWZ0g1o8nWNaq06UsAYaAo3qAJBSSOB6ztU5_xDynAhaMcrtO0n7-0B2zFZs_hkljgmbJLD-ZyWg88x41OO6btoePz182Cmxrhix3TKY3RmwMfR-eEe3QYzZF__7xX6fH762Gyb3dtLCbxrIhC-NEJRzlXXlTCMUyWZBKE0kVqJYoDjljtLgg-dLK85ByEUbk2wGojlkq3Q-nI3eu_30xyPZj7vLzWwP40STpQ</recordid><startdate>1996</startdate><enddate>1996</enddate><creator>Macon, M.W.</creator><creator>Clements, M.A.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1996</creationdate><title>Speech concatenation and synthesis using an overlap-add sinusoidal model</title><author>Macon, M.W. ; Clements, M.A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i104t-5724478814934276361579069752441d4c4dc0fef86541dd1ffdc0cafc910c463</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Algorithm design and analysis</topic><topic>Fast Fourier transforms</topic><topic>Frequency</topic><topic>Multiple signal classification</topic><topic>Performance analysis</topic><topic>Signal analysis</topic><topic>Signal synthesis</topic><topic>Smoothing methods</topic><topic>Speech analysis</topic><topic>Speech synthesis</topic><toplevel>online_resources</toplevel><creatorcontrib>Macon, M.W.</creatorcontrib><creatorcontrib>Clements, M.A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Macon, M.W.</au><au>Clements, M.A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Speech concatenation and synthesis using an overlap-add sinusoidal model</atitle><btitle>1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings</btitle><stitle>ICASSP</stitle><date>1996</date><risdate>1996</risdate><volume>1</volume><spage>361</spage><epage>364 vol. 1</epage><pages>361-364 vol. 1</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780331921</isbn><isbn>0780331923</isbn><abstract>In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.1996.541107</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, 1996, Vol.1, p.361-364 vol. 1
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_541107
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Algorithm design and analysis
Fast Fourier transforms
Frequency
Multiple signal classification
Performance analysis
Signal analysis
Signal synthesis
Smoothing methods
Speech analysis
Speech synthesis
title Speech concatenation and synthesis using an overlap-add sinusoidal model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T05%3A10%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Speech%20concatenation%20and%20synthesis%20using%20an%20overlap-add%20sinusoidal%20model&rft.btitle=1996%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing%20Conference%20Proceedings&rft.au=Macon,%20M.W.&rft.date=1996&rft.volume=1&rft.spage=361&rft.epage=364%20vol.%201&rft.pages=361-364%20vol.%201&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780331921&rft.isbn_list=0780331923&rft_id=info:doi/10.1109/ICASSP.1996.541107&rft_dat=%3Cieee_6IE%3E541107%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=541107&rfr_iscdi=true