Speech concatenation and synthesis using an overlap-add sinusoidal model

In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Macon, M.W., Clements, M.A.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Algorithm design and analysis Fast Fourier transforms Frequency Multiple signal classification Performance analysis Signal analysis Signal synthesis Smoothing methods Speech analysis Speech synthesis
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	364 vol. 1
container_issue
container_start_page	361
container_title
container_volume	1
creator	Macon, M.W. Clements, M.A.
description	In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.
doi_str_mv	10.1109/ICASSP.1996.541107
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_541107</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>541107</ieee_id><sourcerecordid>541107</sourcerecordid><originalsourceid>FETCH-LOGICAL-i104t-5724478814934276361579069752441d4c4dc0fef86541dd1ffdc0cafc910c463</originalsourceid><addsrcrecordid>eNotUNlqwzAQFD2gJvUP5Ek_YFer03osoW0KgRbcQt-C0NGoOLKxnEL-voJ0GVhmBpbZQWgNpAUg-uF189j37y1oLVvBi6SuUEWZ0g1o8nWNaq06UsAYaAo3qAJBSSOB6ztU5_xDynAhaMcrtO0n7-0B2zFZs_hkljgmbJLD-ZyWg88x41OO6btoePz182Cmxrhix3TKY3RmwMfR-eEe3QYzZF__7xX6fH762Gyb3dtLCbxrIhC-NEJRzlXXlTCMUyWZBKE0kVqJYoDjljtLgg-dLK85ByEUbk2wGojlkq3Q-nI3eu_30xyPZj7vLzWwP40STpQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Speech concatenation and synthesis using an overlap-add sinusoidal model</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Macon, M.W. ; Clements, M.A.</creator><creatorcontrib>Macon, M.W. ; Clements, M.A.</creatorcontrib><description>In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780331921</identifier><identifier>ISBN: 0780331923</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.1996.541107</identifier><language>eng</language><publisher>IEEE</publisher><subject>Algorithm design and analysis ; Fast Fourier transforms ; Frequency ; Multiple signal classification ; Performance analysis ; Signal analysis ; Signal synthesis ; Smoothing methods ; Speech analysis ; Speech synthesis</subject><ispartof>1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, 1996, Vol.1, p.361-364 vol. 1</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/541107$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/541107$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Macon, M.W.</creatorcontrib><creatorcontrib>Clements, M.A.</creatorcontrib><title>Speech concatenation and synthesis using an overlap-add sinusoidal model</title><title>1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings</title><addtitle>ICASSP</addtitle><description>In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.</description><subject>Algorithm design and analysis</subject><subject>Fast Fourier transforms</subject><subject>Frequency</subject><subject>Multiple signal classification</subject><subject>Performance analysis</subject><subject>Signal analysis</subject><subject>Signal synthesis</subject><subject>Smoothing methods</subject><subject>Speech analysis</subject><subject>Speech synthesis</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780331921</isbn><isbn>0780331923</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1996</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotUNlqwzAQFD2gJvUP5Ek_YFer03osoW0KgRbcQt-C0NGoOLKxnEL-voJ0GVhmBpbZQWgNpAUg-uF189j37y1oLVvBi6SuUEWZ0g1o8nWNaq06UsAYaAo3qAJBSSOB6ztU5_xDynAhaMcrtO0n7-0B2zFZs_hkljgmbJLD-ZyWg88x41OO6btoePz182Cmxrhix3TKY3RmwMfR-eEe3QYzZF__7xX6fH762Gyb3dtLCbxrIhC-NEJRzlXXlTCMUyWZBKE0kVqJYoDjljtLgg-dLK85ByEUbk2wGojlkq3Q-nI3eu_30xyPZj7vLzWwP40STpQ</recordid><startdate>1996</startdate><enddate>1996</enddate><creator>Macon, M.W.</creator><creator>Clements, M.A.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1996</creationdate><title>Speech concatenation and synthesis using an overlap-add sinusoidal model</title><author>Macon, M.W. ; Clements, M.A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i104t-5724478814934276361579069752441d4c4dc0fef86541dd1ffdc0cafc910c463</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Algorithm design and analysis</topic><topic>Fast Fourier transforms</topic><topic>Frequency</topic><topic>Multiple signal classification</topic><topic>Performance analysis</topic><topic>Signal analysis</topic><topic>Signal synthesis</topic><topic>Smoothing methods</topic><topic>Speech analysis</topic><topic>Speech synthesis</topic><toplevel>online_resources</toplevel><creatorcontrib>Macon, M.W.</creatorcontrib><creatorcontrib>Clements, M.A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Macon, M.W.</au><au>Clements, M.A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Speech concatenation and synthesis using an overlap-add sinusoidal model</atitle><btitle>1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings</btitle><stitle>ICASSP</stitle><date>1996</date><risdate>1996</risdate><volume>1</volume><spage>361</spage><epage>364 vol. 1</epage><pages>361-364 vol. 1</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780331921</isbn><isbn>0780331923</isbn><abstract>In this paper, an algorithm for the concatenation of speech signal segments taken from disjoint utterances is presented. The algorithm is based on the analysis-by-synthesis/overlap-add (ABS/OLA) sinusoidal model, which is capable of performing high quality pitch- and time-scale modification of both speech and music signals. With the incorporation of concatenation and smoothing techniques, the model is capable of smoothing the transitions between separately-analyzed speech segments by matching the time- and frequency-domain characteristics of the signals at their boundaries. The application of these techniques in a text-to-speech system based on concatenation of diphone sinusoidal models is also presented.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.1996.541107</doi></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, 1996, Vol.1, p.361-364 vol. 1
issn	1520-6149 2379-190X
language	eng
recordid	cdi_ieee_primary_541107
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Algorithm design and analysis Fast Fourier transforms Frequency Multiple signal classification Performance analysis Signal analysis Signal synthesis Smoothing methods Speech analysis Speech synthesis
title	Speech concatenation and synthesis using an overlap-add sinusoidal model
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T05%3A10%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Speech%20concatenation%20and%20synthesis%20using%20an%20overlap-add%20sinusoidal%20model&rft.btitle=1996%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing%20Conference%20Proceedings&rft.au=Macon,%20M.W.&rft.date=1996&rft.volume=1&rft.spage=361&rft.epage=364%20vol.%201&rft.pages=361-364%20vol.%201&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780331921&rft.isbn_list=0780331923&rft_id=info:doi/10.1109/ICASSP.1996.541107&rft_dat=%3Cieee_6IE%3E541107%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=541107&rfr_iscdi=true