METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT

The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a mo...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	GAO, Zhengkun, ZHANG, Junteng, WANG, Wenfu, SUN, Tao
Format:	Patent
Sprache:	eng ; fre ; ger
Schlagworte:	ACOUSTICS CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	GAO, Zhengkun ZHANG, Junteng WANG, Wenfu SUN, Tao
description	The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP3879525A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP3879525A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP3879525A13</originalsourceid><addsrcrecordid>eNqNi7sKwjAUQLs4iPoP9wPqoKVox5DcNgHzILkRdClF4iRaqF_hV9uKq-BwOMs58-ylkaQVwMyIc8wzigFq64E8U0aZBrQVeMjhZxhOhiQGdZ7i4BC5zEHgUXHMIZD1rMHxFirqz82tdpHQg_O28UxPFpHTMptdu9uQVl8vMqiRuFyn_tGmoe8u6Z6eLbpiv6vKbck2xR_JGze3Png</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><source>esp@cenet</source><creator>GAO, Zhengkun ; ZHANG, Junteng ; WANG, Wenfu ; SUN, Tao</creator><creatorcontrib>GAO, Zhengkun ; ZHANG, Junteng ; WANG, Wenfu ; SUN, Tao</creatorcontrib><description>The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.</description><language>eng ; fre ; ger</language><subject>ACOUSTICS ; CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210915&DB=EPODOC&CC=EP&NR=3879525A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210915&DB=EPODOC&CC=EP&NR=3879525A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GAO, Zhengkun</creatorcontrib><creatorcontrib>ZHANG, Junteng</creatorcontrib><creatorcontrib>WANG, Wenfu</creatorcontrib><creatorcontrib>SUN, Tao</creatorcontrib><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><description>The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.</description><subject>ACOUSTICS</subject><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi7sKwjAUQLs4iPoP9wPqoKVox5DcNgHzILkRdClF4iRaqF_hV9uKq-BwOMs58-ylkaQVwMyIc8wzigFq64E8U0aZBrQVeMjhZxhOhiQGdZ7i4BC5zEHgUXHMIZD1rMHxFirqz82tdpHQg_O28UxPFpHTMptdu9uQVl8vMqiRuFyn_tGmoe8u6Z6eLbpiv6vKbck2xR_JGze3Png</recordid><startdate>20210915</startdate><enddate>20210915</enddate><creator>GAO, Zhengkun</creator><creator>ZHANG, Junteng</creator><creator>WANG, Wenfu</creator><creator>SUN, Tao</creator><scope>EVB</scope></search><sort><creationdate>20210915</creationdate><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><author>GAO, Zhengkun ; ZHANG, Junteng ; WANG, Wenfu ; SUN, Tao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP3879525A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>GAO, Zhengkun</creatorcontrib><creatorcontrib>ZHANG, Junteng</creatorcontrib><creatorcontrib>WANG, Wenfu</creatorcontrib><creatorcontrib>SUN, Tao</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GAO, Zhengkun</au><au>ZHANG, Junteng</au><au>WANG, Wenfu</au><au>SUN, Tao</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><date>2021-09-15</date><risdate>2021</risdate><abstract>The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; fre ; ger
recordid	cdi_epo_espacenet_EP3879525A1
source	esp@cenet
subjects	ACOUSTICS CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T17%3A39%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GAO,%20Zhengkun&rft.date=2021-09-15&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP3879525A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true