METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT

The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a mo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GAO, Zhengkun, ZHANG, Junteng, WANG, Wenfu, SUN, Tao
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator GAO, Zhengkun
ZHANG, Junteng
WANG, Wenfu
SUN, Tao
description The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP3879525A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP3879525A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP3879525A13</originalsourceid><addsrcrecordid>eNqNi7sKwjAUQLs4iPoP9wPqoKVox5DcNgHzILkRdClF4iRaqF_hV9uKq-BwOMs58-ylkaQVwMyIc8wzigFq64E8U0aZBrQVeMjhZxhOhiQGdZ7i4BC5zEHgUXHMIZD1rMHxFirqz82tdpHQg_O28UxPFpHTMptdu9uQVl8vMqiRuFyn_tGmoe8u6Z6eLbpiv6vKbck2xR_JGze3Png</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><source>esp@cenet</source><creator>GAO, Zhengkun ; ZHANG, Junteng ; WANG, Wenfu ; SUN, Tao</creator><creatorcontrib>GAO, Zhengkun ; ZHANG, Junteng ; WANG, Wenfu ; SUN, Tao</creatorcontrib><description>The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.</description><language>eng ; fre ; ger</language><subject>ACOUSTICS ; CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210915&amp;DB=EPODOC&amp;CC=EP&amp;NR=3879525A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210915&amp;DB=EPODOC&amp;CC=EP&amp;NR=3879525A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GAO, Zhengkun</creatorcontrib><creatorcontrib>ZHANG, Junteng</creatorcontrib><creatorcontrib>WANG, Wenfu</creatorcontrib><creatorcontrib>SUN, Tao</creatorcontrib><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><description>The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.</description><subject>ACOUSTICS</subject><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi7sKwjAUQLs4iPoP9wPqoKVox5DcNgHzILkRdClF4iRaqF_hV9uKq-BwOMs58-ylkaQVwMyIc8wzigFq64E8U0aZBrQVeMjhZxhOhiQGdZ7i4BC5zEHgUXHMIZD1rMHxFirqz82tdpHQg_O28UxPFpHTMptdu9uQVl8vMqiRuFyn_tGmoe8u6Z6eLbpiv6vKbck2xR_JGze3Png</recordid><startdate>20210915</startdate><enddate>20210915</enddate><creator>GAO, Zhengkun</creator><creator>ZHANG, Junteng</creator><creator>WANG, Wenfu</creator><creator>SUN, Tao</creator><scope>EVB</scope></search><sort><creationdate>20210915</creationdate><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><author>GAO, Zhengkun ; ZHANG, Junteng ; WANG, Wenfu ; SUN, Tao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP3879525A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>GAO, Zhengkun</creatorcontrib><creatorcontrib>ZHANG, Junteng</creatorcontrib><creatorcontrib>WANG, Wenfu</creatorcontrib><creatorcontrib>SUN, Tao</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GAO, Zhengkun</au><au>ZHANG, Junteng</au><au>WANG, Wenfu</au><au>SUN, Tao</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT</title><date>2021-09-15</date><risdate>2021</risdate><abstract>The present disclosure discloses a method and apparatus for training a model, a method and apparatus for synthesizing a speech, a device, a storage medium and a computer program product, and relates to the field of natural language processing and deep learning technology. At a training stage of a model, an implementation specifically includes: determining a phoneme feature and a prosodic word boundary feature of sample text data; inserting a pause character into the phoneme feature according to the prosodic word boundary feature to obtain a combined feature of the sample text data; and training an initial speech synthesis model according to the combined feature of the sample text data, to obtain a target speech synthesis model. According to the technique in the present disclosure, the fluency of the speech synthesis is improved.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; fre ; ger
recordid cdi_epo_espacenet_EP3879525A1
source esp@cenet
subjects ACOUSTICS
CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title METHOD AND APPARATUS FOR TRAINING MODEL, METHOD AND APPARATUS FOR SYNTHESIZING SPEECH, DEVICE, STORAGE MEDIUM AND COMPUTER PROGRAM PRODUCT
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T17%3A39%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GAO,%20Zhengkun&rft.date=2021-09-15&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP3879525A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true