Speech synthesis method and device, computer equipment and storage medium

The invention relates to a speech synthesis method and device, computer equipment, a storage medium and a computer program product. The method comprises the following steps: acquiring reading emotion information when a source speaking object reads voice data; fusing the reading emotion information a...
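The steps named in the abstract (fuse emotion and tone information, build a phoneme vector sequence, feed both to a synthesis model) can be sketched roughly as follows. This is only an illustrative toy under stated assumptions, not the patented method: the record does not say how the fusion is performed or what the model looks like, so concatenation is used as a stand-in, and every name here (`fuse`, `PHONEME_TABLE`, `phoneme_vectors`, `synthesize`) and every number is hypothetical.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def fuse(emotion: Sequence[float], tone: Sequence[float]) -> Vector:
    """Build a tone emotion fusion vector.

    Assumption: plain concatenation. The abstract does not specify
    the fusion operation.
    """
    return list(emotion) + list(tone)


# Hypothetical phoneme embedding table; a real system would learn these.
PHONEME_TABLE: Dict[str, Vector] = {
    "n": [1.0, 0.0],
    "i": [0.0, 1.0],
    "h": [0.5, 0.5],
    "ao": [0.2, 0.8],
}


def phoneme_vectors(phonemes: Sequence[str]) -> List[Vector]:
    """Map the phonemes of the text to be synthesized to a vector sequence."""
    return [list(PHONEME_TABLE[p]) for p in phonemes]


def synthesize(fusion: Vector, phoneme_seq: List[Vector]) -> List[Vector]:
    """Placeholder for the emotion transfer speech synthesis model.

    It only conditions each phoneme vector on the fusion vector by
    concatenation and returns one "frame" per phoneme; no audio is produced.
    """
    return [vec + fusion for vec in phoneme_seq]


# Made-up inputs: emotion from the source speaker, tone from the target speaker.
emotion = [0.9, 0.1]
tone = [0.3, 0.7, 0.5]

fusion = fuse(emotion, tone)
frames = synthesize(fusion, phoneme_vectors(["n", "i", "h", "ao"]))
```

The point of the sketch is the data flow: the source speaker contributes only the emotion vector, the target speaker only the tone vector, and the text enters separately as phoneme vectors.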

Detailed description

Bibliographic details
Main authors:	XIE RUI, ZHOU YANG, LIU CHANG, XIONG JIA, ZENG RUIHONG, CHEN GUANGYAO, LAN XIANG, PAN ZISHENG, MA JINLONG, XU ZHIJIAN, HUANG XIANGKANG, WU HUIYANG
Format:	Patent
Language:	chi ; eng
Subjects:	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	XIE RUI ; ZHOU YANG ; LIU CHANG ; XIONG JIA ; ZENG RUIHONG ; CHEN GUANGYAO ; LAN XIANG ; PAN ZISHENG ; MA JINLONG ; XU ZHIJIAN ; HUANG XIANGKANG ; WU HUIYANG
description	The invention relates to a speech synthesis method and device, computer equipment, a storage medium and a computer program product. The method comprises the following steps: acquiring reading emotion information from a source speaking object reading voice data; fusing the reading emotion information with the tone information corresponding to a target speaking object to obtain a tone emotion fusion vector; obtaining a phoneme vector sequence corresponding to a speech text to be synthesized; and inputting the tone emotion fusion vector and the phoneme vector sequence into an emotion transfer speech synthesis model to obtain emotion transfer speech, wherein the emotion transfer speech comprises the speech of the target speaking object reading the speech text to be synthesized with the reading emotion of the source speaking object. Adopting the method can effectively improve the efficiency of synthesizing emotional speech. 本申请涉及一种语音合成方法、装置、计算机设备、存储介质和计算机程序产品。所述方法包括：获取源说话对象朗读语音数据时的朗读情感信息；对所述朗读情感信息和目标
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN117524261A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN117524261A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN117524261A3</originalsourceid><addsrcrecordid>eNqNi7sKAjEUBdNYiPoP116LrK9aFkUbG-2XkBxNwDzceyP494r4AVZTzMxQHc8FsJ74lcSDA1OE-OzIJEcOz2AxI5tjqYKe8KihRCT5apbcmxs-hws1jtXgau6MyY8jNd3vLu1hjpI7cDEWCdK1J603q2bZrPV28U_zBvNpNMw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Speech synthesis method and device, computer equipment and storage medium</title><source>esp@cenet</source><creator>XIE RUI ; ZHOU YANG ; LIU CHANG ; XIONG JIA ; ZENG RUIHONG ; CHEN GUANGYAO ; LAN XIANG ; PAN ZISHENG ; MA JINLONG ; XU ZHIJIAN ; HUANG XIANGKANG ; WU HUIYANG</creator><creatorcontrib>XIE RUI ; ZHOU YANG ; LIU CHANG ; XIONG JIA ; ZENG RUIHONG ; CHEN GUANGYAO ; LAN XIANG ; PAN ZISHENG ; MA JINLONG ; XU ZHIJIAN ; HUANG XIANGKANG ; WU HUIYANG</creatorcontrib><description>The invention relates to a speech synthesis method and device, computer equipment, a storage medium and a computer program product. The method comprises the following steps: acquiring reading emotion information when a source speaking object reads voice data; fusing the reading emotion information and the tone information corresponding to the target speaking object to obtain a tone emotion fusion vector; obtaining a phoneme vector sequence corresponding to a speech text to be synthesized, and inputting the tone emotion fusion vector and the phoneme vector sequence into an emotion transfer speech synthesis model to obtain emotion transfer speech; wherein the emotion migration voice comprises the voice of the target speaking object reading the to-be-synthesized voice text according to the reading emotion of the source speaking object. 
By adopting the method, the synthesis efficiency of the emotional speech can be effectively improved. 本申请涉及一种语音合成方法、装置、计算机设备、存储介质和计算机程序产品。所述方法包括：获取源说话对象朗读语音数据时的朗读情感信息；对所述朗读情感信息和目标</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240206&DB=EPODOC&CC=CN&NR=117524261A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,778,883,25547,76298</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240206&DB=EPODOC&CC=CN&NR=117524261A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>XIE RUI</creatorcontrib><creatorcontrib>ZHOU YANG</creatorcontrib><creatorcontrib>LIU CHANG</creatorcontrib><creatorcontrib>XIONG JIA</creatorcontrib><creatorcontrib>ZENG RUIHONG</creatorcontrib><creatorcontrib>CHEN GUANGYAO</creatorcontrib><creatorcontrib>LAN XIANG</creatorcontrib><creatorcontrib>PAN ZISHENG</creatorcontrib><creatorcontrib>MA JINLONG</creatorcontrib><creatorcontrib>XU ZHIJIAN</creatorcontrib><creatorcontrib>HUANG XIANGKANG</creatorcontrib><creatorcontrib>WU HUIYANG</creatorcontrib><title>Speech synthesis method and device, computer equipment and storage medium</title><description>The invention relates to a speech synthesis method and device, computer equipment, a storage medium and a computer program product. 
The method comprises the following steps: acquiring reading emotion information when a source speaking object reads voice data; fusing the reading emotion information and the tone information corresponding to the target speaking object to obtain a tone emotion fusion vector; obtaining a phoneme vector sequence corresponding to a speech text to be synthesized, and inputting the tone emotion fusion vector and the phoneme vector sequence into an emotion transfer speech synthesis model to obtain emotion transfer speech; wherein the emotion migration voice comprises the voice of the target speaking object reading the to-be-synthesized voice text according to the reading emotion of the source speaking object. By adopting the method, the synthesis efficiency of the emotional speech can be effectively improved. 本申请涉及一种语音合成方法、装置、计算机设备、存储介质和计算机程序产品。所述方法包括：获取源说话对象朗读语音数据时的朗读情感信息；对所述朗读情感信息和目标</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2024</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi7sKAjEUBdNYiPoP116LrK9aFkUbG-2XkBxNwDzceyP494r4AVZTzMxQHc8FsJ74lcSDA1OE-OzIJEcOz2AxI5tjqYKe8KihRCT5apbcmxs-hws1jtXgau6MyY8jNd3vLu1hjpI7cDEWCdK1J603q2bZrPV28U_zBvNpNMw</recordid><startdate>20240206</startdate><enddate>20240206</enddate><creator>XIE RUI</creator><creator>ZHOU YANG</creator><creator>LIU CHANG</creator><creator>XIONG JIA</creator><creator>ZENG RUIHONG</creator><creator>CHEN GUANGYAO</creator><creator>LAN XIANG</creator><creator>PAN ZISHENG</creator><creator>MA JINLONG</creator><creator>XU ZHIJIAN</creator><creator>HUANG XIANGKANG</creator><creator>WU 
HUIYANG</creator><scope>EVB</scope></search><sort><creationdate>20240206</creationdate><title>Speech synthesis method and device, computer equipment and storage medium</title><author>XIE RUI ; ZHOU YANG ; LIU CHANG ; XIONG JIA ; ZENG RUIHONG ; CHEN GUANGYAO ; LAN XIANG ; PAN ZISHENG ; MA JINLONG ; XU ZHIJIAN ; HUANG XIANGKANG ; WU HUIYANG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN117524261A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2024</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>XIE RUI</creatorcontrib><creatorcontrib>ZHOU YANG</creatorcontrib><creatorcontrib>LIU CHANG</creatorcontrib><creatorcontrib>XIONG JIA</creatorcontrib><creatorcontrib>ZENG RUIHONG</creatorcontrib><creatorcontrib>CHEN GUANGYAO</creatorcontrib><creatorcontrib>LAN XIANG</creatorcontrib><creatorcontrib>PAN ZISHENG</creatorcontrib><creatorcontrib>MA JINLONG</creatorcontrib><creatorcontrib>XU ZHIJIAN</creatorcontrib><creatorcontrib>HUANG XIANGKANG</creatorcontrib><creatorcontrib>WU HUIYANG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>XIE RUI</au><au>ZHOU YANG</au><au>LIU CHANG</au><au>XIONG JIA</au><au>ZENG RUIHONG</au><au>CHEN GUANGYAO</au><au>LAN XIANG</au><au>PAN ZISHENG</au><au>MA JINLONG</au><au>XU ZHIJIAN</au><au>HUANG XIANGKANG</au><au>WU HUIYANG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Speech synthesis method and device, computer equipment and storage medium</title><date>2024-02-06</date><risdate>2024</risdate><abstract>The invention relates 
to a speech synthesis method and device, computer equipment, a storage medium and a computer program product. The method comprises the following steps: acquiring reading emotion information when a source speaking object reads voice data; fusing the reading emotion information and the tone information corresponding to the target speaking object to obtain a tone emotion fusion vector; obtaining a phoneme vector sequence corresponding to a speech text to be synthesized, and inputting the tone emotion fusion vector and the phoneme vector sequence into an emotion transfer speech synthesis model to obtain emotion transfer speech; wherein the emotion migration voice comprises the voice of the target speaking object reading the to-be-synthesized voice text according to the reading emotion of the source speaking object. By adopting the method, the synthesis efficiency of the emotional speech can be effectively improved. 本申请涉及一种语音合成方法、装置、计算机设备、存储介质和计算机程序产品。所述方法包括：获取源说话对象朗读语音数据时的朗读情感信息；对所述朗读情感信息和目标</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN117524261A
source	esp@cenet
subjects	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION
title	Speech synthesis method and device, computer equipment and storage medium
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T01%3A06%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=XIE%20RUI&rft.date=2024-02-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN117524261A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true