Voice conversion method and device, electronic equipment and storage medium

The embodiment of the invention discloses a voice conversion method and device, electronic equipment and a storage medium. The method comprises the following steps: acquiring first voice data of a source speaker and second voice data of a target speaker; extracting a first linear spectrum from the f...
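The five steps named in the record's description field (acquire, extract, encode, transform, reconstruct) can be pictured as a data-flow sketch. The record does not disclose network architectures, so everything below is a hypothetical placeholder: the function names, the frame sizes, the fixed random projections standing in for trained networks, the diagonal-Gaussian form of the "content distribution", and the affine form of the "distribution transformation" are all assumptions made only to show how the pieces might connect. NumPy is assumed to be available.

```python
import numpy as np

def linear_spectrum(wave, n_fft=256, hop=64):
    """Magnitude STFT (linear-frequency spectrum), shape (frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(wave) - n_fft) // hop
    frames = np.stack([wave[i * hop:i * hop + n_fft] * window for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))

def speaker_vector(wave, dim=16):
    """Placeholder speaker embedding: time-averaged log spectrum, randomly projected."""
    pooled = np.log1p(linear_spectrum(wave)).mean(axis=0)
    proj = np.random.default_rng(0).standard_normal((pooled.size, dim)) / np.sqrt(pooled.size)
    return pooled @ proj

def encode(spec, latent=8):
    """Placeholder encoder: per-frame mean and log-variance of a diagonal Gaussian."""
    rng = np.random.default_rng(1)
    feats = np.log1p(spec)
    w_mean = rng.standard_normal((spec.shape[1], latent)) * 0.01
    w_log_var = rng.standard_normal((spec.shape[1], latent)) * 0.01
    return feats @ w_mean, feats @ w_log_var

def transform(mean, log_var, spk):
    """Assumed speaker-conditioned affine map z -> a * z + b applied to N(mean, var)."""
    rng = np.random.default_rng(2)
    latent = mean.shape[1]
    log_scale = np.tanh(spk @ rng.standard_normal((spk.size, latent)) * 0.1)
    shift = spk @ rng.standard_normal((spk.size, latent)) * 0.1
    # Scaling a Gaussian by a = exp(log_scale) adds 2 * log_scale to its log-variance.
    return mean * np.exp(log_scale) + shift, log_var + 2.0 * log_scale

def reconstruct(mean, log_var, spk, hop=64):
    """Placeholder decoder: sample the latent, map each frame to `hop` waveform samples."""
    rng = np.random.default_rng(3)
    z = mean + np.exp(0.5 * log_var) * rng.standard_normal(mean.shape)
    cond = np.concatenate([z, np.tile(spk, (z.shape[0], 1))], axis=1)
    w = rng.standard_normal((cond.shape[1], hop)) * 0.01
    return np.tanh(cond @ w).reshape(-1)

def convert(source_wave, target_wave):
    spec = linear_spectrum(source_wave)                # first linear spectrum
    spk = speaker_vector(target_wave)                  # first speaker vector
    mean, log_var = encode(spec)                       # first content distribution
    mean2, log_var2 = transform(mean, log_var, spk)    # second content distribution
    return reconstruct(mean2, log_var2, spk)           # voice data of the target speaker

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    converted = convert(rng.standard_normal(4096), rng.standard_normal(8000))
    print(converted.shape)
```

The sketch only mirrors the order of operations and which inputs feed which step: the speaker vector is used twice, once to transform the content distribution and once again at reconstruction. The output is not intelligible speech, since no component is trained.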

Bibliographic details
Main authors:	ZHONG RONGXIU, DENG CHAO, YANG HUIBAO, XU LE, LIU YING, ZHANG SHILEI
Format:	Patent
Language:	chi ; eng
Subjects:	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION

creator	ZHONG RONGXIU ; DENG CHAO ; YANG HUIBAO ; XU LE ; LIU YING ; ZHANG SHILEI
description	An embodiment of the invention discloses a voice conversion method and device, electronic equipment, and a storage medium. The method comprises the following steps: acquiring first voice data of a source speaker and second voice data of a target speaker; extracting a first linear spectrum from the first voice data, and extracting a first speaker vector from the second voice data; encoding the first linear spectrum to obtain a first content distribution of the first voice data; performing a distribution transformation on the first content distribution according to the first speaker vector to obtain a second content distribution; and reconstructing voice data of the target speaker according to the first speaker vector and the second content distribution.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN117037823A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN117037823A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN117037823A3</originalsourceid><addsrcrecordid>eNqNi7EKwjAURbM4iPoPz13BmqGuUhRBcBLXEpJrDTTvxSTt91vED3A6wzlnrq4P8RZkhUek7IUpoLzEkWFHDuMkN4QetiRhbwnvwccALt8gF0mmw_Q4P4Slmj1Nn7H6caHW59O9uWwRpUWOxoJR2uZWVfVO14e9Pup_mg9oajWH</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Voice conversion method and device, electronic equipment and storage medium</title><source>esp@cenet</source><creator>ZHONG RONGXIU ; DENG CHAO ; YANG HUIBAO ; XU LE ; LIU YING ; ZHANG SHILEI</creator><creatorcontrib>ZHONG RONGXIU ; DENG CHAO ; YANG HUIBAO ; XU LE ; LIU YING ; ZHANG SHILEI</creatorcontrib><description>The embodiment of the invention discloses a voice conversion method and device, electronic equipment and a storage medium. The method comprises the following steps: acquiring first voice data of a source speaker and second voice data of a target speaker; extracting a first linear spectrum from the first voice data, and extracting a first speaker vector from the second voice data; encoding the first linear spectrum to obtain first content distribution of the first voice data; performing distribution transformation on the first content distribution according to the first speaker vector to obtain second content distribution; and reconstructing voice data of the target speaker according to the first speaker vector and the second content distribution. 
本实施例公开了一种语音转换方法、装置、电子设备和存储介质，该方法包括：获取源说话人的第一语音数据和目标说话人的第二语音数据；对所述第一语音数据提取第一线性谱，对所述第二语音数据提取第一说话人向量；通过对所述第一线性谱进行编码，得出所述第一语音数据的第一内容分布；根据所述第一说话人向量，对所述第一内容分布进行变换分布，得到第二内容分布；根据所述第一说话人向量和所述第二内容分布，重构所述目标说话人的语音数据。</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231110&DB=EPODOC&CC=CN&NR=117037823A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76318</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231110&DB=EPODOC&CC=CN&NR=117037823A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZHONG RONGXIU</creatorcontrib><creatorcontrib>DENG CHAO</creatorcontrib><creatorcontrib>YANG HUIBAO</creatorcontrib><creatorcontrib>XU LE</creatorcontrib><creatorcontrib>LIU YING</creatorcontrib><creatorcontrib>ZHANG SHILEI</creatorcontrib><title>Voice conversion method and device, electronic equipment and storage medium</title><description>The embodiment of the invention discloses a voice conversion method and device, electronic equipment and a storage medium. 
The method comprises the following steps: acquiring first voice data of a source speaker and second voice data of a target speaker; extracting a first linear spectrum from the first voice data, and extracting a first speaker vector from the second voice data; encoding the first linear spectrum to obtain first content distribution of the first voice data; performing distribution transformation on the first content distribution according to the first speaker vector to obtain second content distribution; and reconstructing voice data of the target speaker according to the first speaker vector and the second content distribution. 本实施例公开了一种语音转换方法、装置、电子设备和存储介质，该方法包括：获取源说话人的第一语音数据和目标说话人的第二语音数据；对所述第一语音数据提取第一线性谱，对所述第二语音数据提取第一说话人向量；通过对所述第一线性谱进行编码，得出所述第一语音数据的第一内容分布；根据所述第一说话人向量，对所述第一内容分布进行变换分布，得到第二内容分布；根据所述第一说话人向量和所述第二内容分布，重构所述目标说话人的语音数据。</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi7EKwjAURbM4iPoPz13BmqGuUhRBcBLXEpJrDTTvxSTt91vED3A6wzlnrq4P8RZkhUek7IUpoLzEkWFHDuMkN4QetiRhbwnvwccALt8gF0mmw_Q4P4Slmj1Nn7H6caHW59O9uWwRpUWOxoJR2uZWVfVO14e9Pup_mg9oajWH</recordid><startdate>20231110</startdate><enddate>20231110</enddate><creator>ZHONG RONGXIU</creator><creator>DENG CHAO</creator><creator>YANG HUIBAO</creator><creator>XU LE</creator><creator>LIU YING</creator><creator>ZHANG SHILEI</creator><scope>EVB</scope></search><sort><creationdate>20231110</creationdate><title>Voice conversion method and device, electronic equipment and storage medium</title><author>ZHONG RONGXIU ; DENG CHAO ; YANG HUIBAO ; XU LE ; LIU YING ; ZHANG 
SHILEI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN117037823A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>ZHONG RONGXIU</creatorcontrib><creatorcontrib>DENG CHAO</creatorcontrib><creatorcontrib>YANG HUIBAO</creatorcontrib><creatorcontrib>XU LE</creatorcontrib><creatorcontrib>LIU YING</creatorcontrib><creatorcontrib>ZHANG SHILEI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZHONG RONGXIU</au><au>DENG CHAO</au><au>YANG HUIBAO</au><au>XU LE</au><au>LIU YING</au><au>ZHANG SHILEI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Voice conversion method and device, electronic equipment and storage medium</title><date>2023-11-10</date><risdate>2023</risdate><abstract>The embodiment of the invention discloses a voice conversion method and device, electronic equipment and a storage medium. 
The method comprises the following steps: acquiring first voice data of a source speaker and second voice data of a target speaker; extracting a first linear spectrum from the first voice data, and extracting a first speaker vector from the second voice data; encoding the first linear spectrum to obtain first content distribution of the first voice data; performing distribution transformation on the first content distribution according to the first speaker vector to obtain second content distribution; and reconstructing voice data of the target speaker according to the first speaker vector and the second content distribution. 本实施例公开了一种语音转换方法、装置、电子设备和存储介质，该方法包括：获取源说话人的第一语音数据和目标说话人的第二语音数据；对所述第一语音数据提取第一线性谱，对所述第二语音数据提取第一说话人向量；通过对所述第一线性谱进行编码，得出所述第一语音数据的第一内容分布；根据所述第一说话人向量，对所述第一内容分布进行变换分布，得到第二内容分布；根据所述第一说话人向量和所述第二内容分布，重构所述目标说话人的语音数据。</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
language	chi ; eng
recordid	cdi_epo_espacenet_CN117037823A
source	esp@cenet
subjects	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION
title	Voice conversion method and device, electronic equipment and storage medium
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T17%3A04%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZHONG%20RONGXIU&rft.date=2023-11-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN117037823A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true