Audio conversion method and device, storage medium and electronic equipment

An embodiment of the invention provides an audio conversion method and device, a storage medium, and electronic equipment. The method comprises obtaining initial audio of a source speaker, performing feature recognition on the initial audio, obtaining speech recognition features and...

Bibliographic details
Main authors:	CHEN WEI, GE WENSHUO, LIU KAI
Format:	Patent
Language:	chi ; eng
Subjects:	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION

creator	CHEN WEI ; GE WENSHUO ; LIU KAI
description	An embodiment of the invention provides an audio conversion method and device, a storage medium, and electronic equipment. The method comprises obtaining initial audio of a source speaker; performing feature recognition on the initial audio to obtain the corresponding speech recognition features and audio hidden-layer features; inputting the extracted speech recognition features and audio hidden-layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker; and then generating the corresponding target audio from the target dialect acoustic features. Because the audio conversion model processes the speech recognition features and the audio hidden-layer features, the audio of any speaker is converted into the audio of the target dialect speaker: tone conversion can be achieved, the target dialect accent can be carried in the converted audio, and the voice changing effe
format	Patent
sourcerecordid	CN113223542A
date	2021-08-06
oa	free_for_read
linktohtml	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210806&DB=EPODOC&CC=CN&NR=113223542A
fulltext	fulltext_linktorsrc
language	chi ; eng
recordid	cdi_epo_espacenet_CN113223542A
source	esp@cenet
subjects	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION
title	Audio conversion method and device, storage medium and electronic equipment
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T05%3A11%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20WEI&rft.date=2021-08-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113223542A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true