Audio conversion method and device, storage medium and electronic equipment

The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN WEI, GE WENSHUO, LIU KAI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHEN WEI
GE WENSHUO
LIU KAI
description The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113223542A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113223542A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113223542A3</originalsourceid><addsrcrecordid>eNqNyr0KwjAUhuEsDqLeQ7rr0EYvoBRFEJzcS0g-9UBzTsxPr18RL8DpHZ53qS599STaCc9ImYR1QHmK15a99pjJYatzkWQf-JCnGr6ECa4kYXIar0oxgMtaLe52ytj8ulLN6XgbzjtEGZGjdWCUcbi2rek6c9h3vfnneQNd4jVx</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Audio conversion method and device, storage medium and electronic equipment</title><source>esp@cenet</source><creator>CHEN WEI ; GE WENSHUO ; LIU KAI</creator><creatorcontrib>CHEN WEI ; GE WENSHUO ; LIU KAI</creatorcontrib><description>The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210806&amp;DB=EPODOC&amp;CC=CN&amp;NR=113223542A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210806&amp;DB=EPODOC&amp;CC=CN&amp;NR=113223542A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN WEI</creatorcontrib><creatorcontrib>GE WENSHUO</creatorcontrib><creatorcontrib>LIU KAI</creatorcontrib><title>Audio conversion method and device, storage medium and electronic equipment</title><description>The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyr0KwjAUhuEsDqLeQ7rr0EYvoBRFEJzcS0g-9UBzTsxPr18RL8DpHZ53qS599STaCc9ImYR1QHmK15a99pjJYatzkWQf-JCnGr6ECa4kYXIar0oxgMtaLe52ytj8ulLN6XgbzjtEGZGjdWCUcbi2rek6c9h3vfnneQNd4jVx</recordid><startdate>20210806</startdate><enddate>20210806</enddate><creator>CHEN WEI</creator><creator>GE WENSHUO</creator><creator>LIU KAI</creator><scope>EVB</scope></search><sort><creationdate>20210806</creationdate><title>Audio conversion method and device, storage medium and electronic equipment</title><author>CHEN WEI ; GE WENSHUO ; LIU KAI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113223542A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN WEI</creatorcontrib><creatorcontrib>GE WENSHUO</creatorcontrib><creatorcontrib>LIU KAI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN WEI</au><au>GE WENSHUO</au><au>LIU KAI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Audio conversion method and device, storage medium and electronic equipment</title><date>2021-08-06</date><risdate>2021</risdate><abstract>The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN113223542A
source esp@cenet
subjects ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title Audio conversion method and device, storage medium and electronic equipment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T05%3A11%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20WEI&rft.date=2021-08-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113223542A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true