Audio conversion method and device, storage medium and electronic equipment
The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and...
Main authors: | CHEN WEI, GE WENSHUO, LIU KAI |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | CHEN WEI GE WENSHUO LIU KAI |
description | The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN113223542A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN113223542A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN113223542A3</originalsourceid><addsrcrecordid>eNqNyr0KwjAUhuEsDqLeQ7rr0EYvoBRFEJzcS0g-9UBzTsxPr18RL8DpHZ53qS599STaCc9ImYR1QHmK15a99pjJYatzkWQf-JCnGr6ECa4kYXIar0oxgMtaLe52ytj8ulLN6XgbzjtEGZGjdWCUcbi2rek6c9h3vfnneQNd4jVx</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Audio conversion method and device, storage medium and electronic equipment</title><source>esp@cenet</source><creator>CHEN WEI ; GE WENSHUO ; LIU KAI</creator><creatorcontrib>CHEN WEI ; GE WENSHUO ; LIU KAI</creatorcontrib><description>The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. 
The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210806&DB=EPODOC&CC=CN&NR=113223542A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210806&DB=EPODOC&CC=CN&NR=113223542A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN WEI</creatorcontrib><creatorcontrib>GE WENSHUO</creatorcontrib><creatorcontrib>LIU KAI</creatorcontrib><title>Audio conversion method and device, storage medium and electronic equipment</title><description>The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. 
The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyr0KwjAUhuEsDqLeQ7rr0EYvoBRFEJzcS0g-9UBzTsxPr18RL8DpHZ53qS599STaCc9ImYR1QHmK15a99pjJYatzkWQf-JCnGr6ECa4kYXIar0oxgMtaLe52ytj8ulLN6XgbzjtEGZGjdWCUcbi2rek6c9h3vfnneQNd4jVx</recordid><startdate>20210806</startdate><enddate>20210806</enddate><creator>CHEN WEI</creator><creator>GE WENSHUO</creator><creator>LIU KAI</creator><scope>EVB</scope></search><sort><creationdate>20210806</creationdate><title>Audio conversion method and device, storage medium and electronic equipment</title><author>CHEN WEI ; GE WENSHUO ; LIU 
KAI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN113223542A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN WEI</creatorcontrib><creatorcontrib>GE WENSHUO</creatorcontrib><creatorcontrib>LIU KAI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN WEI</au><au>GE WENSHUO</au><au>LIU KAI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Audio conversion method and device, storage medium and electronic equipment</title><date>2021-08-06</date><risdate>2021</risdate><abstract>The embodiment of the invention provides an audio conversion method and device, a storage medium and electronic equipment. The method comprises the steps of obtaining an initial audio of a source speaker, carrying out feature recognition on the initial audio, obtaining voice recognition features and audio hidden layer features corresponding to the initial audio, inputting the extracted speech recognition features and audio hidden layer features into an audio conversion model for tone conversion and dialect accent processing to obtain target dialect acoustic features of a target dialect speaker, and then generating a corresponding target audio according to the target dialect acoustic features. 
The voice recognition features and the audio hidden layer features are processed through the audio conversion model, the audio of any speaker is converted into the audio of the target dialect speaker, tone conversion can be achieved, target dialect accent can be carried in the converted audio, and the voice changing effe</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN113223542A |
source | esp@cenet |
subjects | ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION |
title | Audio conversion method and device, storage medium and electronic equipment |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T05%3A11%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20WEI&rft.date=2021-08-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN113223542A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
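
The processing order given in the description field (feature recognition, then conversion by the model, then audio generation) can be sketched as a three-stage pipeline. This is only an illustrative sketch and not the patented implementation: the class `ConversionPipeline` and the functions `toy_recognize`, `toy_convert`, and `toy_vocode` are hypothetical names, and the toy stages use simple arithmetic in place of the speech recognizer, the audio conversion model, and the waveform generator, none of which the record specifies.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Frame = List[float]


@dataclass
class ConversionPipeline:
    """Hypothetical wiring of the three stages named in the abstract."""
    recognize: Callable[[Sequence[float]], Tuple[List[Frame], List[Frame]]]
    convert: Callable[[List[Frame], List[Frame]], List[Frame]]
    vocode: Callable[[List[Frame]], List[float]]

    def run(self, initial_audio: Sequence[float]) -> List[float]:
        # Stage 1: feature recognition yields two per-frame feature streams.
        asr_feats, hidden_feats = self.recognize(initial_audio)
        if len(asr_feats) != len(hidden_feats):
            raise ValueError("feature streams must have the same frame count")
        # Stage 2: the conversion model maps both streams to target acoustic features.
        acoustic = self.convert(asr_feats, hidden_feats)
        # Stage 3: generate the target audio from the acoustic features.
        return self.vocode(acoustic)


def toy_recognize(audio: Sequence[float], frame_size: int = 4):
    """Toy stand-in: per-frame mean as 'recognition' feature, max/min as 'hidden' feature."""
    frames = [list(audio[i:i + frame_size]) for i in range(0, len(audio), frame_size)]
    asr = [[sum(f) / len(f)] for f in frames]
    hidden = [[max(f), min(f)] for f in frames]
    return asr, hidden


def toy_convert(asr: List[Frame], hidden: List[Frame]) -> List[Frame]:
    """Toy stand-in for the conversion model: concatenate both feature vectors per frame."""
    return [a + h for a, h in zip(asr, hidden)]


def toy_vocode(acoustic: List[Frame]) -> List[float]:
    """Toy stand-in for audio generation: flatten the frames into one sample list."""
    return [x for frame in acoustic for x in frame]


pipeline = ConversionPipeline(toy_recognize, toy_convert, toy_vocode)
converted = pipeline.run([0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0])
```

Passing the stages in as callables keeps the data flow visible without committing to any particular model; real components would replace the toy functions.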