Information processing method, device and equipment

The invention provides an information processing method, device and equipment, and the method comprises the steps: obtaining first text coding content corresponding to to-be-processed audio data; using a target generator in a generative adversarial network model to obtain target audio data according...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ZHONG RONGXIU, DENG CHAO, YANG HUIBAO, LIU YING, ZHANG SHILEI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides an information processing method, device and equipment, and the method comprises the steps: obtaining first text coding content corresponding to to-be-processed audio data; using a target generator in a generative adversarial network model to obtain target audio data according to the first text coding content and the target sound feature information; wherein the target sound feature information comprises at least one of target loudness information, target tone information and target timbre information. According to the scheme, the generative adversarial network model can be adopted to predict the voice waveform (that is, the target audio data is obtained), a vocoder is not needed to synthesize the voice waveform, end-to-end voice conversion is realized, the mismatch problem caused by vocoder cascade and the defects of noise or tone quality damage and the like existing in a result output by the vocoder are avoided, and the voice conversion efficiency is improved. The problem of noise or