TEXT DATA PROCESSING METHOD AND APPARATUS

Bibliographic details
Main authors: ZHENG, Nianzu, WANG, Disong, ZHANG, Yang, DENG, Liqun
Format: Patent
Language: eng; fre; ger
Description
Abstract: A text data processing method is disclosed, applied to the field of artificial intelligence. The method includes: obtaining target text, where the phonemes of the target text include a first phoneme and a second phoneme that are adjacent to each other (401); performing feature extraction on the first phoneme and the second phoneme to obtain a first audio feature of the first phoneme and a second audio feature of the second phoneme (402); obtaining, by using a target recurrent neural network (RNN) and based on the first audio feature, first speech data corresponding to the first phoneme, and obtaining, by using the target RNN and based on the second audio feature, second speech data corresponding to the second phoneme, where the step of obtaining the first speech data and the step of obtaining the second speech data are performed concurrently (403); and obtaining, by using a vocoder and based on the first speech data and the second speech data, audio corresponding to the first phoneme and audio corresponding to the second phoneme (404). Because the target RNN can process the first audio feature and the second audio feature concurrently, the processing of the first audio feature is decoupled from the processing of the second audio feature, which reduces the time the target RNN takes to process the audio features.
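
The four steps (401 to 404) can be illustrated with a toy Python sketch. It is not the patented implementation: the abstract does not say how the per-phoneme computations are decoupled, so the sketch assumes each phoneme's recurrence starts from its own zero hidden state. All names (`extract_feature`, `rnn_decode`, `vocoder`, `synthesize`) and the weights are hypothetical stand-ins, and a real system would use a trained network and a real vocoder.

```python
import math
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy weights; a real system would use a trained RNN.
W_IN, W_REC = 0.5, 0.8

def extract_feature(phoneme: str) -> float:
    """Stand-in for step 402: map a phoneme to a scalar 'audio feature'."""
    return (sum(ord(c) for c in phoneme) % 10) / 10.0

def rnn_decode(feature: float, n_frames: int = 3) -> list[float]:
    """Stand-in for step 403: unroll a one-unit RNN for one phoneme.

    Assumption: the hidden state starts at zero for every phoneme, so no
    state is carried over from the neighbouring phoneme.
    """
    h, frames = 0.0, []
    for _ in range(n_frames):
        h = math.tanh(W_IN * feature + W_REC * h)
        frames.append(h)
    return frames

def vocoder(speech_data: list[list[float]]) -> list[float]:
    """Stand-in for step 404: concatenate per-phoneme frames in order."""
    return [x for frames in speech_data for x in frames]

def synthesize(phonemes: list[str]) -> list[float]:
    # Step 402: one feature per phoneme.
    features = [extract_feature(p) for p in phonemes]
    # Step 403: decode every phoneme concurrently; map() preserves order.
    with ThreadPoolExecutor() as pool:
        speech_data = list(pool.map(rnn_decode, features))
    # Step 404: turn the speech data into audio.
    return vocoder(speech_data)

# Step 401: two adjacent phonemes of the target text.
audio = synthesize(["n", "i"])
```

Because no call to `rnn_decode` depends on another phoneme's result, the calls can be scheduled in any order or at the same time. The thread pool only demonstrates that independence; in practice the speed-up would more plausibly come from batching the phonemes on parallel hardware.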