Speech translation model modeling method and device based on speech synthesis data

The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilizati...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: YANG MURUN, DU QUAN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilization, and consequently the translation result is inaccurate is solved. The modeling method comprises the following steps: acquiring a general speech synthesis data set, and training to obtain a general speech synthesis model; acquiring a speech translation data set of the target domain; performing fine tuning on the universal speech synthesis model by using the speech translation data set to obtain a special speech synthesis model; inputting the source language annotation text into a special voice synthesis model, and generating a plurality of pieces of voice synthesis pseudo data according to a preset proportion to obtain a pseudo voice data set; and constructing an initial speech translation model, training the ini