Natural language translation model training and configuration

A computer-implemented method for training a natural language translation model. The computer-implemented method includes: processing one or more sets of electronic parallel documents to obtain a plurality of aligned parallel sentences; creating a first training set comprising a subset of a pluralit...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	TANG ZHIHONG, WONG KWONG YEUNG SIMON, HUANG JINAN, ZHONG ZHANCHAO, TANG SIMIN
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A computer-implemented method for training a natural language translation model. The computer-implemented method includes: processing one or more sets of electronic parallel documents to obtain a plurality of aligned parallel sentences; creating a first training set comprising a subset of a plurality of aligned parallel sentences; and training the natural language translation model using the first training set in the first stage. The computer-implemented method further includes modifying the first training set based on translation errors detected after training of the first stage; creating a second training set based on the modified first training set and at least some aligned parallel sentences not in the first training set; and training the natural language translation model by using the second training set in the second stage so as to improve the translation performance of the natural language translation model. 一种用于训练自然语言翻译模型的计算机实现的方法。该计算机实现的方法包括：处理一组或多组电子并行文档，以获得多个对齐的并行句子；创建第一训练集，其包括多个对齐的并行句子的子集；以及在第一阶段中