Missed word detection method and system based on fine tuning generative adversarial network model

The invention provides a missed word detection method and system based on a fine tuning generative adversarial network model. A to-be-detected text corpus is preprocessed to form a sequence composed of a plurality of segmented words, the segmented words in the sequence are read as embedded vectors a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SHEN XIN, LAN JIANMIN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides a missed word detection method and system based on a fine tuning generative adversarial network model. A to-be-detected text corpus is preprocessed to form a sequence composed of a plurality of segmented words, the segmented words in the sequence are read as embedded vectors according to an ERNIE word list, the embedded vectors of the plurality of segmented words form a vector sequence Eseq, and the vector sequence Eseq is stored in the ERNIE word list; adopting a distance formula to calculate the distance between the generation sequence and the standard sequence as a threshold value, preprocessing the to-be-detected sequence to obtain a to-be-detected input sequence, inputting the to-be-detected input sequence into the generation network to obtain a to-be-detected generation sequence, and comparing the distance between the to-be-detected generation sequence and the standard sequence with the threshold value; if the threshold value is greater than the threshold value, the missing word e