Missed word detection method and system based on fine tuning generative adversarial network model
The invention provides a missed word detection method and system based on a fine tuning generative adversarial network model. A to-be-detected text corpus is preprocessed to form a sequence composed of a plurality of segmented words, the segmented words in the sequence are read as embedded vectors a...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides a missed word detection method and system based on a fine tuning generative adversarial network model. A to-be-detected text corpus is preprocessed to form a sequence composed of a plurality of segmented words, the segmented words in the sequence are read as embedded vectors according to an ERNIE word list, the embedded vectors of the plurality of segmented words form a vector sequence Eseq, and the vector sequence Eseq is stored in the ERNIE word list; adopting a distance formula to calculate the distance between the generation sequence and the standard sequence as a threshold value, preprocessing the to-be-detected sequence to obtain a to-be-detected input sequence, inputting the to-be-detected input sequence into the generation network to obtain a to-be-detected generation sequence, and comparing the distance between the to-be-detected generation sequence and the standard sequence with the threshold value; if the threshold value is greater than the threshold value, the missing word e |
---|