Automatic checking method of similitude file

This invention relates to one automatic detecting similar file method, which can process word break, punctuation mark deletion, filter stopping, word normalizing; then establishing each word of each file one reverse index file; then comparing each files through above steps through reverse index bloc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: YUANXIAN ZENG
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This invention relates to one automatic detecting similar file method, which can process word break, punctuation mark deletion, filter stopping, word normalizing; then establishing each word of each file one reverse index file; then comparing each files through above steps through reverse index block to provide index function to rapidly research each word in file set appearance times and according to similar formula to compute files with other filed similarity.