Document analysis system, document analysis method, document analysis program and recording medium

As a document analysis system to calculate a similarity degree between texts with high accuracy, an information processing device includes: a common character string calculation unit to extract character strings that are common between two texts and to determine whether or not the two texts are to b...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Kuroiwa Yukiko
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:As a document analysis system to calculate a similarity degree between texts with high accuracy, an information processing device includes: a common character string calculation unit to extract character strings that are common between two texts and to determine whether or not the two texts are to be set as calculation objects based on a number of the extracted character strings that are common; and a similarity degree calculation unit to calculate, when the two texts are the determined calculation objects, a similarity degree therebetween by using an approximation of a Kolmogorov complexity, and when the two texts are not the calculation objects, handling the similarity degree between the two texts as being dissimilar.