A related data determination method and device, computer equipment and a storage medium

Bibliographic details
Main authors: LIU JIANHUAN, LI YU
Format: Patent
Language: Chinese; English
Description
Summary: The invention provides a related data determination method and device, computer equipment and a storage medium. The related data determination method comprises the steps of: obtaining a data object set to be analyzed, wherein the data object set comprises a plurality of data objects; calculating data portrait information of each data object; performing clustering analysis on the data object set according to the data portrait information to obtain a plurality of clusters, each cluster comprising a plurality of data objects; calculating a content similarity value between the data objects in the same cluster; calculating a semantic similarity value between the data objects in the same cluster; and, within the same cluster, determining related data according to the content similarity value and the semantic similarity value, wherein the larger the content similarity value and the semantic similarity value of two data objects are, the higher the probability that the two data objects are mutually related data. Acco
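
The steps listed in the summary can be illustrated with a minimal Python sketch. The record does not disclose the concrete algorithms, so everything specific here is an assumption made for illustration: the data portrait is taken to be a small numeric vector, clustering is a simple greedy "leader" scheme with a hypothetical `radius`, content similarity is Jaccard overlap of value sets, semantic similarity is approximated by cosine similarity over name tokens, and the two scores are combined by multiplication with a hypothetical `threshold`. None of these choices should be read as the patented method.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def cluster_by_portrait(objects, radius=0.5):
    """Greedy leader clustering on portrait vectors (assumed stand-in).

    Each object joins the first cluster whose first member (the leader)
    lies within `radius` in Euclidean distance; otherwise it starts a new one.
    """
    clusters = []
    for obj in objects:
        for cluster in clusters:
            leader = cluster[0]
            dist = sqrt(sum((a - b) ** 2
                            for a, b in zip(obj["portrait"], leader["portrait"])))
            if dist <= radius:
                cluster.append(obj)
                break
        else:
            clusters.append([obj])
    return clusters


def content_similarity(a, b):
    """Jaccard overlap of the two objects' value sets (assumed measure)."""
    va, vb = set(a["values"]), set(b["values"])
    union = va | vb
    return len(va & vb) / len(union) if union else 0.0


def semantic_similarity(a, b):
    """Cosine similarity over name tokens, a crude proxy for semantics."""
    ca = Counter(a["name"].lower().split("_"))
    cb = Counter(b["name"].lower().split("_"))
    dot = sum(ca[t] * cb[t] for t in ca)
    norm = (sqrt(sum(v * v for v in ca.values()))
            * sqrt(sum(v * v for v in cb.values())))
    return dot / norm if norm else 0.0


def related_pairs(objects, radius=0.5, threshold=0.25):
    """Score pairs inside each cluster; higher in both measures ranks higher."""
    pairs = []
    for cluster in cluster_by_portrait(objects, radius):
        for a, b in combinations(cluster, 2):
            score = content_similarity(a, b) * semantic_similarity(a, b)
            if score >= threshold:
                pairs.append((a["name"], b["name"], round(score, 3)))
    return sorted(pairs, key=lambda p: -p[2])


# Hypothetical sample data: two similar ID columns and one unrelated column.
objects = [
    {"name": "customer_id", "portrait": [0.9, 0.1], "values": {"c1", "c2", "c3"}},
    {"name": "cust_customer_id", "portrait": [0.8, 0.2], "values": {"c1", "c2", "c4"}},
    {"name": "order_total", "portrait": [0.1, 0.9], "values": {"10", "20"}},
]
print(related_pairs(objects))
```

Restricting the pairwise comparison to objects within the same cluster keeps the number of similarity calculations well below the all-pairs count, which appears to be the purpose of the clustering step. The product used here is only one combination that grows with both similarity values; a weighted sum would satisfy the same monotonic relationship.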