Data clustering method and system, electronic equipment and storage medium

The invention relates to the technical field of computers, and provides a data clustering method and system, electronic equipment and a storage medium, and the method comprises the steps: initializing an original corpus based on a clustering center point in combination with user features, and obtain...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: OU YANGYANG, CHEN KAI, FU HAO, ZHANG WEI, LIU LIEMING, WEI DONGYUE
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention relates to the technical field of computers, and provides a data clustering method and system, electronic equipment and a storage medium, and the method comprises the steps: initializing an original corpus based on a clustering center point in combination with user features, and obtaining a first target corpus; calculating a similarity matrix matched with the word frequency inverse document matrix according to segmented words adopted by word frequency statistics in the process of constructing the word frequency inverse document matrix; inputting the similarity matrix into a first target corpus, calculating vector cosine similarities of each query lexical item and all non-query lexical items, and performing descending sort to obtain a recommendation result of the expansion word; and combining the recommendation result and the user information to obtain a sorting result, and performing clustering updating on the first target corpus based on the sorting result to obtain a second target corpus. Acco