Mass data structuring method and device, computer equipment and memory medium

Bibliographic details
Main authors: JIN XIN, WANG YI, ZHANG CHUAN, HUANG DUXIN
Format: Patent
Language: Chinese; English
Description
Abstract: The embodiment of the invention discloses a mass data structuring method and device, computer equipment, and a memory medium. The method comprises clustering unstructured data to obtain a clustering result corresponding to a preset number of clusters, and assigning each cluster an ID serial number in one-to-one correspondence; obtaining one piece of unstructured data from each cluster of the clustering result and converting it into a regular expression; and converting the unstructured data contained in each cluster according to the regular expression corresponding to that cluster, thereby obtaining structured data. By clustering the mass unstructured data with a clustering algorithm, generating a regular expression for each cluster, and applying that regular expression to all data in the cluster, the method can rapidly convert mass unstructured data into structured data, and a demand of deep learning f...
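
The three steps named in the abstract (cluster, derive one regular expression per cluster from a single sample, apply it to every member) can be illustrated with a minimal Python sketch. The abstract does not name the clustering algorithm or the rule for turning a sample into a regular expression, so both are assumptions here: clustering is a simple farthest-first seeding over `difflib` string similarity, and any whitespace-separated token that contains a digit is treated as a variable field. All function names (`shape`, `distance`, `cluster`, `sample_to_regex`, `structure`) are hypothetical and do not come from the patent.

```python
import re
from difflib import SequenceMatcher


def shape(line):
    # Collapse digit runs so lines that differ only in numbers compare as equal.
    return re.sub(r"\d+", "0", line)


def distance(a, b):
    # Assumed dissimilarity measure: 1 minus difflib's similarity ratio.
    return 1.0 - SequenceMatcher(None, shape(a), shape(b)).ratio()


def cluster(lines, k):
    """Stand-in clusterer: farthest-first seeding, then nearest-seed assignment.

    Returns a dict mapping a cluster ID (0..k-1) to its member lines.
    """
    seeds = [lines[0]]
    while len(seeds) < k:
        # Next seed is the line farthest from all seeds chosen so far.
        seeds.append(max(lines, key=lambda ln: min(distance(ln, s) for s in seeds)))
    clusters = {cid: [] for cid in range(k)}
    for ln in lines:
        cid = min(range(k), key=lambda j: distance(ln, seeds[j]))
        clusters[cid].append(ln)
    return clusters


def sample_to_regex(sample):
    # Assumed rule: tokens containing a digit become capture groups,
    # all other tokens are kept as escaped literals.
    parts = [
        r"(\S+)" if any(ch.isdigit() for ch in tok) else re.escape(tok)
        for tok in sample.split()
    ]
    return re.compile(r"\s+".join(parts))


def structure(lines, k):
    """Cluster the lines, build one regex per cluster, apply it to all members."""
    rows = []
    for cid, members in cluster(lines, k).items():
        if not members:
            continue  # k exceeded the number of distinct line shapes
        pattern = sample_to_regex(members[0])  # one sample per cluster
        for ln in members:
            match = pattern.fullmatch(ln)
            rows.append({
                "cluster_id": cid,
                "fields": match.groups() if match else None,
                "raw": ln,
            })
    return rows


if __name__ == "__main__":
    logs = [
        "2024-01-05 ERROR disk 91 full",
        "2024-01-06 ERROR disk 87 full",
        "user42 logged in from 10.0.0.7",
        "user7 logged in from 10.0.0.9",
    ]
    for row in structure(logs, k=2):
        print(row)
```

The point of the design is that the regular expression is derived once per cluster rather than once per record, so the cost of pattern generation does not grow with the number of records. A line that does not match its cluster's pattern is kept with `fields` set to `None` rather than dropped.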