Method for extracting theme tag from text set and electronic equipment
The invention provides a method for extracting topic tags from a text set and electronic equipment. The method comprises the following steps: converting each text in the text set into a text vector; taking each text vector as a cluster at the bottommost layer, executing hierarchical clustering from...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides a method for extracting topic tags from a text set and electronic equipment. The method comprises the following steps: converting each text in the text set into a text vector; taking each text vector as a cluster at the bottommost layer, executing hierarchical clustering from bottom to top, and determining a topic tag of each layer of cluster; for any word, obtaining a cluster set corresponding to the word according to a cluster containing the word in the topic tag; the cluster set comprises at least one cluster, and each cluster comprises at least one text; according to the cluster sets corresponding to the different words and the keywords to be extracted, finding out a target cluster set mapped by the keywords; and according to the topic label corresponding to each cluster in the target cluster set, obtaining the topic label which is extracted from the text set and is related to the keyword. According to the scheme, extraction of theme tags is simpler and more convenient.
本申请提供一种从文本集中 |
---|