Short text feature extraction method

The invention discloses a short text feature extraction method that performs feature extraction on a short text based on a knowledge base and a syntactic analysis method. The method comprises a model training process and a feature extraction process. The method comprises: performing training accordi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GUAN PINGYIN, LI FANDING, HE XIAOYU, TONG YUNHAI, LIU WENYI, YE SHAOQIANG
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention discloses a short text feature extraction method that performs feature extraction on a short text based on a knowledge base and a syntactic analysis method. The method comprises a model training process and a feature extraction process. The method comprises: performing training according to training set data; performing validation by using validation set data, and obtaining a weight set W that corresponds to a highest accuracy rate and a training model M that corresponds to the highest accuracy rate; after the feature extraction process performs processing for test set data, assigning the weight set W to each category; mapping the short text in a conceptual space by using an ESA algorithm, thereby obtaining an interpretation vector of the short text; and obtaining a topic vector through LDA, and using the vector as a final feature vector of the short text and a feature of the short text. The method provided by the invention can solve the problem that the short text is sparse in text feature and unclear in theme; and the method can reduce the difficulty in short text feature extraction processing, enhance the result of short text feature extraction, and improve accuracy of text classification.