Short text feature extraction method
The invention discloses a short text feature extraction method that performs feature extraction on a short text based on a knowledge base and a syntactic analysis method. The method comprises a model training process and a feature extraction process. The method comprises: performing training accordi...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention discloses a short text feature extraction method that performs feature extraction on a short text based on a knowledge base and a syntactic analysis method. The method comprises a model training process and a feature extraction process. The method comprises: performing training according to training set data; performing validation by using validation set data, and obtaining a weight set W that corresponds to a highest accuracy rate and a training model M that corresponds to the highest accuracy rate; after the feature extraction process performs processing for test set data, assigning the weight set W to each category; mapping the short text in a conceptual space by using an ESA algorithm, thereby obtaining an interpretation vector of the short text; and obtaining a topic vector through LDA, and using the vector as a final feature vector of the short text and a feature of the short text. The method provided by the invention can solve the problem that the short text is sparse in text feature and unclear in theme; and the method can reduce the difficulty in short text feature extraction processing, enhance the result of short text feature extraction, and improve accuracy of text classification. |
---|