Text classification method for open network questions in specific field

The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: YU RICHANG, YANG HUI, LI RONGSHENG, LIU WANGYANG, ZHANG BAIJIA, HUANG SHAOBIN, SHEN LINSHAN, LI YI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention belongs to the technical field of text classification processing, and particularly relates to a text classification method for open network questions in a specific field. According to the method, the problems of lack of enough available corpus sets with category marks, low network text information amount and high noise under the condition of executing network open text classificationtasks in certain specific fields are solved, and a new method is provided for hierarchical classification of open network questions in the fields. According to the method, open network questions andwritten texts in a specific domain are utilized to enable word embedding representation in the domain to better conform to domain knowledge features, and meanwhile, a semi-supervised method is used for accelerating classification model training and reducing required marked samples; and in addition, category classification at a multi-granularity level is realized in combination with conditional probability. The method can a