Code text information-containing repeated post labeling method, storage medium and electronic equipment

According to the duplicate post labeling method containing the code text information, the storage medium and the electronic equipment, data label balance is guaranteed, the manual labeling cost is greatly reduced, the labeling efficiency is improved, and label distribution is balanced; the data set...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: WANG XUXIN, CUI CAN, SUN XIA, GUO XIANGQIANG, YANG MINGYUAN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:According to the duplicate post labeling method containing the code text information, the storage medium and the electronic equipment, data label balance is guaranteed, the manual labeling cost is greatly reduced, the labeling efficiency is improved, and label distribution is balanced; the data set established through the method is balanced in label distribution, less in labeling time and high in data set quality. 本发明的含代码文本信息的重复帖子标注方法、存储介质及电子设备,保证数据标签均衡,极大减少人工标注成本,提高标注效率且平衡标签分布;经过该方法构建的数据集标签分布均衡,标注时间花费较少,数据集质量优质。