Data annotation method and device for HTML document in engineering field and electronic equipment

Bibliographic details
Main authors: ZHAO ERHUA, HU SHUANGYIN, LUO FENG, HE DAN, ZHANG SEN, HE RUIJING, WANG HONGLIAN, TAN ZHUO, TONG YAMEI, XINFUYAN QIMEI, HUANG XUETAO, LAI XINGYU, ZHANG QIN, GUO HONGKE, CHENG SHUWEI
Format: Patent
Language: Chinese; English
Description
Summary: The invention provides a data annotation method and device for HTML documents in the engineering field, together with electronic equipment, and relates to the technical field of data processing. The method comprises: obtaining the data to be labeled; performing named entity recognition tagging on each preset target phrase of a target project contained in that data, to obtain the entity tag corresponding to the phrase; calculating, from the entity tag, the offset of the corresponding preset target phrase; and performing relation matching on the preset target phrases according to the offsets and the contextual semantic relations of the data to be labeled, to obtain a relation-matching labeling result. The device is used for executing the method. By automatically identifying the entities and calculating the relationship between the entities, a more accurate entity and entity relationship extraction method for the engineering field is o
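The three steps named in the summary (entity tagging of preset phrases, offset calculation, relation matching) can be illustrated with a minimal Python sketch. It is not the patented implementation: the record does not say how entities are recognized or how context is scored, so exact string matching stands in for named entity recognition and a simple character-distance threshold stands in for the contextual semantic relation. The names `TextExtractor`, `tag_entities`, `match_relations`, `max_gap`, the labels, and the sample HTML are all hypothetical.

```python
import re
from html.parser import HTMLParser
from itertools import combinations


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document into one string."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)

    def text(self):
        return "".join(self.parts)


def tag_entities(text, phrases):
    """Return (label, phrase, start, end) for each occurrence of a preset phrase.

    Assumption: exact string matching replaces a real NER model.
    start/end are character offsets into the extracted text.
    """
    entities = []
    for phrase, label in phrases.items():
        for m in re.finditer(re.escape(phrase), text):
            entities.append((label, phrase, m.start(), m.end()))
    return sorted(entities, key=lambda e: e[2])


def match_relations(entities, max_gap=40):
    """Pair entities with different labels that lie close together.

    Assumption: proximity (gap between the end of one entity and the start
    of the next, in characters) replaces the contextual semantic relation.
    """
    relations = []
    for a, b in combinations(entities, 2):  # a always starts before b
        gap = b[2] - a[3]
        if a[0] != b[0] and 0 <= gap <= max_gap:
            relations.append((a[1], b[1], gap))
    return relations


# Hypothetical input data and preset target phrases.
html = "<p>The <b>pump station</b> is supplied by the <i>transformer</i>.</p>"
phrases = {"pump station": "EQUIPMENT", "transformer": "POWER_SOURCE"}

extractor = TextExtractor()
extractor.feed(html)
extractor.close()

entities = tag_entities(extractor.text(), phrases)
relations = match_relations(entities)
print(entities)
print(relations)
```

Offsets here are measured against the text with markup stripped; a tool that has to write annotations back into the HTML would also need a mapping from text offsets to positions in the original document, which this sketch does not attempt.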