Data annotation method and device for HTML document in engineering field and electronic equipment
The invention provides a data annotation method and device for an HTML document in the engineering field and electronic equipment, and relates to the technical field of data processing. The method comprises the steps of obtaining to-be-labeled data, performing named entity identification marking on...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , , , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides a data annotation method and device for an HTML document in the engineering field and electronic equipment, and relates to the technical field of data processing. The method comprises the steps of obtaining to-be-labeled data, performing named entity identification marking on a preset target phrase of a target project contained in the to-be-labeled data, and obtaining an entity mark corresponding to the preset target phrase; performing offset calculation according to the entity mark to obtain an offset corresponding to the corresponding preset target phrase; and performing relation matching on the preset target phrase according to the offset and the context semantic relation of the to-be-labeled data to obtain a labeling result of the relation matching. The device is used for executing the method. By automatically identifying the entities and calculating the relationship between the entities, a more accurate entity and entity relationship extraction method for the engineering field is o |
---|