BERT-BiLSTM-CRF dangerous chemical named entity identification method fusing multiple features

The invention discloses a BERT-BiLSTM-CRF dangerous chemical named entity identification method fusing multiple features, and the method comprises the steps: obtaining dangerous chemical data, and carrying out the cleaning and preprocessing: removing noise data, carrying out the word segmentation of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GAO LI, ZHANG HAODONG, WANG LANFANG, SHAO HESHUAI, LI MIN, QIU JUNLIN, SHEN YONGSHAN, SUN KEKE, JIANG XIAOLING
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention discloses a BERT-BiLSTM-CRF dangerous chemical named entity identification method fusing multiple features, and the method comprises the steps: obtaining dangerous chemical data, and carrying out the cleaning and preprocessing: removing noise data, carrying out the word segmentation of the data, and marking a named entity; a 3D-CNN framework is constructed to extract context morphological features, and pinyin, radicals and stroke numbers are introduced as Chinese character features; constructing a BiLSTM model for extracting context semantic features, introducing an attention mechanism, and learning more complex context sequence features; and through the constructed CRF part, the named entity tag sequence is decoded, and the dangerous chemical named entity is identified. According to the method, through the constructed BERT-BiLSTM-CRF model fusing multiple features, the dangerous chemical named entities can be accurately identified, dangerous chemical information can be extracted, and the method