A Knowledge-Enriched Ensemble Method for Word Embedding and Multi-Sense Embedding

Bibliographic Details
Published in: IEEE Transactions on Knowledge and Data Engineering, 2023-06, Vol. 35 (6), pp. 5534-5549
Authors: Fang, Lanting; Luo, Yong; Feng, Kaiyu; Zhao, Kaiqi; Hu, Aiqun
Format: Article
Language: English

Abstract: Representing words as embeddings has proven successful in improving performance on many natural language processing tasks. Unlike traditional methods that learn embeddings from large text corpora, ensemble methods have been proposed to leverage the merits of pre-trained word embeddings as well as external semantic sources. In this paper, we propose a knowledge-enriched ensemble method to combine information from both knowledge graphs and pre-trained word embeddings. Specifically, we propose an attention network to retrofit the semantic information in the lexical knowledge graph into the pre-trained word embeddings. In addition, we further extend our method to contextual word embeddings and multi-sense embeddings. Extensive experiments demonstrate that the proposed word embeddings outperform state-of-the-art models in word analogy, word similarity, and several downstream tasks. The proposed word sense embeddings outperform state-of-the-art models in word similarity and word sense induction tasks.
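
The abstract describes retrofitting knowledge-graph information into pre-trained word embeddings via an attention network. The sketch below is only an illustration of that general idea, not the authors' implementation: the function name attention_retrofit, the fixed interpolation weight alpha, and the use of plain dot-product attention over graph neighbours are assumptions introduced here for clarity.

```python
# Minimal sketch (assumptions, not the paper's method): a word's pre-trained
# vector is blended with an attention-weighted average of its lexical
# knowledge-graph neighbours (e.g. synonyms/hypernyms from a resource like WordNet).
import numpy as np

def attention_retrofit(word_vec, neighbor_vecs, alpha=0.5):
    """Blend a pre-trained embedding with its knowledge-graph neighbourhood.

    word_vec      : (d,)   pre-trained embedding of the target word
    neighbor_vecs : (k, d) embeddings of words linked to it in the graph
    alpha         : interpolation weight between corpus and graph signals
    """
    if len(neighbor_vecs) == 0:
        return word_vec
    # Dot-product attention scores between the word and each neighbour.
    scores = neighbor_vecs @ word_vec
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    # Attention-weighted summary of the graph neighbourhood.
    graph_vec = weights @ neighbor_vecs
    # Convex combination of distributional and knowledge-graph information.
    return alpha * word_vec + (1.0 - alpha) * graph_vec

# Toy usage with random vectors standing in for pre-trained embeddings.
rng = np.random.default_rng(0)
w = rng.normal(size=50)
neighbors = rng.normal(size=(4, 50))
enriched = attention_retrofit(w, neighbors)
print(enriched.shape)  # (50,)
```

In the paper the combination is learned by an attention network rather than fixed by a scalar weight, and the same idea is extended to contextual and multi-sense embeddings; this sketch only fixes the intuition of weighting graph neighbours by their relevance to the target word.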
ISSN: 1041-4347, 1558-2191
DOI: 10.1109/TKDE.2022.3159539