A data-centric framework of improving graph neural networks for knowledge graph embedding

Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	World wide web (Bussum) 2025, Vol.28 (1), p.2, Article 2
Hauptverfasser:	Cao, Yanan, Lin, Xixun, Wu, Yongxuan, Shi, Fengzhao, Shang, Yanmin, Tan, Qingfeng, Zhou, Chuan, Zhang, Peng
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science Database Management Embedding Empirical analysis Graph neural networks Graphical representations Graphs Information Systems Applications (incl.Internet) Knowledge representation Neural networks Nodes Operating Systems Source code
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	1
container_start_page	2
container_title	World wide web (Bussum)
container_volume	28
creator	Cao, Yanan Lin, Xixun Wu, Yongxuan Shi, Fengzhao Shang, Yanmin Tan, Qingfeng Zhou, Chuan Zhang, Peng
description	Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .
doi_str_mv	10.1007/s11280-024-01320-0
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3133091389</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3133091389</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-4d93ec1424661f2373ae5b3aeded5bdf6f921a79bcf472972bcddadedf75b0213</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwFPA8-pMsrtpjqX4DwpeFPQUsptk3ba7qclW8dubugVvXmYe5PdmMo-QS4RrBBA3EZHNIAOWZ4CcJXVEJlgInmGO_DhpPiuTLl5PyVmMKwAoucQJeZtTowed1bYfQltTF3Rnv3xYU-9o222D_2z7hjZBb99pb3dBb1Ib9kSkzge67v3XxprGHhjbVdaY5DknJ05vor049Cl5ubt9Xjxky6f7x8V8mdUMYMhyI7mtMWd5WaJjXHBtiyoVY01RGVc6yVALWdUuF0wKVtXG6PToRFEBQz4lV-Pc9NePnY2DWvld6NNKxZFzkOl0mSg2UnXwMQbr1Da0nQ7fCkHtI1RjhCpFqH4jVJBMfDTFBPeNDX-j_3H9ANwpdYM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3133091389</pqid></control><display><type>article</type><title>A data-centric framework of improving graph neural networks for knowledge graph embedding</title><source>Springer Nature - Complete Springer Journals</source><creator>Cao, Yanan ; Lin, Xixun ; Wu, Yongxuan ; Shi, Fengzhao ; Shang, Yanmin ; Tan, Qingfeng ; Zhou, Chuan ; Zhang, Peng</creator><creatorcontrib>Cao, Yanan ; Lin, Xixun ; Wu, Yongxuan ; Shi, Fengzhao ; Shang, Yanmin ; Tan, Qingfeng ; Zhou, Chuan ; Zhang, Peng</creatorcontrib><description>Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-024-01320-0</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Computer Science ; Database Management ; Embedding ; Empirical analysis ; Graph neural networks ; Graphical representations ; Graphs ; Information Systems Applications (incl.Internet) ; Knowledge representation ; Neural networks ; Nodes ; Operating Systems ; Source code</subject><ispartof>World wide web (Bussum), 2025, Vol.28 (1), p.2, Article 2</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-4d93ec1424661f2373ae5b3aeded5bdf6f921a79bcf472972bcddadedf75b0213</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-024-01320-0$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-024-01320-0$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Cao, Yanan</creatorcontrib><creatorcontrib>Lin, Xixun</creatorcontrib><creatorcontrib>Wu, Yongxuan</creatorcontrib><creatorcontrib>Shi, Fengzhao</creatorcontrib><creatorcontrib>Shang, Yanmin</creatorcontrib><creatorcontrib>Tan, Qingfeng</creatorcontrib><creatorcontrib>Zhou, Chuan</creatorcontrib><creatorcontrib>Zhang, Peng</creatorcontrib><title>A data-centric framework of improving graph neural networks for knowledge graph embedding</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .</description><subject>Computer Science</subject><subject>Database Management</subject><subject>Embedding</subject><subject>Empirical analysis</subject><subject>Graph neural networks</subject><subject>Graphical representations</subject><subject>Graphs</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Knowledge representation</subject><subject>Neural networks</subject><subject>Nodes</subject><subject>Operating Systems</subject><subject>Source code</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LAzEQxYMoWKtfwFPA8-pMsrtpjqX4DwpeFPQUsptk3ba7qclW8dubugVvXmYe5PdmMo-QS4RrBBA3EZHNIAOWZ4CcJXVEJlgInmGO_DhpPiuTLl5PyVmMKwAoucQJeZtTowed1bYfQltTF3Rnv3xYU-9o222D_2z7hjZBb99pb3dBb1Ib9kSkzge67v3XxprGHhjbVdaY5DknJ05vor049Cl5ubt9Xjxky6f7x8V8mdUMYMhyI7mtMWd5WaJjXHBtiyoVY01RGVc6yVALWdUuF0wKVtXG6PToRFEBQz4lV-Pc9NePnY2DWvld6NNKxZFzkOl0mSg2UnXwMQbr1Da0nQ7fCkHtI1RjhCpFqH4jVJBMfDTFBPeNDX-j_3H9ANwpdYM</recordid><startdate>2025</startdate><enddate>2025</enddate><creator>Cao, Yanan</creator><creator>Lin, Xixun</creator><creator>Wu, Yongxuan</creator><creator>Shi, Fengzhao</creator><creator>Shang, Yanmin</creator><creator>Tan, Qingfeng</creator><creator>Zhou, Chuan</creator><creator>Zhang, Peng</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2025</creationdate><title>A data-centric framework of improving graph neural networks for knowledge graph embedding</title><author>Cao, Yanan ; Lin, Xixun ; Wu, Yongxuan ; Shi, Fengzhao ; Shang, Yanmin ; Tan, Qingfeng ; Zhou, Chuan ; Zhang, Peng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-4d93ec1424661f2373ae5b3aeded5bdf6f921a79bcf472972bcddadedf75b0213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computer Science</topic><topic>Database Management</topic><topic>Embedding</topic><topic>Empirical analysis</topic><topic>Graph neural networks</topic><topic>Graphical representations</topic><topic>Graphs</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Knowledge representation</topic><topic>Neural networks</topic><topic>Nodes</topic><topic>Operating Systems</topic><topic>Source code</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cao, Yanan</creatorcontrib><creatorcontrib>Lin, Xixun</creatorcontrib><creatorcontrib>Wu, Yongxuan</creatorcontrib><creatorcontrib>Shi, Fengzhao</creatorcontrib><creatorcontrib>Shang, Yanmin</creatorcontrib><creatorcontrib>Tan, Qingfeng</creatorcontrib><creatorcontrib>Zhou, Chuan</creatorcontrib><creatorcontrib>Zhang, Peng</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cao, Yanan</au><au>Lin, Xixun</au><au>Wu, Yongxuan</au><au>Shi, Fengzhao</au><au>Shang, Yanmin</au><au>Tan, Qingfeng</au><au>Zhou, Chuan</au><au>Zhang, Peng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A data-centric framework of improving graph neural networks for knowledge graph embedding</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2025</date><risdate>2025</risdate><volume>28</volume><issue>1</issue><spage>2</spage><pages>2-</pages><artnum>2</artnum><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-024-01320-0</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 1386-145X
ispartof	World wide web (Bussum), 2025, Vol.28 (1), p.2, Article 2
issn	1386-145X 1573-1413
language	eng
recordid	cdi_proquest_journals_3133091389
source	Springer Nature - Complete Springer Journals
subjects	Computer Science Database Management Embedding Empirical analysis Graph neural networks Graphical representations Graphs Information Systems Applications (incl.Internet) Knowledge representation Neural networks Nodes Operating Systems Source code
title	A data-centric framework of improving graph neural networks for knowledge graph embedding
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T11%3A56%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20data-centric%20framework%20of%20improving%20graph%20neural%20networks%20for%20knowledge%20graph%20embedding&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Cao,%20Yanan&rft.date=2025&rft.volume=28&rft.issue=1&rft.spage=2&rft.pages=2-&rft.artnum=2&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-024-01320-0&rft_dat=%3Cproquest_cross%3E3133091389%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3133091389&rft_id=info:pmid/&rfr_iscdi=true