A data-centric framework of improving graph neural networks for knowledge graph embedding

Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:World wide web (Bussum) 2025, Vol.28 (1), p.2, Article 2
Hauptverfasser: Cao, Yanan, Lin, Xixun, Wu, Yongxuan, Shi, Fengzhao, Shang, Yanmin, Tan, Qingfeng, Zhou, Chuan, Zhang, Peng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 2
container_title World wide web (Bussum)
container_volume 28
creator Cao, Yanan
Lin, Xixun
Wu, Yongxuan
Shi, Fengzhao
Shang, Yanmin
Tan, Qingfeng
Zhou, Chuan
Zhang, Peng
description Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .
doi_str_mv 10.1007/s11280-024-01320-0
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3133091389</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3133091389</sourcerecordid><originalsourceid>FETCH-LOGICAL-c200t-4d93ec1424661f2373ae5b3aeded5bdf6f921a79bcf472972bcddadedf75b0213</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwFPA8-pMsrtpjqX4DwpeFPQUsptk3ba7qclW8dubugVvXmYe5PdmMo-QS4RrBBA3EZHNIAOWZ4CcJXVEJlgInmGO_DhpPiuTLl5PyVmMKwAoucQJeZtTowed1bYfQltTF3Rnv3xYU-9o222D_2z7hjZBb99pb3dBb1Ib9kSkzge67v3XxprGHhjbVdaY5DknJ05vor049Cl5ubt9Xjxky6f7x8V8mdUMYMhyI7mtMWd5WaJjXHBtiyoVY01RGVc6yVALWdUuF0wKVtXG6PToRFEBQz4lV-Pc9NePnY2DWvld6NNKxZFzkOl0mSg2UnXwMQbr1Da0nQ7fCkHtI1RjhCpFqH4jVJBMfDTFBPeNDX-j_3H9ANwpdYM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3133091389</pqid></control><display><type>article</type><title>A data-centric framework of improving graph neural networks for knowledge graph embedding</title><source>Springer Nature - Complete Springer Journals</source><creator>Cao, Yanan ; Lin, Xixun ; Wu, Yongxuan ; Shi, Fengzhao ; Shang, Yanmin ; Tan, Qingfeng ; Zhou, Chuan ; Zhang, Peng</creator><creatorcontrib>Cao, Yanan ; Lin, Xixun ; Wu, Yongxuan ; Shi, Fengzhao ; Shang, Yanmin ; Tan, Qingfeng ; Zhou, Chuan ; Zhang, Peng</creatorcontrib><description>Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-024-01320-0</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Computer Science ; Database Management ; Embedding ; Empirical analysis ; Graph neural networks ; Graphical representations ; Graphs ; Information Systems Applications (incl.Internet) ; Knowledge representation ; Neural networks ; Nodes ; Operating Systems ; Source code</subject><ispartof>World wide web (Bussum), 2025, Vol.28 (1), p.2, Article 2</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c200t-4d93ec1424661f2373ae5b3aeded5bdf6f921a79bcf472972bcddadedf75b0213</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-024-01320-0$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-024-01320-0$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Cao, Yanan</creatorcontrib><creatorcontrib>Lin, Xixun</creatorcontrib><creatorcontrib>Wu, Yongxuan</creatorcontrib><creatorcontrib>Shi, Fengzhao</creatorcontrib><creatorcontrib>Shang, Yanmin</creatorcontrib><creatorcontrib>Tan, Qingfeng</creatorcontrib><creatorcontrib>Zhou, Chuan</creatorcontrib><creatorcontrib>Zhang, Peng</creatorcontrib><title>A data-centric framework of improving graph neural networks for knowledge graph embedding</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .</description><subject>Computer Science</subject><subject>Database Management</subject><subject>Embedding</subject><subject>Empirical analysis</subject><subject>Graph neural networks</subject><subject>Graphical representations</subject><subject>Graphs</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Knowledge representation</subject><subject>Neural networks</subject><subject>Nodes</subject><subject>Operating Systems</subject><subject>Source code</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LAzEQxYMoWKtfwFPA8-pMsrtpjqX4DwpeFPQUsptk3ba7qclW8dubugVvXmYe5PdmMo-QS4RrBBA3EZHNIAOWZ4CcJXVEJlgInmGO_DhpPiuTLl5PyVmMKwAoucQJeZtTowed1bYfQltTF3Rnv3xYU-9o222D_2z7hjZBb99pb3dBb1Ib9kSkzge67v3XxprGHhjbVdaY5DknJ05vor049Cl5ubt9Xjxky6f7x8V8mdUMYMhyI7mtMWd5WaJjXHBtiyoVY01RGVc6yVALWdUuF0wKVtXG6PToRFEBQz4lV-Pc9NePnY2DWvld6NNKxZFzkOl0mSg2UnXwMQbr1Da0nQ7fCkHtI1RjhCpFqH4jVJBMfDTFBPeNDX-j_3H9ANwpdYM</recordid><startdate>2025</startdate><enddate>2025</enddate><creator>Cao, Yanan</creator><creator>Lin, Xixun</creator><creator>Wu, Yongxuan</creator><creator>Shi, Fengzhao</creator><creator>Shang, Yanmin</creator><creator>Tan, Qingfeng</creator><creator>Zhou, Chuan</creator><creator>Zhang, Peng</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2025</creationdate><title>A data-centric framework of improving graph neural networks for knowledge graph embedding</title><author>Cao, Yanan ; Lin, Xixun ; Wu, Yongxuan ; Shi, Fengzhao ; Shang, Yanmin ; Tan, Qingfeng ; Zhou, Chuan ; Zhang, Peng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c200t-4d93ec1424661f2373ae5b3aeded5bdf6f921a79bcf472972bcddadedf75b0213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computer Science</topic><topic>Database Management</topic><topic>Embedding</topic><topic>Empirical analysis</topic><topic>Graph neural networks</topic><topic>Graphical representations</topic><topic>Graphs</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Knowledge representation</topic><topic>Neural networks</topic><topic>Nodes</topic><topic>Operating Systems</topic><topic>Source code</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Cao, Yanan</creatorcontrib><creatorcontrib>Lin, Xixun</creatorcontrib><creatorcontrib>Wu, Yongxuan</creatorcontrib><creatorcontrib>Shi, Fengzhao</creatorcontrib><creatorcontrib>Shang, Yanmin</creatorcontrib><creatorcontrib>Tan, Qingfeng</creatorcontrib><creatorcontrib>Zhou, Chuan</creatorcontrib><creatorcontrib>Zhang, Peng</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cao, Yanan</au><au>Lin, Xixun</au><au>Wu, Yongxuan</au><au>Shi, Fengzhao</au><au>Shang, Yanmin</au><au>Tan, Qingfeng</au><au>Zhou, Chuan</au><au>Zhang, Peng</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A data-centric framework of improving graph neural networks for knowledge graph embedding</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2025</date><risdate>2025</risdate><volume>28</volume><issue>1</issue><spage>2</spage><pages>2-</pages><artnum>2</artnum><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>Knowledge Graph Embedding (KGE) aims to learn representations of entities and relations of knowledge graph (KG). Recently Graph Neural Networks (GNNs) have gained great success on KGE, but for the reason behind it, most views simply attribute to the well learning of knowledge graph structure, which still remains a limited understanding of the internal mechanism. In this work, we first study a fundamental problem, i.e., what are the important factors for GNNs to help KGE. To investigate this problem, we discuss the core idea of current GNN models for KG, and propose a new assumption of relational homophily that connected nodes possess similar features after relation’s transforming, to explain why aggregating neighbors with relation can help KGE. Based on the model and empirical analyses, we then introduce a novel data-centric framework for applying GNNs to KGE called KSG-GNN . In KSG-GNN, we construct a new graph structure from KG named Knowledge Similarity Graph (KSG), where each node connects with its similar nodes as neighbors, and then we apply GNNs on this graph to perform KGE. Instead of following the relational homophily assumption in KG, KSG aligns with homogeneous graphs that can directly satisfy homophily assumption. Hence, any GNN developed on homogeneous graphs like GCN, GAT, GraphSAGE, etc., can be applied out-of-the-box as KSG-GNN without modification, which provides a more general and effective GNN paradigm. Finally, we conduct extensive experiments on two benchmark datasets, i.e., FB15k-237 and WN18RR, demonstrating the superior performance of KSG-GNN over multiple strong baselines. The source code is available at https://github.com/advancer99/WWWJ-KGE .</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-024-01320-0</doi></addata></record>
fulltext fulltext
identifier ISSN: 1386-145X
ispartof World wide web (Bussum), 2025, Vol.28 (1), p.2, Article 2
issn 1386-145X
1573-1413
language eng
recordid cdi_proquest_journals_3133091389
source Springer Nature - Complete Springer Journals
subjects Computer Science
Database Management
Embedding
Empirical analysis
Graph neural networks
Graphical representations
Graphs
Information Systems Applications (incl.Internet)
Knowledge representation
Neural networks
Nodes
Operating Systems
Source code
title A data-centric framework of improving graph neural networks for knowledge graph embedding
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T11%3A56%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20data-centric%20framework%20of%20improving%20graph%20neural%20networks%20for%20knowledge%20graph%20embedding&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Cao,%20Yanan&rft.date=2025&rft.volume=28&rft.issue=1&rft.spage=2&rft.pages=2-&rft.artnum=2&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-024-01320-0&rft_dat=%3Cproquest_cross%3E3133091389%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3133091389&rft_id=info:pmid/&rfr_iscdi=true