A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs

Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities base...

Detailed description

Bibliographic details
Published in:	arXiv.org 2020-07
Main authors:	Sun, Zequn; Zhang, Qingheng; Hu, Wei; Wang, Chengming; Chen, Muhao; Akrami, Farahnaz; Li, Chengkai
Format:	Article
Language:	eng
Subjects:	Algorithms; Alignment; Benchmarks; Computer Science - Artificial Intelligence; Computer Science - Computation and Language; Computer Science - Databases; Computer Science - Learning; Datasets; Embedding; Evaluation; Graphs; Statistics - Machine Learning
Online access:	Full text

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Sun, Zequn; Zhang, Qingheng; Hu, Wei; Wang, Chengming; Chen, Muhao; Akrami, Farahnaz; Li, Chengkai
description	Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.
doi_str_mv	10.48550/arxiv.2003.07743
format	Article
fullrecord	<record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2003_07743</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2378452350</sourcerecordid><originalsourceid>FETCH-LOGICAL-a520-9f19e5195e91345624ae24c1db3d0da2675d1a19e264b7bb201187dfaf4b88ce3</originalsourceid><addsrcrecordid>eNotj19LwzAUR4MgOOY-gE8GfG5NbpL-eayjTnHig3svSZN2nW06k1btt7duPl24HH6cg9ANJSFPhCD30v00XyEQwkISx5xdoAUwRoOEA1yhlfcHQghEMQjBFug1ww_GlvtOuo_G1vh9GPWE-wrnnTJaz69ASW80zu3QDBPO2qa2nbEDrnqHX2z_3RpdG7xx8rj31-iykq03q_-7RLvHfLd-CrZvm-d1tg2kABKkFU2NoKkwKWVcRMClAV5SrZgmWs5uQlM5MxBxFSsFhNIk1pWsuEqS0rAluj3PnlqLo2tm_an4ay5OzTNxdyaOrv8cjR-KQz86OzsVwOKEC2CCsF95iljI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2378452350</pqid></control><display><type>article</type><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</creator><creatorcontrib>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</creatorcontrib><description>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. 
We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2003.07743</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Alignment ; Benchmarks ; Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Databases ; Computer Science - Learning ; Datasets ; Embedding ; Evaluation ; Graphs ; Statistics - Machine Learning</subject><ispartof>arXiv.org, 2020-07</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.14778/3407790.3407828$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2003.07743$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Sun, Zequn</creatorcontrib><creatorcontrib>Zhang, Qingheng</creatorcontrib><creatorcontrib>Hu, Wei</creatorcontrib><creatorcontrib>Wang, Chengming</creatorcontrib><creatorcontrib>Chen, Muhao</creatorcontrib><creatorcontrib>Akrami, Farahnaz</creatorcontrib><creatorcontrib>Li, Chengkai</creatorcontrib><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><title>arXiv.org</title><description>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. 
We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Benchmarks</subject><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Databases</subject><subject>Computer Science - Learning</subject><subject>Datasets</subject><subject>Embedding</subject><subject>Evaluation</subject><subject>Graphs</subject><subject>Statistics - Machine Learning</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotj19LwzAUR4MgOOY-gE8GfG5NbpL-eayjTnHig3svSZN2nW06k1btt7duPl24HH6cg9ANJSFPhCD30v00XyEQwkISx5xdoAUwRoOEA1yhlfcHQghEMQjBFug1ww_GlvtOuo_G1vh9GPWE-wrnnTJaz69ASW80zu3QDBPO2qa2nbEDrnqHX2z_3RpdG7xx8rj31-iykq03q_-7RLvHfLd-CrZvm-d1tg2kABKkFU2NoKkwKWVcRMClAV5SrZgmWs5uQlM5MxBxFSsFhNIk1pWsuEqS0rAluj3PnlqLo2tm_an4ay5OzTNxdyaOrv8cjR-KQz86OzsVwOKEC2CCsF95iljI</recordid><startdate>20200720</startdate><enddate>20200720</enddate><creator>Sun, Zequn</creator><creator>Zhang, Qingheng</creator><creator>Hu, Wei</creator><creator>Wang, Chengming</creator><creator>Chen, Muhao</creator><creator>Akrami, Farahnaz</creator><creator>Li, Chengkai</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20200720</creationdate><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><author>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a520-9f19e5195e91345624ae24c1db3d0da2675d1a19e264b7bb201187dfaf4b88ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Benchmarks</topic><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Databases</topic><topic>Computer Science - Learning</topic><topic>Datasets</topic><topic>Embedding</topic><topic>Evaluation</topic><topic>Graphs</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Sun, Zequn</creatorcontrib><creatorcontrib>Zhang, Qingheng</creatorcontrib><creatorcontrib>Hu, Wei</creatorcontrib><creatorcontrib>Wang, Chengming</creatorcontrib><creatorcontrib>Chen, Muhao</creatorcontrib><creatorcontrib>Akrami, Farahnaz</creatorcontrib><creatorcontrib>Li, Chengkai</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central 
UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sun, Zequn</au><au>Zhang, Qingheng</au><au>Hu, Wei</au><au>Wang, Chengming</au><au>Chen, Muhao</au><au>Akrami, Farahnaz</au><au>Li, Chengkai</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</atitle><jtitle>arXiv.org</jtitle><date>2020-07-20</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. 
We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2003.07743</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2020-07
issn	2331-8422
language	eng
recordid	cdi_arxiv_primary_2003_07743
source	arXiv.org; Free E- Journals
subjects	Algorithms; Alignment; Benchmarks; Computer Science - Artificial Intelligence; Computer Science - Computation and Language; Computer Science - Databases; Computer Science - Learning; Datasets; Embedding; Evaluation; Graphs; Statistics - Machine Learning
title	A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T00%3A04%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Benchmarking%20Study%20of%20Embedding-based%20Entity%20Alignment%20for%20Knowledge%20Graphs&rft.jtitle=arXiv.org&rft.au=Sun,%20Zequn&rft.date=2020-07-20&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2003.07743&rft_dat=%3Cproquest_arxiv%3E2378452350%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2378452350&rft_id=info:pmid/&rfr_iscdi=true