A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs

Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities base...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2020-07
Hauptverfasser: Sun, Zequn, Zhang, Qingheng, Hu, Wei, Wang, Chengming, Chen, Muhao, Akrami, Farahnaz, Li, Chengkai
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Sun, Zequn
Zhang, Qingheng
Hu, Wei
Wang, Chengming
Chen, Muhao
Akrami, Farahnaz
Li, Chengkai
description Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.
doi_str_mv 10.48550/arxiv.2003.07743
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2003_07743</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2378452350</sourcerecordid><originalsourceid>FETCH-LOGICAL-a520-9f19e5195e91345624ae24c1db3d0da2675d1a19e264b7bb201187dfaf4b88ce3</originalsourceid><addsrcrecordid>eNotj19LwzAUR4MgOOY-gE8GfG5NbpL-eayjTnHig3svSZN2nW06k1btt7duPl24HH6cg9ANJSFPhCD30v00XyEQwkISx5xdoAUwRoOEA1yhlfcHQghEMQjBFug1ww_GlvtOuo_G1vh9GPWE-wrnnTJaz69ASW80zu3QDBPO2qa2nbEDrnqHX2z_3RpdG7xx8rj31-iykq03q_-7RLvHfLd-CrZvm-d1tg2kABKkFU2NoKkwKWVcRMClAV5SrZgmWs5uQlM5MxBxFSsFhNIk1pWsuEqS0rAluj3PnlqLo2tm_an4ay5OzTNxdyaOrv8cjR-KQz86OzsVwOKEC2CCsF95iljI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2378452350</pqid></control><display><type>article</type><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</creator><creatorcontrib>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</creatorcontrib><description>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2003.07743</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Alignment ; Benchmarks ; Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Databases ; Computer Science - Learning ; Datasets ; Embedding ; Evaluation ; Graphs ; Statistics - Machine Learning</subject><ispartof>arXiv.org, 2020-07</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.14778/3407790.3407828$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2003.07743$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Sun, Zequn</creatorcontrib><creatorcontrib>Zhang, Qingheng</creatorcontrib><creatorcontrib>Hu, Wei</creatorcontrib><creatorcontrib>Wang, Chengming</creatorcontrib><creatorcontrib>Chen, Muhao</creatorcontrib><creatorcontrib>Akrami, Farahnaz</creatorcontrib><creatorcontrib>Li, Chengkai</creatorcontrib><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><title>arXiv.org</title><description>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Benchmarks</subject><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Databases</subject><subject>Computer Science - Learning</subject><subject>Datasets</subject><subject>Embedding</subject><subject>Evaluation</subject><subject>Graphs</subject><subject>Statistics - Machine Learning</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotj19LwzAUR4MgOOY-gE8GfG5NbpL-eayjTnHig3svSZN2nW06k1btt7duPl24HH6cg9ANJSFPhCD30v00XyEQwkISx5xdoAUwRoOEA1yhlfcHQghEMQjBFug1ww_GlvtOuo_G1vh9GPWE-wrnnTJaz69ASW80zu3QDBPO2qa2nbEDrnqHX2z_3RpdG7xx8rj31-iykq03q_-7RLvHfLd-CrZvm-d1tg2kABKkFU2NoKkwKWVcRMClAV5SrZgmWs5uQlM5MxBxFSsFhNIk1pWsuEqS0rAluj3PnlqLo2tm_an4ay5OzTNxdyaOrv8cjR-KQz86OzsVwOKEC2CCsF95iljI</recordid><startdate>20200720</startdate><enddate>20200720</enddate><creator>Sun, Zequn</creator><creator>Zhang, Qingheng</creator><creator>Hu, Wei</creator><creator>Wang, Chengming</creator><creator>Chen, Muhao</creator><creator>Akrami, Farahnaz</creator><creator>Li, Chengkai</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20200720</creationdate><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><author>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a520-9f19e5195e91345624ae24c1db3d0da2675d1a19e264b7bb201187dfaf4b88ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Benchmarks</topic><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Databases</topic><topic>Computer Science - Learning</topic><topic>Datasets</topic><topic>Embedding</topic><topic>Evaluation</topic><topic>Graphs</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Sun, Zequn</creatorcontrib><creatorcontrib>Zhang, Qingheng</creatorcontrib><creatorcontrib>Hu, Wei</creatorcontrib><creatorcontrib>Wang, Chengming</creatorcontrib><creatorcontrib>Chen, Muhao</creatorcontrib><creatorcontrib>Akrami, Farahnaz</creatorcontrib><creatorcontrib>Li, Chengkai</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sun, Zequn</au><au>Zhang, Qingheng</au><au>Hu, Wei</au><au>Wang, Chengming</au><au>Chen, Muhao</au><au>Akrami, Farahnaz</au><au>Li, Chengkai</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</atitle><jtitle>arXiv.org</jtitle><date>2020-07-20</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2003.07743</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2020-07
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2003_07743
source arXiv.org; Free E- Journals
subjects Algorithms
Alignment
Benchmarks
Computer Science - Artificial Intelligence
Computer Science - Computation and Language
Computer Science - Databases
Computer Science - Learning
Datasets
Embedding
Evaluation
Graphs
Statistics - Machine Learning
title A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T00%3A04%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Benchmarking%20Study%20of%20Embedding-based%20Entity%20Alignment%20for%20Knowledge%20Graphs&rft.jtitle=arXiv.org&rft.au=Sun,%20Zequn&rft.date=2020-07-20&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2003.07743&rft_dat=%3Cproquest_arxiv%3E2378452350%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2378452350&rft_id=info:pmid/&rfr_iscdi=true