A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs
Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities base...
Published in: | arXiv.org 2020-07 |
---|---|
Main authors: | Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai |
description | Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained. |
doi_str_mv | 10.48550/arxiv.2003.07743 |
format | Article |
fullrecord | <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2003_07743</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2378452350</sourcerecordid><originalsourceid>FETCH-LOGICAL-a520-9f19e5195e91345624ae24c1db3d0da2675d1a19e264b7bb201187dfaf4b88ce3</originalsourceid><addsrcrecordid>eNotj19LwzAUR4MgOOY-gE8GfG5NbpL-eayjTnHig3svSZN2nW06k1btt7duPl24HH6cg9ANJSFPhCD30v00XyEQwkISx5xdoAUwRoOEA1yhlfcHQghEMQjBFug1ww_GlvtOuo_G1vh9GPWE-wrnnTJaz69ASW80zu3QDBPO2qa2nbEDrnqHX2z_3RpdG7xx8rj31-iykq03q_-7RLvHfLd-CrZvm-d1tg2kABKkFU2NoKkwKWVcRMClAV5SrZgmWs5uQlM5MxBxFSsFhNIk1pWsuEqS0rAluj3PnlqLo2tm_an4ay5OzTNxdyaOrv8cjR-KQz86OzsVwOKEC2CCsF95iljI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2378452350</pqid></control><display><type>article</type><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</creator><creatorcontrib>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</creatorcontrib><description>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. 
We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2003.07743</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Alignment ; Benchmarks ; Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Databases ; Computer Science - Learning ; Datasets ; Embedding ; Evaluation ; Graphs ; Statistics - Machine Learning</subject><ispartof>arXiv.org, 2020-07</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.14778/3407790.3407828$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2003.07743$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Sun, Zequn</creatorcontrib><creatorcontrib>Zhang, Qingheng</creatorcontrib><creatorcontrib>Hu, Wei</creatorcontrib><creatorcontrib>Wang, Chengming</creatorcontrib><creatorcontrib>Chen, Muhao</creatorcontrib><creatorcontrib>Akrami, Farahnaz</creatorcontrib><creatorcontrib>Li, Chengkai</creatorcontrib><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><title>arXiv.org</title><description>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. 
We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</description><subject>Algorithms</subject><subject>Alignment</subject><subject>Benchmarks</subject><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Databases</subject><subject>Computer Science - Learning</subject><subject>Datasets</subject><subject>Embedding</subject><subject>Evaluation</subject><subject>Graphs</subject><subject>Statistics - Machine Learning</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotj19LwzAUR4MgOOY-gE8GfG5NbpL-eayjTnHig3svSZN2nW06k1btt7duPl24HH6cg9ANJSFPhCD30v00XyEQwkISx5xdoAUwRoOEA1yhlfcHQghEMQjBFug1ww_GlvtOuo_G1vh9GPWE-wrnnTJaz69ASW80zu3QDBPO2qa2nbEDrnqHX2z_3RpdG7xx8rj31-iykq03q_-7RLvHfLd-CrZvm-d1tg2kABKkFU2NoKkwKWVcRMClAV5SrZgmWs5uQlM5MxBxFSsFhNIk1pWsuEqS0rAluj3PnlqLo2tm_an4ay5OzTNxdyaOrv8cjR-KQz86OzsVwOKEC2CCsF95iljI</recordid><startdate>20200720</startdate><enddate>20200720</enddate><creator>Sun, Zequn</creator><creator>Zhang, Qingheng</creator><creator>Hu, Wei</creator><creator>Wang, Chengming</creator><creator>Chen, Muhao</creator><creator>Akrami, Farahnaz</creator><creator>Li, Chengkai</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20200720</creationdate><title>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</title><author>Sun, Zequn ; Zhang, Qingheng ; Hu, Wei ; Wang, Chengming ; Chen, Muhao ; Akrami, Farahnaz ; Li, Chengkai</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a520-9f19e5195e91345624ae24c1db3d0da2675d1a19e264b7bb201187dfaf4b88ce3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Alignment</topic><topic>Benchmarks</topic><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Databases</topic><topic>Computer Science - Learning</topic><topic>Datasets</topic><topic>Embedding</topic><topic>Evaluation</topic><topic>Graphs</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Sun, Zequn</creatorcontrib><creatorcontrib>Zhang, Qingheng</creatorcontrib><creatorcontrib>Hu, Wei</creatorcontrib><creatorcontrib>Wang, Chengming</creatorcontrib><creatorcontrib>Chen, Muhao</creatorcontrib><creatorcontrib>Akrami, Farahnaz</creatorcontrib><creatorcontrib>Li, Chengkai</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central 
UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sun, Zequn</au><au>Zhang, Qingheng</au><au>Hu, Wei</au><au>Wang, Chengming</au><au>Chen, Muhao</au><au>Akrami, Farahnaz</au><au>Li, Chengkai</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs</atitle><jtitle>arXiv.org</jtitle><date>2020-07-20</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Entity alignment seeks to find entities in different knowledge graphs (KGs) that refer to the same real-world object. Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings. In this paper, we conduct a comprehensive experimental study of this emerging field. We survey 23 recent embedding-based entity alignment approaches and categorize them based on their techniques and characteristics. 
We also propose a new KG sampling algorithm, with which we generate a set of dedicated benchmark datasets with various heterogeneity and distributions for a realistic evaluation. We develop an open-source library including 12 representative embedding-based entity alignment approaches, and extensively evaluate these approaches, to understand their strengths and limitations. Additionally, for several directions that have not been explored in current approaches, we perform exploratory experiments and report our preliminary findings for future studies. The benchmark datasets, open-source library and experimental results are all accessible online and will be duly maintained.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2003.07743</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2020-07 |
issn | 2331-8422 |
language | eng |
recordid | cdi_arxiv_primary_2003_07743 |
source | arXiv.org; Free E- Journals |
subjects | Algorithms ; Alignment ; Benchmarks ; Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Databases ; Computer Science - Learning ; Datasets ; Embedding ; Evaluation ; Graphs ; Statistics - Machine Learning |
title | A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T00%3A04%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Benchmarking%20Study%20of%20Embedding-based%20Entity%20Alignment%20for%20Knowledge%20Graphs&rft.jtitle=arXiv.org&rft.au=Sun,%20Zequn&rft.date=2020-07-20&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2003.07743&rft_dat=%3Cproquest_arxiv%3E2378452350%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2378452350&rft_id=info:pmid/&rfr_iscdi=true |