ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget

Entity Linking (EL) and Relation Extraction (RE) are fundamental tasks in Natural Language Processing, serving as critical components in a wide range of applications. In this paper, we propose ReLiK, a Retriever-Reader architecture for both EL and RE, where, given an input text, the Retriever module...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Orlando, Riccardo, Cabot, Pere-Lluis Huguet, Barba, Edoardo, Navigli, Roberto
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Orlando, Riccardo
Cabot, Pere-Lluis Huguet
Barba, Edoardo
Navigli, Roberto
description Entity Linking (EL) and Relation Extraction (RE) are fundamental tasks in Natural Language Processing, serving as critical components in a wide range of applications. In this paper, we propose ReLiK, a Retriever-Reader architecture for both EL and RE, where, given an input text, the Retriever module undertakes the identification of candidate entities or relations that could potentially appear within the text. Subsequently, the Reader module is tasked to discern the pertinent retrieved entities or relations and establish their alignment with the corresponding textual spans. Notably, we put forward an innovative input representation that incorporates the candidate entities or relations alongside the text, making it possible to link entities or extract relations in a single forward pass and to fully leverage pre-trained language models contextualization capabilities, in contrast with previous Retriever-Reader-based methods, which require a forward pass for each candidate. Our formulation of EL and RE achieves state-of-the-art performance in both in-domain and out-of-domain benchmarks while using academic budget training and with up to 40x inference speed compared to competitors. Finally, we show how our architecture can be used seamlessly for Information Extraction (cIE), i.e. EL + RE, and setting a new state of the art by employing a shared Reader that simultaneously extracts entities and relations.
doi_str_mv 10.48550/arxiv.2408.00103
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2408_00103</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2408_00103</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2408_001033</originalsourceid><addsrcrecordid>eNqFjsEKgkAQhvfSIaoH6NQ-QNmWCtKtQgn0JN1l0MmGdIt1FH37dOke_DD_8M3AJ8T6oBwv8H21B9NT5xw9FThKHZQ7F88UE4pPMkU2hB1K0IVMSMdbGUHDdj3neWuAUYaaiYcJv0iXlqVYAdNby7BnA7mtY0CPX1BgTbm8tEWJvBSzB1QNrn5zITZReL_edtYp-xiqwQzZ5JZZN_f_xRedFUOI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget</title><source>arXiv.org</source><creator>Orlando, Riccardo ; Cabot, Pere-Lluis Huguet ; Barba, Edoardo ; Navigli, Roberto</creator><creatorcontrib>Orlando, Riccardo ; Cabot, Pere-Lluis Huguet ; Barba, Edoardo ; Navigli, Roberto</creatorcontrib><description>Entity Linking (EL) and Relation Extraction (RE) are fundamental tasks in Natural Language Processing, serving as critical components in a wide range of applications. In this paper, we propose ReLiK, a Retriever-Reader architecture for both EL and RE, where, given an input text, the Retriever module undertakes the identification of candidate entities or relations that could potentially appear within the text. Subsequently, the Reader module is tasked to discern the pertinent retrieved entities or relations and establish their alignment with the corresponding textual spans. Notably, we put forward an innovative input representation that incorporates the candidate entities or relations alongside the text, making it possible to link entities or extract relations in a single forward pass and to fully leverage pre-trained language models contextualization capabilities, in contrast with previous Retriever-Reader-based methods, which require a forward pass for each candidate. Our formulation of EL and RE achieves state-of-the-art performance in both in-domain and out-of-domain benchmarks while using academic budget training and with up to 40x inference speed compared to competitors. Finally, we show how our architecture can be used seamlessly for Information Extraction (cIE), i.e. EL + RE, and setting a new state of the art by employing a shared Reader that simultaneously extracts entities and relations.</description><identifier>DOI: 10.48550/arxiv.2408.00103</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computation and Language</subject><creationdate>2024-07</creationdate><rights>http://creativecommons.org/licenses/by-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2408.00103$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2408.00103$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Orlando, Riccardo</creatorcontrib><creatorcontrib>Cabot, Pere-Lluis Huguet</creatorcontrib><creatorcontrib>Barba, Edoardo</creatorcontrib><creatorcontrib>Navigli, Roberto</creatorcontrib><title>ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget</title><description>Entity Linking (EL) and Relation Extraction (RE) are fundamental tasks in Natural Language Processing, serving as critical components in a wide range of applications. In this paper, we propose ReLiK, a Retriever-Reader architecture for both EL and RE, where, given an input text, the Retriever module undertakes the identification of candidate entities or relations that could potentially appear within the text. Subsequently, the Reader module is tasked to discern the pertinent retrieved entities or relations and establish their alignment with the corresponding textual spans. Notably, we put forward an innovative input representation that incorporates the candidate entities or relations alongside the text, making it possible to link entities or extract relations in a single forward pass and to fully leverage pre-trained language models contextualization capabilities, in contrast with previous Retriever-Reader-based methods, which require a forward pass for each candidate. Our formulation of EL and RE achieves state-of-the-art performance in both in-domain and out-of-domain benchmarks while using academic budget training and with up to 40x inference speed compared to competitors. Finally, we show how our architecture can be used seamlessly for Information Extraction (cIE), i.e. EL + RE, and setting a new state of the art by employing a shared Reader that simultaneously extracts entities and relations.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjsEKgkAQhvfSIaoH6NQ-QNmWCtKtQgn0JN1l0MmGdIt1FH37dOke_DD_8M3AJ8T6oBwv8H21B9NT5xw9FThKHZQ7F88UE4pPMkU2hB1K0IVMSMdbGUHDdj3neWuAUYaaiYcJv0iXlqVYAdNby7BnA7mtY0CPX1BgTbm8tEWJvBSzB1QNrn5zITZReL_edtYp-xiqwQzZ5JZZN_f_xRedFUOI</recordid><startdate>20240731</startdate><enddate>20240731</enddate><creator>Orlando, Riccardo</creator><creator>Cabot, Pere-Lluis Huguet</creator><creator>Barba, Edoardo</creator><creator>Navigli, Roberto</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240731</creationdate><title>ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget</title><author>Orlando, Riccardo ; Cabot, Pere-Lluis Huguet ; Barba, Edoardo ; Navigli, Roberto</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2408_001033</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Orlando, Riccardo</creatorcontrib><creatorcontrib>Cabot, Pere-Lluis Huguet</creatorcontrib><creatorcontrib>Barba, Edoardo</creatorcontrib><creatorcontrib>Navigli, Roberto</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Orlando, Riccardo</au><au>Cabot, Pere-Lluis Huguet</au><au>Barba, Edoardo</au><au>Navigli, Roberto</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget</atitle><date>2024-07-31</date><risdate>2024</risdate><abstract>Entity Linking (EL) and Relation Extraction (RE) are fundamental tasks in Natural Language Processing, serving as critical components in a wide range of applications. In this paper, we propose ReLiK, a Retriever-Reader architecture for both EL and RE, where, given an input text, the Retriever module undertakes the identification of candidate entities or relations that could potentially appear within the text. Subsequently, the Reader module is tasked to discern the pertinent retrieved entities or relations and establish their alignment with the corresponding textual spans. Notably, we put forward an innovative input representation that incorporates the candidate entities or relations alongside the text, making it possible to link entities or extract relations in a single forward pass and to fully leverage pre-trained language models contextualization capabilities, in contrast with previous Retriever-Reader-based methods, which require a forward pass for each candidate. Our formulation of EL and RE achieves state-of-the-art performance in both in-domain and out-of-domain benchmarks while using academic budget training and with up to 40x inference speed compared to competitors. Finally, we show how our architecture can be used seamlessly for Information Extraction (cIE), i.e. EL + RE, and setting a new state of the art by employing a shared Reader that simultaneously extracts entities and relations.</abstract><doi>10.48550/arxiv.2408.00103</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2408.00103
ispartof
issn
language eng
recordid cdi_arxiv_primary_2408_00103
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computation and Language
title ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T07%3A53%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ReLiK:%20Retrieve%20and%20LinK,%20Fast%20and%20Accurate%20Entity%20Linking%20and%20Relation%20Extraction%20on%20an%20Academic%20Budget&rft.au=Orlando,%20Riccardo&rft.date=2024-07-31&rft_id=info:doi/10.48550/arxiv.2408.00103&rft_dat=%3Carxiv_GOX%3E2408_00103%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true