Towards holistic Entity Linking: Survey and directions

Bibliographic details
Published in: Information systems (Oxford) 2021-01, Vol.95, p.101624, Article 101624
Main authors: Oliveira, Italo L., Fileto, Renato, Speck, René, Garcia, Luís P.F., Moussallem, Diego, Lehmann, Jens
Format: Article
Language: English
Description
Summary: Entity Linking (EL) empowers Natural Language Processing applications by linking relevant mentions found in raw textual data to precise information about what they supposedly stand for. However, EL approaches have mostly focused on particular kinds of inputs and frequently fail to properly handle texts from specific sources (e.g., microblogs) that have particularities such as grammatical errors, slang, a lack of contextual information and other problems, as well as difficulty exploiting their associated data (e.g., time stamps, geographic indicators, authors’ profile data). Some EL approaches have been devised to circumvent such challenges. They exploit several inputs, data features, and EL methods in a synergetic process for more powerful and robust collective EL. This paper reviews recent works that employ such holistic strategies for EL, discusses their limitations, and proposes directions for further advancing holistic EL approaches.
• Holistic approaches have the potential to boost Entity Linking results.
• This survey reviews approaches that present some degree of holism.
• The main types of approaches are collective or embedding based.
• Holistic approaches are promising for short text documents.
ISSN: 0306-4379
1873-6076
DOI: 10.1016/j.is.2020.101624