Document Entity Linking on Online Social Networks

In one embodiment, a method includes accessing a document, identifying one or more noun phrases in the document by performing a pre-processing on the accessed document, generating, for each identified noun phrase, a list of candidate entities corresponding to the noun phrase, wherein the list of can...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Yan, Xiaohua, Dousti, Mohammad Javad, Du, Jingfei, Shankar, Jeevan, Xue, Bi, Stoyanov, Veselin S, Shenoy, Rajesh Krishna
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In one embodiment, a method includes accessing a document, identifying one or more noun phrases in the document by performing a pre-processing on the accessed document, generating, for each identified noun phrase, a list of candidate entities corresponding to the noun phrase, wherein the list of candidate entities is looked up in an entity index using the noun phrase, computing, for each candidate entity corresponding to each identified noun phrase, a confidence score that the noun phrase is intended to reference the candidate entity by analyzing the accessed document by a machine learning model, constructing a pool of mention-entity pairs for the accessed document, filtering the pool of mention-entity pairs by removing each mention-entity pair from the pool based on their computed confidence scores, and storing the post-filtered pool of mention-entity pairs in a data store in association with the accessed document.