METHOD FOR SYSTEMATIC MASS NORMALIZATION OF TITLES

A system for determining a canonical title includes an interface and a processor. The interface is to receive a new title. The processor is to generate a set of n-grams based on the new title; look up n-grams in canonical database to assign one or more labels; and map the new title to a canonical ti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Lita Lucian, Tewary Anuranjita, Giverts Vladimir, Rabbani Tarek, Gateley Kristy, Goldman Jonathan, Feinstein Efraim, Ajao Adeyemi, Segal Barzel, Monroe Phil, Prach Jan
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A system for determining a canonical title includes an interface and a processor. The interface is to receive a new title. The processor is to generate a set of n-grams based on the new title; look up n-grams in canonical database to assign one or more labels; and map the new title to a canonical title based on the one or more labels.