Extracting and normalizing organization names from text

Bibliographic details
Main authors: TOPHAM PHILIP S, JONNALAGADDA SIDDHARTHA
Format: Patent
Language: English
Description
Summary: A method, apparatus and article of manufacture for extracting and normalizing organization names from text. The method uses regular expressions, certain rules and dictionaries to identify potential organization names in text, then uses word similarity metrics, clustering, and other considerations to group normalized organization names.
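
The summary names the stages of the method (pattern-based candidate extraction, normalization, similarity scoring, grouping) but this record does not give the details. The following is only a minimal illustrative sketch of that general pipeline, not the patented method. Everything in it is an assumption made for illustration: the suffix dictionary, the regular expression, the use of Python's `difflib.SequenceMatcher` as the similarity metric, the greedy grouping strategy, and the 0.85 threshold.

```python
import re
from difflib import SequenceMatcher

# Hypothetical dictionary of legal-form suffixes (illustrative, not from the patent).
SUFFIXES = {"inc", "incorporated", "corp", "corporation", "ltd", "limited",
            "llc", "co", "company", "gmbh"}

# Assumed pattern: one to four capitalized tokens followed by a legal-form suffix.
ORG_PATTERN = re.compile(
    r"\b(?:[A-Z][\w&'-]*\s+){1,4}"
    r"(?:Incorporated|Inc|Corporation|Corp|Limited|Ltd|LLC|Company|Co|GmbH)\b"
)


def extract_candidates(text):
    """Return raw organization-name mentions found by the pattern."""
    return [m.group(0) for m in ORG_PATTERN.finditer(text)]


def normalize(name):
    """Lowercase, drop punctuation, and remove legal-form suffix tokens."""
    tokens = re.findall(r"[a-z0-9&]+", name.lower())
    return " ".join(t for t in tokens if t not in SUFFIXES)


def similarity(a, b):
    """Character-level similarity in [0, 1]; a stand-in for a word similarity metric."""
    return SequenceMatcher(None, a, b).ratio()


def group_names(names, threshold=0.85):
    """Greedy grouping: attach each name to the first group whose key is similar enough."""
    groups = []
    for name in names:
        key = normalize(name)
        for group in groups:
            if similarity(key, group["key"]) >= threshold:
                group["members"].append(name)
                break
        else:
            # No existing group matched, so this name starts a new one.
            groups.append({"key": key, "members": [name]})
    return groups
```

Calling `group_names(extract_candidates(text))` on a passage returns one entry per inferred organization, each with a normalized key and the raw mentions assigned to it. A greedy single pass is order-dependent; a production system would more likely use a proper clustering algorithm and a larger curated dictionary.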