Automated attribute disambiguation with human input

Systems and methods are disclosed herein for performing classification of documents or performing other tasks based on rules. The rules may include context rules that define a mapping that relates a value and context in a document to an attribute to which the value corresponds. Products are selected...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SUBRAMANIAM SRIKANTH, SUN CHONG, RAVIKANT DINTYALA VENKATA SUBRAHMANYA, GARERA NIKESH LUCKY, YALIN HEATHER DAWN, RAMPALLI NARASIMHAN
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Systems and methods are disclosed herein for performing classification of documents or performing other tasks based on rules. The rules may include context rules that define a mapping that relates a value and context in a document to an attribute to which the value corresponds. Products are selected for labeling with attributes by identifying patterns, e.g. values and contexts that are not covered by a current rule set. Those products having a highest score are selected for labeling in a crowd sourcing forum, where the score is based on the number of non-covered patterns and a frequency of occurrence of the non-covered patterns in a document corpus. Proposed rules are generated for frequently occurring patterns and submitted to analysts for one or both of completion and validation. Proposed rules may include a proposed attribute for a frequently occurring value and corresponding context.