CLASSIFYING DOCUMENTS BASED ON TEXT ANALYSIS AND MACHINE LEARNING

A computer device identifies a set of documents for classification. The computing device classifies documents of a first subset of the set of documents based, at least in part, on a text analysis of the documents of the first subset. The computing device trains a document classifier using, as traini...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Kern, Robert, Schuetz, Werner, Babu, Hemanth Kumar, Schieber, Dieter Hans, Koenig, Holger, Bremer, Lars, Gerstl, Peter, Baessler, Michael
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A computer device identifies a set of documents for classification. The computing device classifies documents of a first subset of the set of documents based, at least in part, on a text analysis of the documents of the first subset. The computing device trains a document classifier using, as training data: (i) results of the classifying of the documents of the first subset, and (ii) metadata associated with the documents of the first subset. The computing device classifies documents of a second subset of the set of documents by providing metadata of the documents of the second subset to the trained document classifier.