Machine learned language modeling and identification

Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are confi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Dos Santos Marujo, Luis Carlos, Carvalho, Vitor Rocha de, Neves, Leonardo Ribas Machado das
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Systems, devices, media, and methods are presented for generating a language detection model of a language analysis system. The systems and methods access a set of messages including text elements and convert the set of messages into a set of training messages. The set of training messages are configured for training a language detection model. The systems and methods train a classifier based on the set of training messages. The classifier has a set of features representing word frequency, character frequency, and a character ratio. The systems and methods generate a language detection model based on the classifier and the set of features.