Automated News Categorization using Machine Learning methods

Being one of the most linguistically rich languages, Azerbaijani has been researched less in the context of natural language processing area. The text corpus created from Azerbaijani news articles is designed to apply supervised machine learning approaches for the case of automatic news labeling. Ch...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IOP conference series. Materials Science and Engineering 2018-12, Vol.459 (1), p.12006
Hauptverfasser: Suleymanov, U, Rustamov, S
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Being one of the most linguistically rich languages, Azerbaijani has been researched less in the context of natural language processing area. The text corpus created from Azerbaijani news articles is designed to apply supervised machine learning approaches for the case of automatic news labeling. Chi-squared test and LASSO methods have been implemented for feature selection and pre-processing. The application of supervised machine learning approaches to the text corpus allowed us to compare the performance results of well-established supervised machine learning approaches in the domain of Azerbaijani language.
ISSN:1757-8981
1757-899X
1757-899X
DOI:10.1088/1757-899X/459/1/012006