A probabilistically entropic mechanism of topical clusterisation along with thematic annotation for evolution analysis of meaningful social information of internet sources

An approach to monitoring temporal evolution of thematic clusters with evaluating their relations on base of probability and entropy methods is presented. It allows to get a temporary map of nested topics with their short annotations, concerning a predetermined main theme. The methods of semantic an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Lobachevskii journal of mathematics 2017-09, Vol.38 (5), p.910-913
Hauptverfasser: Gydovskikh, D. V., Moloshnikov, I. A., Naumov, A. V., Rybka, R. B., Sboev, A. G., Selivanov, A. A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:An approach to monitoring temporal evolution of thematic clusters with evaluating their relations on base of probability and entropy methods is presented. It allows to get a temporary map of nested topics with their short annotations, concerning a predetermined main theme. The methods of semantic analysis of texts to generate topics and to find the most emotive of them to reflect a social significance are used. The technology word2vec was implemented to determine the relation of topics and evaluate their proximity to the main theme. To increase the usability the visualization of nested topics is realized on base of a WEB interface. The proposed approach complements well the popular software for analyzing big volumes of data such as Elasticsearch (search for thematically similar documents). Results of case study of analyzing the theme “AEROFLOT” on base of news corpus which consists of 3 million messages is presented.
ISSN:1995-0802
1818-9962
DOI:10.1134/S1995080217050134