Database "Pro-family (pronatalist) communities in the social network VKontakte"
Published in: | Naselenie i èkonomika 2020-11, Vol.4 (3), p.98-103 |
---|---|
Main author: | |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Summary: | The database contains an export of text comments from the social network VKontakte in .csv format (UTF-8 encoding). The comments were collected from communities discussing pregnancy, childhood, motherhood, etc. The export contains comments on posts that users interacted with. The absolute number of likes was used as the criterion (comments were collected where the number of likes is greater than or equal to 5). The text data was pre-processed (stemming and lemmatization).
The data is suitable for thematic analysis (e.g. topic modelling with Latent Dirichlet Allocation, LDA), for modelling the graph structure of communities (the link_comment variable contains a unique post identifier, link_author contains a unique user identifier), for sentiment analysis of statements, and for building a dictionary of demographic connotations in Russian. Sentiment analysis of statements makes it possible to measure the dynamics of the “demographic temperature” in pro-family (pronatalist) communities. |
---|---|
ISSN: | 2658-3798 |
DOI: | 10.3897/popecon.4.e60915 |
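The graph use case mentioned in the summary can be illustrated with a minimal sketch, assuming the export is a comma-separated file with a header row. Only link_comment and link_author are names taken from the summary; the likes column name, the helper functions, and the sample rows are hypothetical and serve only as an illustration.

```python
import csv
import io
from collections import defaultdict


def build_post_authors(csv_file, min_likes=5, likes_column="likes"):
    """Map each post identifier to the set of users who commented on it.

    link_comment and link_author are the variable names given in the summary.
    likes_column is a hypothetical name: the summary does not name that column.
    """
    posts = defaultdict(set)
    for row in csv.DictReader(csv_file):
        # The published data is described as already limited to >= 5 likes,
        # so this check is only a sanity filter.
        if int(row[likes_column]) >= min_likes:
            posts[row["link_comment"]].add(row["link_author"])
    return dict(posts)


def co_commenter_pairs(posts):
    """Project the post-user graph onto users: pairs who share a post."""
    pairs = set()
    for authors in posts.values():
        ordered = sorted(authors)
        for i, first in enumerate(ordered):
            for second in ordered[i + 1:]:
                pairs.add((first, second))
    return pairs


# Made-up rows standing in for the real file (not actual data).
SAMPLE = """link_comment,link_author,likes
p1,u1,7
p1,u2,5
p2,u1,12
p2,u3,3
"""

posts = build_post_authors(io.StringIO(SAMPLE))
pairs = co_commenter_pairs(posts)
```

For the real file, the in-memory sample would be replaced by something like `open(path, encoding="utf-8", newline="")`, with the delimiter and column names adjusted to whatever the actual export uses.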