Classification of Poverty Condition Using Natural Language Processing

This work introduces a methodology to classify between poor and extremely poor people through Natural Language Processing. The approach serves as a baseline to understand and classify poverty through the people’s discourses using machine learning algorithms. Based on classical and modern word vector...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Social indicators research 2022-08, Vol.162 (3), p.1413-1435
Hauptverfasser:	Muñetón-Santa, Guberney, Escobar-Grisales, Daniel, López-Pabón, Felipe Orlando, Pérez-Toro, Paula Andrea, Orozco-Arroyave, Juan Rafael
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Classification Discourses Experiments Human Geography Information sources Language Localization Low income groups Machine learning Machinery Microeconomics Natural language processing Original Research Policy making Poverty Public Health Public policy Quality of Life Research Social networks Social programs Social Sciences Sociology Statistics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This work introduces a methodology to classify between poor and extremely poor people through Natural Language Processing. The approach serves as a baseline to understand and classify poverty through the people’s discourses using machine learning algorithms. Based on classical and modern word vector representations we propose two strategies for document level representations: (1) document-level features based on the concatenation of descriptive statistics and (2) Gaussian mixture models. Three classification methods are systematically evaluated: Support Vector Machines, Random Forest, and Extreme Gradient Boosting. The fourth best experiments yielded around 55% of accuracy, while the embeddings based on GloVe word vectors yielded a sensitivity of 79.6% which could be of great interest for the public policy makers to accurately find people who need to be prioritized in social programs.
ISSN:	0303-8300 1573-0921
DOI:	10.1007/s11205-022-02883-z