EconPapers    
Economics at your fingertips  
 

Classification of Poverty Condition Using Natural Language Processing

Guberney Muñetón-Santa (), Daniel Escobar-Grisales, Felipe Orlando López-Pabón, Paula Andrea Pérez-Toro and Juan Rafael Orozco-Arroyave
Additional contact information
Guberney Muñetón-Santa: Universidad de Antioquia
Daniel Escobar-Grisales: Universidad de Antioquia
Felipe Orlando López-Pabón: Universidad de Antioquia
Paula Andrea Pérez-Toro: Universidad de Antioquia
Juan Rafael Orozco-Arroyave: Universidad de Antioquia

Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, 2022, vol. 162, issue 3, No 18, 1413-1435

Abstract: Abstract This work introduces a methodology to classify between poor and extremely poor people through Natural Language Processing. The approach serves as a baseline to understand and classify poverty through the people’s discourses using machine learning algorithms. Based on classical and modern word vector representations we propose two strategies for document level representations: (1) document-level features based on the concatenation of descriptive statistics and (2) Gaussian mixture models. Three classification methods are systematically evaluated: Support Vector Machines, Random Forest, and Extreme Gradient Boosting. The fourth best experiments yielded around 55% of accuracy, while the embeddings based on GloVe word vectors yielded a sensitivity of 79.6% which could be of great interest for the public policy makers to accurately find people who need to be prioritized in social programs.

Keywords: Poverty; Natural language processing; Text classification; Word embedding; Document-level embedding; Machine learning (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s11205-022-02883-z Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:soinre:v:162:y:2022:i:3:d:10.1007_s11205-022-02883-z

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11135

DOI: 10.1007/s11205-022-02883-z

Access Statistics for this article

Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement is currently edited by Filomena Maggino

More articles in Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:soinre:v:162:y:2022:i:3:d:10.1007_s11205-022-02883-z