Using K-Means Cluster Analysis and Decision Trees to Highlight Significant Factors Leading to Homelessness

Andrea Yoder Clark, Nicole Blumenfeld, Eric Lal, Shikar Darbari, Shiyang Northwood and Ashkan Wadpey
Additional contact information
Andrea Yoder Clark: School of Business, University of San Diego, 5998 Alcala Park, San Diego, CA 92110, USA
Nicole Blumenfeld: 2-1-1 San Diego, P.O. Box 420039, San Diego, CA 92124, USA
Eric Lal: School of Business, University of San Diego, 5998 Alcala Park, San Diego, CA 92110, USA
Shikar Darbari: School of Business, University of San Diego, 5998 Alcala Park, San Diego, CA 92110, USA
Shiyang Northwood: School of Business, University of San Diego, 5998 Alcala Park, San Diego, CA 92110, USA
Ashkan Wadpey: School of Business, University of San Diego, 5998 Alcala Park, San Diego, CA 92110, USA

Mathematics, 2021, vol. 9, issue 17, 1-14

Abstract: Homelessness has been a persistent social concern in the United States. A combination of political and economic events since the 1960s has driven increases in poverty that, by some accounts, had surpassed 1928 Depression-era levels by 1991. This paper examines how the emerging field of behavioral economics can use machine learning and data science methods to explore preventative responses to homelessness. In this study, machine learning data mining strategies, specifically K-means cluster analysis followed by decision trees, were used to understand how environmental factors and resultant behaviors can contribute to the experience of homelessness. Preventing the first homeless event is especially important, as studies show that a person who has experienced homelessness once is 2.6 times more likely to have another homeless episode. The findings show that when someone is at risk of being unable to pay utility bills while also experiencing challenges with two or more of the other social determinants of health, that individual is statistically significantly more likely to have a first homeless event. Additionally, men over 50 who are not in the workforce, have a health hardship, and experience two or more other social determinants of health hardships at the same time have a high, statistically significant probability of experiencing homelessness for the first time.

Keywords: data science; machine learning; data mining; k-means; cluster analysis; decision trees; homelessness; behavioral economics
JEL-codes: C
Date: 2021
Downloads:
https://www.mdpi.com/2227-7390/9/17/2045/pdf (application/pdf)
https://www.mdpi.com/2227-7390/9/17/2045/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:9:y:2021:i:17:p:2045-:d:621503

Mathematics is currently edited by Ms. Emma He

Bibliographic data for series maintained by MDPI Indexing Manager.

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:9:y:2021:i:17:p:2045-:d:621503