Random forest analysis of two household surveys can identify important predictors of migration in Bangladesh
Kelsea B. Best,
Jonathan M. Gilligan,
Hiba Baroud,
Amanda R. Carrico,
Katharine M. Donato,
Brooke A. Ackerly and
Bishawjit Mallick
Additional contact information
Kelsea B. Best: Vanderbilt University
Jonathan M. Gilligan: Vanderbilt University
Hiba Baroud: Vanderbilt University
Amanda R. Carrico: University of Colorado at Boulder
Katharine M. Donato: Georgetown University
Brooke A. Ackerly: Vanderbilt University
Bishawjit Mallick: Technische Universität Dresden
Journal of Computational Social Science, 2021, vol. 4, issue 1, No 4, 77-100
Abstract:
The decision to migrate is complex and is often influenced by a combination of economic, social, political, and environmental pressures. Household survey instruments can capture detailed information about migration histories and their contexts, but it can be challenging to identify important predictors from large numbers of covariates with standard statistical methods, such as regression analyses. Machine learning techniques are well suited to pattern identification and can identify important covariates from large datasets. We report on the application of machine learning approaches to two large surveys collected from a total of more than 2800 households in southwestern Bangladesh. We applied random forest classification and regression models to identify significant covariates with the greatest predictive power for household migration decisions. The results show that random forest models are able to identify nuances in predictors of different types of migration and migration in different communities. Random forests also outperform logistic regression and support vector machines in predicting migration in all cases analyzed. Therefore, random forest models and other machine learning methods can be useful for improving the predictive accuracy of migration models and identifying patterns in complex social datasets. Future work should continue to explore the potential of machine learning techniques applied to questions of environmental migration.
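Illustrative sketch (not part of the original record): the abstract describes using random forests to rank survey covariates by predictive power. The toy example below shows that general idea in plain Python, assuming a small synthetic dataset. It is not the authors' code, data, or configuration; the covariate names (`crop_loss`, `household_size`, `land_owned`, `unrelated`), the labeling rule, and all parameter values are hypothetical. It also omits the logistic regression and support vector machine comparison that the paper reports.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values()) if n else 0.0

def best_split(X, y, features):
    """Return (gain, feature, threshold) for the best Gini split, or None."""
    parent, n, best = gini(y), len(y), None
    for f in features:
        for t in sorted({row[f] for row in X}):
            left = [y[i] for i in range(n) if X[i][f] <= t]
            right = [y[i] for i in range(n) if X[i][f] > t]
            if not left or not right:
                continue
            gain = parent - (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or gain > best[0]:
                best = (gain, f, t)
    return best

def grow(X, y, depth, mtry, rng, importance, total):
    """Grow one tree; add each split's weighted impurity decrease to importance."""
    split = None
    if depth > 0 and len(set(y)) > 1:
        # Random feature subset at every node, as in a random forest.
        split = best_split(X, y, rng.sample(range(len(X[0])), mtry))
    if split is None or split[0] <= 0:
        return Counter(y).most_common(1)[0][0]  # leaf: majority class
    gain, f, t = split
    importance[f] += gain * len(y) / total
    li = [i for i in range(len(y)) if X[i][f] <= t]
    ri = [i for i in range(len(y)) if X[i][f] > t]
    return (f, t,
            grow([X[i] for i in li], [y[i] for i in li], depth - 1, mtry, rng, importance, total),
            grow([X[i] for i in ri], [y[i] for i in ri], depth - 1, mtry, rng, importance, total))

def random_forest(X, y, n_trees=30, depth=3, mtry=2, seed=0):
    """Fit bootstrapped trees; return (trees, normalized feature importances)."""
    rng = random.Random(seed)
    importance = [0.0] * len(X[0])
    trees = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(y)) for _ in y]  # bootstrap sample with replacement
        trees.append(grow([X[i] for i in idx], [y[i] for i in idx],
                          depth, mtry, rng, importance, len(y)))
    total = sum(importance) or 1.0
    return trees, [v / total for v in importance]

def predict(trees, row):
    """Majority vote over all trees in the forest."""
    votes = []
    for node in trees:
        while isinstance(node, tuple):  # internal nodes are tuples, leaves are labels
            f, t, left, right = node
            node = left if row[f] <= t else right
        votes.append(node)
    return Counter(votes).most_common(1)[0][0]

# Synthetic "survey": ordinal covariates coded 0-4 (all names are hypothetical).
names = ["crop_loss", "household_size", "land_owned", "unrelated"]
data_rng = random.Random(42)
X = [[data_rng.randint(0, 4) for _ in names] for _ in range(300)]
# Assumed rule: households with high crop_loss migrate; 10% of labels are random noise.
y = [int(row[0] >= 3) if data_rng.random() > 0.1 else data_rng.randint(0, 1) for row in X]

trees, importances = random_forest(X[:200], y[:200])
accuracy = sum(predict(trees, r) == label for r, label in zip(X[200:], y[200:])) / 100

for name, score in sorted(zip(names, importances), key=lambda p: -p[1]):
    print(f"{name:15s} {score:.3f}")
print(f"held-out accuracy: {accuracy:.2f}")
```

In practice an established library implementation would normally be preferred; the hand-rolled version is used here only to keep the sketch self-contained.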
Keywords: Random forest; Machine learning; Migration; Climate change; Bangladesh
Date: 2021
Downloads: (external link)
http://link.springer.com/10.1007/s42001-020-00066-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:jcsosc:v:4:y:2021:i:1:d:10.1007_s42001-020-00066-9
Ordering information: This journal article can be ordered from
http://www.springer. ... iences/journal/42001
DOI: 10.1007/s42001-020-00066-9
Journal of Computational Social Science is currently edited by Takashi Kamihigashi
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.