Employment modelling through classification and regression trees
Anton Gerunov
International Journal of Data Science, 2016, vol. 1, issue 4, 316-329
Abstract:
The research paper leverages a big dataset from the field of social sciences - the combined World Values Survey 1981-2014 data - to investigate what determines an individual's employment status. We propose an approach to model this by first reducing data dimensionality at a small informational loss and then fitting a number of alternative machine-learning algorithms. A decision tree and a random forest model are studied in more detail. Variable importance is investigated to glean insight into what determines employment status. Employment is explained through traditional demographic and work attitude variables but unemployment is not, meaning that the latter is likely driven by other factors, including structural labour market characteristics and even randomness. The main contribution of this paper is to outline a new approach for doing big data-driven research in labour economics and apply it to a dataset that was not previously investigated in its entirety, thus achieving a more sophisticated process understanding.
Keywords: labour market; unemployment; big data; employment modelling; classification; regression trees; CART; random forest; social sciences; employment status; demographics; work attitudes; labour economics. (search for similar items in EconPapers)
Date: 2016
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.inderscience.com/link.php?id=81368 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdsci:v:1:y:2016:i:4:p:316-329
Access Statistics for this article
More articles in International Journal of Data Science from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().