Estimating intergenerational income mobility on sub-optimal data: a machine learning approach
Francesco Bloise,
Paolo Brunori and
Patrizio Piraino
LSE Research Online Documents on Economics from London School of Economics and Political Science, LSE Library
Abstract:
Much of the global evidence on intergenerational income mobility is based on sub-optimal data. In particular, two-stage techniques are widely used to impute parental incomes for analyses of lower-income countries and for estimating long-run trends across multiple generations and historical periods. We propose applying machine learning methods to improve the reliability and comparability of such estimates. Supervised learning algorithms minimize the out-of-sample prediction error in the parental income imputation and provide an objective criterion for choosing across different specifications of the first-stage equation. We use our approach on data from the United States and South Africa to show that under common conditions it can limit the bias generally associated to mobility estimates based on imputed parental income.
Keywords: intergenerational income mobility; machine learning; two-sample two-stage least squares (search for similar items in EconPapers)
JEL-codes: F3 G3 N0 (search for similar items in EconPapers)
Pages: 23 pages
Date: 2021-12-01
References: Add references at CitEc
Citations:
Published in Journal of Economic Inequality, 1, December, 2021, 19(4), pp. 643-665. ISSN: 1569-1721
Downloads: (external link)
https://researchonline.lse.ac.uk/id/eprint/112762/ Open access version. (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ehl:lserod:112762
Access Statistics for this paper
More papers in LSE Research Online Documents on Economics from London School of Economics and Political Science, LSE Library LSE Library Portugal Street London, WC2A 2HD, U.K.. Contact information at EDIRC.
Bibliographic data for series maintained by LSERO Manager ().