Machine Labor
Joshua Angrist and
Brigham Frandsen
Journal of Labor Economics, 2022, vol. 40, issue S1, S97 - S140
Abstract:
The utility of machine learning (ML) for regression-based causal inference is illustrated by using lasso to select control variables for estimates of college characteristics’ wage effects. Post-double-selection lasso offers a path to data-driven sensitivity analysis. ML also seems useful for an instrumental variables (IV) first stage, since two-stage least squares (2SLS) bias reflects overfitting. While ML-based instrument selection can improve on 2SLS, split-sample IV and limited information maximum likelihood do better. Finally, we use ML to choose IV controls. Here, ML creates artificial exclusion restrictions, generating spurious findings. On balance, ML seems ill-suited to IV applications in labor economics.
Date: 2022
References: Add references at CitEc
Citations:
Downloads: (external link)
http://dx.doi.org/10.1086/717933 (application/pdf)
http://dx.doi.org/10.1086/717933 (text/html)
Access to the online full text or PDF requires a subscription.
Related works:
Working Paper: Machine Labor (2019) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ucp:jlabec:doi:10.1086/717933
Access Statistics for this article
More articles in Journal of Labor Economics from University of Chicago Press
Bibliographic data for series maintained by Journals Division ().