Enriching administrative data using survey data and machine learning techniques
Max Kunaschk
Economics Letters, 2024, vol. 243, issue C
Abstract:
I propose an approach to enrich administrative data with information only available in survey data using machine learning techniques. To illustrate the approach, I replicate a prominent study that used survey data to analyze the federal minimum wage introduction in Germany. In contrast to the original study, I use the universe of German establishments rather than the limited number of establishments that participated in the survey. As the administrative data do not contain information on whether establishments were treated by the minimum wage, I use a random forest classifier, trained on survey data, to predict the treatment status of establishments. The results obtained using the administrative data are qualitatively similar to the results obtained using the survey data. Beyond replication of previous research, this approach broadens the research potential of administrative data, enabling researchers to explore more detailed research questions at scale.
Keywords: Machine learning; Administrative data; Survey data; Minimum wage (search for similar items in EconPapers)
JEL-codes: C53 C55 J21 J23 J31 J38 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0165176524004087
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:ecolet:v:243:y:2024:i:c:s0165176524004087
DOI: 10.1016/j.econlet.2024.111924
Access Statistics for this article
Economics Letters is currently edited by Economics Letters Editorial Office
More articles in Economics Letters from Elsevier
Bibliographic data for series maintained by Catherine Liu ().