Economic Measurement Lost in a Random Forest? A Case Study of Employment Data
Abe Dunn,
Eric English,
Kyle Hood,
Lowell Mason and
Brian Quistorff
AEA Papers and Proceedings, 2025, vol. 115, 68-72
Abstract:
Big data and machine learning (ML) offer transformative potential for economic measurement. This study evaluates the use of alternative employment data from a payroll processor to improve on timely measures of regional employment estimates, comparing ML methods—Lasso regression and Random Forest (RF)—to linear models. RF models show substantial improvements in cross-validation but struggle with extrapolation, particularly during the pandemic. At the county level, greater data variation aids prediction, though sampling errors complicate performance. These findings highlight ML's promise in improving economic statistics while emphasizing the need for careful model selection, robust evaluation metrics, and consideration of data-specific challenges.
JEL-codes: C45 C55 D22 L25 M15 M51 R23 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.aeaweb.org/doi/10.1257/pandp.20251103 (application/pdf)
https://www.aeaweb.org/articles/materials/23019 (application/zip)
Access to full text is restricted to AEA members and institutional subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:aea:apandp:v:115:y:2025:p:68-72
Ordering information: This journal article can be ordered from
https://www.aeaweb.org/subscribe.html
DOI: 10.1257/pandp.20251103
Access Statistics for this article
AEA Papers and Proceedings is currently edited by William Johnson and Kelly Markel
More articles in AEA Papers and Proceedings from American Economic Association Contact information at EDIRC.
Bibliographic data for series maintained by Michael P. Albert ().