Targeting predictors in random forest regression
Daniel Borup,
Bent Jesper Christensen,
Nicolaj Nørgaard Mühlbach and
Mikkel Slot Nielsen
Papers from arXiv.org
Abstract:
Random forest regression (RF) is an extremely popular tool for the analysis of high-dimensional data. Nonetheless, its benefits may be lessened in sparse settings due to weak predictors, and a pre-estimation dimension reduction (targeting) step is required. We show that proper targeting controls the probability of placing splits along strong predictors, thus providing an important complement to RF's feature sampling. This is supported by simulations using representative finite samples. Moreover, we quantify the immediate gain from targeting in terms of increased strength of individual trees. Macroeconomic and financial applications show that the bias-variance trade-off implied by targeting, due to increased correlation among trees in the forest, is balanced at a medium degree of targeting, selecting the best 10-30% of commonly applied predictors. Improvements in predictive accuracy of targeted RF relative to ordinary RF are considerable, up to 12-13%, occurring both in recessions and expansions, particularly at long horizons.
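As a rough illustration of what a pre-estimation targeting step might look like, the sketch below ranks predictors by their absolute marginal correlation with the target and keeps the top share. Correlation screening is a stand-in chosen here for simplicity and is not taken from the abstract, which does not name the targeting method used. The function name `target_predictors`, the parameter `keep_frac`, and the simulated data are all hypothetical.

```python
import numpy as np

def target_predictors(X, y, keep_frac=0.2):
    """Return sorted column indices of the top `keep_frac` share of predictors,
    ranked by absolute marginal correlation with y (an assumed, simple
    screening rule; not necessarily the targeting method used in the paper)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n_keep = max(1, int(round(keep_frac * X.shape[1])))

    # Center the data, then compute the correlation of each column with y.
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.linalg.norm(Xc, axis=0) * np.linalg.norm(yc)
    corr = np.zeros(X.shape[1])
    # Constant columns (zero norm) keep a correlation of 0.
    np.divide(Xc.T @ yc, denom, out=corr, where=denom > 0)

    # Keep the n_keep predictors with the largest absolute correlation.
    top = np.argsort(-np.abs(corr))[:n_keep]
    return np.sort(top)

# Hypothetical sparse setting: 50 predictors, only the first 3 matter.
rng = np.random.default_rng(0)
n, p = 500, 50
X = rng.standard_normal((n, p))
y = 2.0 * X[:, 0] - 1.5 * X[:, 1] + X[:, 2] + 0.5 * rng.standard_normal(n)

kept = target_predictors(X, y, keep_frac=0.2)  # medium degree of targeting
X_targeted = X[:, kept]                        # reduced design matrix
```

The reduced matrix `X_targeted` could then be passed to any random forest implementation (for example scikit-learn's `RandomForestRegressor`) in place of `X`. The value `keep_frac=0.2` simply falls inside the 10-30% range mentioned in the abstract.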
Date: 2020-04, Revised 2020-11
New Economics Papers: this item is included in nep-ecm and nep-for
Downloads: (external link)
http://arxiv.org/pdf/2004.01411 Latest version (application/pdf)
Related works:
Journal Article: Targeting predictors in random forest regression (2023) 
Working Paper: Targeting predictors in random forest regression (2020) 
Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2004.01411