The random forest algorithm for statistical learning
Matthias Schonlau () and
Rosie Yuyan Zou ()
Additional contact information
Matthias Schonlau: University of Waterloo
Rosie Yuyan Zou: University of Waterloo
Stata Journal, 2020, vol. 20, issue 1, 3-29
Abstract:
Random forests (Breiman, 2001, Machine Learning 45: 5–32) is a statistical- or machine-learning algorithm for prediction. In this article, we intro- duce a corresponding new command, rforest. We overview the random forest algorithm and illustrate its use with two examples: The first example is a clas- sification problem that predicts whether a credit card holder will default on his or her debt. The second example is a regression problem that predicts the log- scaled number of shares of online news articles. We conclude with a discussion that summarizes key points demonstrated in the examples.
Keywords: rforest; random decision forest algorithm (search for similar items in EconPapers)
Date: 2020
Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj20-1/st0587/
References: Add references at CitEc
Citations: View citations in EconPapers (60)
Downloads: (external link)
http://hdl.handle.net/10.1177/1536867X20909688
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:20:y:2020:i:1:p:3-29
Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html
DOI: 10.1177/1536867X20909688
Access Statistics for this article
Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins
More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum () and Lisa Gilmore ().