The random forest algorithm for statistical learning

Schonlau, Matthias; Zou, Rosie Yuyan

The random forest algorithm for statistical learning

Matthias Schonlau () and Rosie Yuyan Zou ()
Additional contact information
Matthias Schonlau: University of Waterloo
Rosie Yuyan Zou: University of Waterloo

Stata Journal, 2020, vol. 20, issue 1, 3-29

Abstract: Random forests (Breiman, 2001, Machine Learning 45: 5–32) is a statistical- or machine-learning algorithm for prediction. In this article, we intro- duce a corresponding new command, rforest. We overview the random forest algorithm and illustrate its use with two examples: The first example is a clas- sification problem that predicts whether a credit card holder will default on his or her debt. The second example is a regression problem that predicts the log- scaled number of shares of online news articles. We conclude with a discussion that summarizes key points demonstrated in the examples.

Keywords: rforest; random decision forest algorithm (search for similar items in EconPapers)
Date: 2020
Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj20-1/st0587/
References: Add references at CitEc
Citations: View citations in EconPapers (79)

Downloads: (external link)
http://hdl.handle.net/10.1177/1536867X20909688

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:20:y:2020:i:1:p:3-29

Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html

DOI: 10.1177/1536867X20909688

Access Statistics for this article

Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins

More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum () and Lisa Gilmore ().