ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R
Marvin N. Wright and
Andreas Ziegler
Journal of Statistical Software, 2017, vol. 077, issue i01
Abstract:
We introduce the C++ application and R package ranger. The software is a fast implementation of random forests for high dimensional data. Ensembles of classification, regression and survival trees are supported. We describe the implementation, provide examples, validate the package with a reference implementation, and compare runtime and memory usage with other implementations. The new software proves to scale best with the number of features, samples, trees, and features tried for splitting. Finally, we show that ranger is the fastest and most memory efficient implementation of random forests to analyze data on the scale of a genome-wide association study.
Date: 2017-03-31
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (142)
Downloads: (external link)
https://www.jstatsoft.org/index.php/jss/article/view/v077i01/v77i01.pdf
https://www.jstatsoft.org/index.php/jss/article/do ... /ranger_0.7.0.tar.gz
https://www.jstatsoft.org/index.php/jss/article/do ... ranger_cpp_0.5.0.zip
https://www.jstatsoft.org/index.php/jss/article/do ... 7i01-replication.zip
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:jss:jstsof:v:077:i01
DOI: 10.18637/jss.v077.i01
Access Statistics for this article
Journal of Statistical Software is currently edited by Bettina Grün, Edzer Pebesma and Achim Zeileis
More articles in Journal of Statistical Software from Foundation for Open Access Statistics
Bibliographic data for series maintained by Christopher F. Baum ().