wsrf: An R Package for Classification with Scalable Weighted Subspace Random Forests
He Zhao,
Graham J. Williams and
Joshua Zhexue Huang
Journal of Statistical Software, 2017, vol. 077, issue i03
Abstract:
We describe a parallel implementation in R of the weighted subspace random forest algorithm (Xu, Huang, Williams, Wang, and Ye 2012) available as the wsrf package. A novel variable weighting method is used for variable subspace selection in place of the traditional approach of random variable sampling. This new approach is particularly useful in building models for high dimensional data - often consisting of thousands of variables. Parallel computation is used to take advantage of multi-core machines and clusters of machines to build random forest models from high dimensional data in considerably shorter times. A series of experiments presented in this paper demonstrates that wsrf is faster than existing packages whilst retaining and often improving on the classification performance, particularly for high dimensional data.
Date: 2017-03-31
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://www.jstatsoft.org/index.php/jss/article/view/v077i03/v77i03.pdf
https://www.jstatsoft.org/index.php/jss/article/do ... 3/wsrf_1.7.10.tar.gz
https://www.jstatsoft.org/index.php/jss/article/do ... 7i03-replication.zip
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:jss:jstsof:v:077:i03
DOI: 10.18637/jss.v077.i03
Access Statistics for this article
Journal of Statistical Software is currently edited by Bettina Grün, Edzer Pebesma and Achim Zeileis
More articles in Journal of Statistical Software from Foundation for Open Access Statistics
Bibliographic data for series maintained by Christopher F. Baum ().