Multivariate forests with missing mixed outcomes
Abdessamad Dine,
Denis Larocque and
François Bellavance
Communications in Statistics - Theory and Methods, 2017, vol. 46, issue 23, 11500-11513
Abstract:
In this article, we propose a multivariate random forest method for multiple responses of mixed types with missing responses. Imputation is performed for each bootstrap sample used to build the individual trees that form the forest. The individual trees are built using a weighted splitting rule allowing downweighting of imputed observations. A simulation study shows the benefits of this approach over complete case analysis when missing responses are missing completely at random and missing at random (MAR). In particular, the gain in prediction accuracy of the proposed method is larger in the MAR case and also increases as the proportion of missing increases.
Date: 2017
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2016.1271427 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:46:y:2017:i:23:p:11500-11513
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20
DOI: 10.1080/03610926.2016.1271427
Access Statistics for this article
Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe
More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().