Variable importance index based on the partial least squares and boxplot cutoff threshold for variable selection
Noppamas Akarachantachote,
Seree Chadcham and
Kidakan Saithanu
International Journal of Data Analysis Techniques and Strategies, 2017, vol. 9, issue 1, 34-45
Abstract:
The variable importance in projection or VIP index obtained by the partial least squares regression (PLS-R) has become a crucial measurement of each predictor to relieve a problem of measuring multiple variables per sample. It has been applied to classification task although it is designed for regression. The new variable importance index combining concept of PLS-R and boxplot cutoff threshold, VIIC-BCT, was here particularly presented for classification of high dimensional data. The proposed VIIC-BCT was compared to the traditional VIP index (VIP-1) and the modified VIP index with boxplot cutoff threshold (VIP-BCT) thru simulation. The four parameters, percentage of the number of relevant variables (Prel), magnitude of mean difference of relevant variables between two classes (Mdif), degree of correlation between relevant variables (Σ) and the sample size (n), were specified to generate the specific 108 situations. The result indicated the VIIC-BCT shows the best performance in the particularly complicated circumstance.
Keywords: variable selection; data classification; partial least squares; PLS regression; PLS-R; variable importance in projection; VIP index; VIP-BCT; VIIC-BCT; boxplot cutoff threshold; multiple variables; high dimensional data. (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.inderscience.com/link.php?id=83063 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:injdan:v:9:y:2017:i:1:p:34-45
Access Statistics for this article
More articles in International Journal of Data Analysis Techniques and Strategies from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().