Sure screening by ranking the canonical correlations
Xin-Bing Kong (),
Zhi Liu (),
Yuan Yao () and
Wang Zhou ()
Additional contact information
Xin-Bing Kong: CASER, Soochow University
Zhi Liu: University of Macau and UM Zhuhai Research Institute
Yuan Yao: Hong Kong Baptist University
Wang Zhou: National University of Singapore
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2017, vol. 26, issue 1, No 3, 46-70
Abstract:
Abstract The sure independence screening procedure by ranking the marginal Pearson correlation is well documented in literatures and works satisfactorily in the ultra-high dimensional case. However, this marginal Person correlation learning would easily miss the variable that is marginally uncorrelated with the response but correlated with the response jointly with some other variables. This failure in missing an important variable is due to the fact that the marginal Pearson correlation does not use the joint information of the response and a set of covariates. In this paper, we introduce a new screening method which leaves a variable into the active set if it jointly with some other variables has a high canonical correlation with the response. This is accomplished via ranking canonical correlations between the response and all possible sets of k variables. Our results show that the procedure has the sure screening property and substantially reduces the dimensionality to a moderate size against the sample size. Extensive simulations demonstrate that our new method performs substantially better than the existing sure independence screening approaches based on the marginal Pearson correlation or Kental’s tau rank correlation. A real data set is also analyzed by implementing our approach.
Keywords: High dimension; Correlation; Group screening; Primary 62J99; Secondary 62F12 (search for similar items in EconPapers)
Date: 2017
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://link.springer.com/10.1007/s11749-016-0497-z Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:26:y:2017:i:1:d:10.1007_s11749-016-0497-z
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/11749/PS2
DOI: 10.1007/s11749-016-0497-z
Access Statistics for this article
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino
More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().