EconPapers    
Economics at your fingertips  
 

The EBIC and a sequential procedure for feature selection in interactive linear models with high-dimensional data

Yawei He () and Zehua Chen ()

Annals of the Institute of Statistical Mathematics, 2016, vol. 68, issue 1, 155-180

Abstract: High-dimensional data arises in many important scientific fields. The analysis of high-dimensional data poses great challenges to statisticians. In high-dimensional data, the relationship among the variables is complex. It involves main effects as well as interaction effects of the covariates. The effect of some covariates is only realized through their interaction with the others. This makes the consideration of interactive models imperative in the analysis of high-dimensional data. Because of the existence of high spurious correlation among the covariates in high-dimensional data, conventional tools for dealing with interactive models become inappropriate. In this paper, we develop specific tools for feature selection in high-dimensional data with interactive models, including a version of the extended BIC (EBIC) for interactive models and a sequential feature selection procedure. Main-effect and interaction features are treated differently in the EBIC for interactive models and the sequential procedure due to their different natures. The selection consistency of the EBIC for interactive models and the sequential procedure is established. Simulation studies are carried out to vindicate the asymptotic property in finite samples as well as to compare with non-sequential procedures. The approach developed in this paper is also applied to a real data set. Copyright The Institute of Statistical Mathematics, Tokyo 2016

Keywords: High-dimensional data; EBIC; Feature selection; Interactive model; Sequential procedure; Selection consistency (search for similar items in EconPapers)
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://hdl.handle.net/10.1007/s10463-014-0497-2 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:aistmt:v:68:y:2016:i:1:p:155-180

Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/10463/PS2

DOI: 10.1007/s10463-014-0497-2

Access Statistics for this article

Annals of the Institute of Statistical Mathematics is currently edited by Tomoyuki Higuchi

More articles in Annals of the Institute of Statistical Mathematics from Springer, The Institute of Statistical Mathematics
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:aistmt:v:68:y:2016:i:1:p:155-180