A Regression Subset-Selection Strategy for Fat-Structure Data
Cristian Gatu (),
Marko Sysi-Aho () and
Matej Orešič ()
Additional contact information
Cristian Gatu: VTT Technical Research Centre of Finland
Marko Sysi-Aho: VTT Technical Research Centre of Finland
Matej Orešič: VTT Technical Research Centre of Finland
A chapter in COMPSTAT 2008, 2008, pp 349-358 from Springer
Abstract:
Abstract A strategy is proposed for finding the most significant linear regression submodel for fat-structure data, that is when the number of variables n exceeds the number of available observations m. The method consists of two stages. First, a heuristic is employed to preselect a number of variables n S such that n S ≤m. The second stage performs an exhaustive search on the reduced list of variables. It employs a regression tree structure that generates all possible subset models. Non-optimal subtrees are pruned using a branch-and-bound device. Cross validation experiments on a real biomedical dataset are presented and analyzed.
Keywords: regression tree; branch-and-bound; model selection; fat-structure data (search for similar items in EconPapers)
Date: 2008
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-7908-2084-3_29
Ordering information: This item can be ordered from
http://www.springer.com/9783790820843
DOI: 10.1007/978-3-7908-2084-3_29
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().