Selection of Biologically Relevant Genes with a Wrapper Stochastic Algorithm
Lê Cao Kim-Anh,
Gonçalves Olivier,
Besse Philippe and
Sébastien Gadat
Additional contact information
Lê Cao Kim-Anh: Université de Toulouse, CNRS (UMR 5219) and INRA
Gonçalves Olivier: LBP UMR CNRS 6023, Blaise Pascal University
Besse Philippe: Université de Toulouse, CNRS (UMR 5219)
Statistical Applications in Genetics and Molecular Biology, 2007, vol. 6, issue 1, 23
Abstract:
We investigate an important issue of a meta-algorithm for selecting variables in the framework of microarray data. This wrapper method starts from any classification algorithm and weights each variable (i.e. gene) relative to its efficiency for classification. An optimization procedure is then inferred which exhibits important genes for the studied biological process. Theory and application with the SVM classifier were presented in Gadat and Younes (2007), and we extend this method with CART. The classification error rates are computed on three well-known public databases (Leukemia, Colon and Prostate) and compared with those from other wrapper methods (RFE, l0 norm SVM, Random Forests). This allows the assessment of the statistical relevance of the proposed algorithm. Furthermore, a biological interpretation of the Ingenuity Pathway Analysis software outputs clearly shows that the gene selections from the different wrapper methods yield very relevant biological information, compared to a classical filter gene selection with a t-test.
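The idea of weighting each gene by its usefulness to a classifier can be illustrated with a small sketch. This is not the algorithm of the article or of Gadat and Younes (2007), whose details are not given in the abstract. It is a simplified stand-in under stated assumptions: a nearest-centroid classifier replaces SVM or CART, the data are simulated, and the update rule (a multiplicative reward equal to the error increase observed when a gene is removed from a randomly drawn subset) is a generic heuristic. All function names and parameters (`gene_weights`, `eta`, `explore`, and so on) are hypothetical.

```python
import math
import random


def make_toy_data(rng, n_samples, n_genes, informative, shift):
    """Simulate expression values; only `informative` genes depend on the class."""
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        for g in informative:
            row[g] += shift if label == 1 else -shift
        X.append(row)
        y.append(label)
    return X, y


def centroid_error(X_tr, y_tr, X_va, y_va, genes):
    """Validation error of a nearest-centroid classifier restricted to `genes`."""
    centroids = {}
    for label in (0, 1):
        rows = [x for x, lab in zip(X_tr, y_tr) if lab == label]
        centroids[label] = [sum(r[g] for r in rows) / len(rows) for g in genes]
    mistakes = 0
    for x, lab in zip(X_va, y_va):
        dist = {c: sum((x[g] - m) ** 2 for g, m in zip(genes, centroids[c]))
                for c in (0, 1)}
        predicted = 0 if dist[0] <= dist[1] else 1
        mistakes += predicted != lab
    return mistakes / len(y_va)


def normalise(log_w):
    """Turn log-weights into a probability vector (numerically stable)."""
    top = max(log_w)
    w = [math.exp(v - top) for v in log_w]
    total = sum(w)
    return [v / total for v in w]


def sample_subset(rng, probs, k):
    """Draw k distinct genes, each draw proportional to the current weights."""
    candidates = list(range(len(probs)))
    chosen = []
    for _ in range(k):
        g = rng.choices(candidates, weights=[probs[c] for c in candidates], k=1)[0]
        candidates.remove(g)
        chosen.append(g)
    return chosen


def gene_weights(X_tr, y_tr, X_va, y_va, k=3, n_iter=300, eta=2.0,
                 explore=0.1, seed=0):
    """Heuristic stochastic wrapper: reward genes whose removal raises the error."""
    rng = random.Random(seed)
    n_genes = len(X_tr[0])
    log_w = [0.0] * n_genes
    for _ in range(n_iter):
        # Mix the current weights with a uniform floor to keep exploring.
        probs = [(1 - explore) * w + explore / n_genes for w in normalise(log_w)]
        subset = sample_subset(rng, probs, k)
        err_full = centroid_error(X_tr, y_tr, X_va, y_va, subset)
        for g in subset:
            reduced = [h for h in subset if h != g]
            gain = centroid_error(X_tr, y_tr, X_va, y_va, reduced) - err_full
            log_w[g] += eta * gain
    return normalise(log_w)


# Toy usage: genes 0 and 1 carry the class signal, the other 18 are noise.
data_rng = random.Random(1)
X_tr, y_tr = make_toy_data(data_rng, 60, 20, informative=(0, 1), shift=1.5)
X_va, y_va = make_toy_data(data_rng, 60, 20, informative=(0, 1), shift=1.5)
weights = gene_weights(X_tr, y_tr, X_va, y_va)
ranking = sorted(range(len(weights)), key=lambda g: -weights[g])
print("top-ranked genes:", ranking[:5])
```

Genes are then ranked by their final weight. Any classifier that returns an error rate for a given gene subset could be substituted for `centroid_error`, which is the sense in which such a method is a wrapper.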
Keywords: gene selection; classification; stochastic algorithm; cancer databases
Date: 2007
Downloads: https://doi.org/10.2202/1544-6115.1312
For access to full text, subscription to the journal or payment for the individual article is required.
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:6:y:2007:i:1:n:29
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.2202/1544-6115.1312
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf