EconPapers    
Economics at your fingertips  
 

Microarray cancer classification using feature extraction-based ensemble learning method

Anita Bai and Swati Hira

International Journal of Data Analysis Techniques and Strategies, 2021, vol. 13, issue 3, 244-263

Abstract: Microarray cancer datasets generally contain many features with a small number of samples, so initially we need to reduce redundant features to allow faster convergence. To address this issue, we proposed a novel feature extraction-based ensemble classification technique using support vector machine (SVM) which classifying microarray cancer data and helps to build intelligent systems for early cancer detection. Novelty of the proposed approach is described by classifying cancer data as follows: a) we extracted information by reducing the size of larger dataset using various feature selection techniques, such as, principal component analysis (PCA), chi-square, genetic algorithm (GA) and F-score; b) classifying extracted information in two samples as normal and malignant classes using majority voting ensemble SVM. In SVM ensemble-based approach we use different SVM kernels, like, linear, polynomial, radial basis function (RBF), and sigmoid. The calculated results of particular kernels are combined using majority voting approach. The effectiveness of the algorithm is validated on six benchmark cancer datasets viz. colon, ovarian, leukaemia, breast, lung and prostate using ensemble SVM classification.

Keywords: cancer classification; support vector machine; SVM; principal component analysis; PCA; genetic algorithm; F-score; chi-square. (search for similar items in EconPapers)
Date: 2021
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.inderscience.com/link.php?id=118014 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ids:injdan:v:13:y:2021:i:3:p:244-263

Access Statistics for this article

More articles in International Journal of Data Analysis Techniques and Strategies from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().

 
Page updated 2025-03-19
Handle: RePEc:ids:injdan:v:13:y:2021:i:3:p:244-263