A Cascade Deep Forest Model for Breast Cancer Subtype Classification Using Multi-Omics Data
Ala’a El-Nabawy,
Nahla A. Belal and
Nashwa El-Bendary
Additional contact information
Ala’a El-Nabawy: Orange Labs., Smart Village 12511, Giza Governorate, Egypt
Nahla A. Belal: College of Computing and Information Technology, Arab Academy for Science, Technology, and Maritime Transport, Smart Village, Giza 12577, Egypt
Nashwa El-Bendary: College of Computing and Information Technology, Arab Academy for Science, Technology, and Maritime Transport, Smart Village, Giza 12577, Egypt
Mathematics, 2021, vol. 9, issue 13, 1-14
Abstract:
Automated diagnosis systems aim to reduce the cost of diagnosis while maintaining the same efficiency. Many methods have been used for breast cancer subtype classification. Some use single data source, while others integrate many data sources, the case that results in reduced computational performance as opposed to accuracy. Breast cancer data, especially biological data, is known for its imbalance, with lack of extensive amounts of histopathological images as biological data. Recent studies have shown that cascade Deep Forest ensemble model achieves a competitive classification accuracy compared with other alternatives, such as the general ensemble learning methods and the conventional deep neural networks (DNNs), especially for imbalanced training sets, through learning hyper-representations through using cascade ensemble decision trees. In this work, a cascade Deep Forest is employed to classify breast cancer subtypes, IntClust and Pam50, using multi-omics datasets and different configurations. The results obtained recorded an accuracy of 83.45% for 5 subtypes and 77.55% for 10 subtypes. The significance of this work is that it is shown that using gene expression data alone with the cascade Deep Forest classifier achieves comparable accuracy to other techniques with higher computational performance, where the time recorded is about 5 s for 10 subtypes, and 7 s for 5 subtypes.
Keywords: METABRIC dataset; breast cancer subtyping; deep forest; multi-omics data (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/9/13/1574/pdf (application/pdf)
https://www.mdpi.com/2227-7390/9/13/1574/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:9:y:2021:i:13:p:1574-:d:588273
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().