Optimizing Breast Cancer Detection: Integrating Machine Learning with Feature Selection
Salsabila Benghazouani (),
Said Nouh and
Abdelali Zakrani
Additional contact information
Salsabila Benghazouani: Hassan II University
Said Nouh: Hassan II University
Abdelali Zakrani: ENSAM, Hassan II University
A chapter in Information Systems and Technological Advances for Sustainable Development, 2024, pp 272-282 from Springer
Abstract:
Abstract The development of early and precise breast cancer prediction strategies using computer-aided tools has significantly contributed to reducing mortality rates associated with the disease. Nevertheless, the feature selection approach remains a challenging task in identifying and characterizing cancers, particularly in clinical treatment contexts. This paper presents an in-depth experiment utilizing various feature selection techniques, including Genetic Algorithm, Recursive Feature Elimination (RFE), and SelectFromModel (SFM), to enhance accuracy and reduce dimensionality in the Wisconsin Diagnostic Breast Cancer (WDBC) dataset. These techniques are combined with machine learning techniques such as Support Vector Machine, Logistic Regression, and Gradient Boosting to predict breast cancer severity. The model’s performance is assessed using multiple measures, such as accuracy, F1-score, precision, recall, and area under the curve (AUC). The findings indicate that using a genetic algorithm for feature selection integrated with a gradient-boosting classifier produced superior results across all metrics. This strategy achieved an accuracy of 99.10%, a precision of 100.0%, a recall of 98.20%, an F1-score of 99.09%, and an AUC of 99.10%, surpassing the performance of features selected by RFE or SFM individually. These results highlight the critical role of the feature selection approach in optimizing model performance and increasing breast cancer detection accuracy.
Keywords: Classification; Feature Selection Techniques; Machine Learning Algorithms; Breast cancer (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-031-75329-9_30
Ordering information: This item can be ordered from
http://www.springer.com/9783031753299
DOI: 10.1007/978-3-031-75329-9_30
Access Statistics for this chapter
More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().