A Hybrid Initialization and Effective Reproduction-Based Evolutionary Algorithm for Tackling Bi-Objective Large-Scale Feature Selection in Classification
Hang Xu,
Chaohui Huang,
Hui Wen,
Tao Yan,
Yuanmo Lin and
Ying Xie
Additional contact information
Hang Xu: School of Mechanical, Electrical & Information Engineering, Putian University, Putian 351100, China
Chaohui Huang: School of Mechanical, Electrical & Information Engineering, Putian University, Putian 351100, China
Hui Wen: New Engineering Industry College, Putian University, Putian 351100, China
Tao Yan: School of Mechanical, Electrical & Information Engineering, Putian University, Putian 351100, China
Yuanmo Lin: School of Mechanical, Electrical & Information Engineering, Putian University, Putian 351100, China
Ying Xie: School of Mechanical, Electrical & Information Engineering, Putian University, Putian 351100, China
Mathematics, 2024, vol. 12, issue 4, 1-24
Abstract:
Evolutionary algorithms have been widely used for tackling multi-objective optimization problems, and feature selection in classification can itself be seen as a discrete bi-objective optimization problem that minimizes both the classification error and the number of selected features. However, traditional multi-objective evolutionary algorithms (MOEAs) can encounter setbacks when the dimensionality of features explodes to a large scale, i.e., the curse of dimensionality. Thus, in this paper, we focus on designing an adaptive MOEA framework, called HIER, for solving bi-objective feature selection, especially on large-scale datasets, by adopting hybrid initialization and effective reproduction. The former attempts to improve the starting state of evolution by composing a hybrid initial population, while the latter tries to generate more effective offspring by modifying the whole reproduction process. The statistical experiment results suggest that HIER generally performs the best on most of the 20 test datasets, compared with six state-of-the-art MOEAs, in terms of multiple metrics covering both optimization and classification performance. The contribution of each component of HIER is also studied, suggesting that each of its essential components has a positive effect. Finally, the computational time complexity of HIER is analyzed, suggesting that HIER is not time-consuming and shows promising computational efficiency.
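The bi-objective formulation described in the abstract can be illustrated with a minimal Python sketch. This is not the HIER algorithm: it only shows how a candidate feature subset, encoded as a binary mask, might be scored on the two minimized objectives and how non-dominated candidates are kept. The function names and the `toy_error` model are hypothetical and stand in for a real classifier's error estimate.

```python
from typing import Callable, List, Sequence, Tuple

Mask = Tuple[int, ...]            # 1 = feature selected, 0 = feature dropped
Objectives = Tuple[float, float]  # (classification error, selected-feature ratio)


def evaluate(mask: Mask, error_of: Callable[[Mask], float]) -> Objectives:
    """Return both objectives for one candidate subset (both are minimized)."""
    if not any(mask):
        # Assumption: an empty subset cannot classify, so it gets the worst error.
        return (1.0, 0.0)
    return (error_of(mask), sum(mask) / len(mask))


def dominates(a: Objectives, b: Objectives) -> bool:
    """True if a is no worse than b on every objective and better on at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def nondominated(points: Sequence[Objectives]) -> List[Objectives]:
    """Keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def toy_error(mask: Mask) -> float:
    """Hypothetical error model: features 0 and 2 are informative, the rest add noise."""
    informative = mask[0] + mask[2]
    noise = sum(mask) - informative
    return round(0.5 - 0.2 * informative + 0.02 * noise, 2)


candidates = [(1, 0, 1, 0, 0), (1, 1, 1, 1, 1), (1, 0, 0, 0, 0), (0, 1, 0, 1, 0)]
scored = [evaluate(m, toy_error) for m in candidates]
front = nondominated(scored)
print(front)
```

In this toy setting, the mask that selects every feature is dominated by the smaller mask that keeps only the two informative features, which is the kind of trade-off a bi-objective search is meant to expose.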
Keywords: bi-objective optimization; evolutionary algorithm; effective reproduction; hybrid initialization; large-scale feature selection
JEL-codes: C
Date: 2024
Downloads: (external link)
https://www.mdpi.com/2227-7390/12/4/554/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/4/554/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:4:p:554-:d:1337725
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager.