EconPapers    
Economics at your fingertips  
 

Enhancing the Accuracy of Image Classification for Degenerative Brain Diseases with CNN Ensemble Models Using Mel-Spectrograms

Sang-Ha Sung, Michael Pokojovy, Do-Young Kang, Woo-Yong Bae, Yeon-Jae Hong and Sangjin Kim ()
Additional contact information
Sang-Ha Sung: Department of Management Information Systems, Dong-A University, Busan 49236, Republic of Korea
Michael Pokojovy: Department of Mathematics and Statistics, Old Dominion University, Norfolk, VA 23529, USA
Do-Young Kang: Department of Nuclear Medicine, Dong-A University Medical Center, Busan 49201, Republic of Korea
Woo-Yong Bae: Department of Nuclear Medicine, Dong-A University Medical Center, Busan 49201, Republic of Korea
Yeon-Jae Hong: Department of Science Education, Ewha Womans University, Seoul 03760, Republic of Korea
Sangjin Kim: Department of Management Information Systems, Dong-A University, Busan 49236, Republic of Korea

Mathematics, 2025, vol. 13, issue 13, 1-18

Abstract: Alzheimer’s disease (AD) and Parkinson’s disease (PD) are prevalent neurodegenerative disorders among the elderly, leading to cognitive decline and motor impairments. As the population ages, the prevalence of these neurodegenerative disorders is increasing, providing motivation for active research in this area. However, most studies are conducted using brain imaging, with relatively few studies utilizing voice data. Using voice data offers advantages in accessibility compared to brain imaging analysis. This study introduces a novel ensemble-based classification model that utilizes Mel spectrograms and Convolutional Neural Networks (CNNs) to distinguish between healthy individuals (NM), AD, and PD patients. A total of 700 voice samples were collected under standardized conditions, ensuring data reliability and diversity. The proposed ternary classification algorithm integrates the predictions of binary CNN classifiers through a majority voting ensemble strategy. ResNet, DenseNet, and EfficientNet architectures were employed for model development. The experimental results show that the ensemble model based on ResNet achieves a weighted F1 score of 91.31%, demonstrating superior performance compared to existing approaches. To the best of our knowledge, this is the first large-scale study to perform three-class classification of neurodegenerative diseases using voice data.

Keywords: Alzheimer’s disease; convolution neural networks; ensemble classification; Parkinson’s disease; voice data (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/13/13/2100/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/13/2100/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:13:p:2100-:d:1688201

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-06-27
Handle: RePEc:gam:jmathe:v:13:y:2025:i:13:p:2100-:d:1688201