Optimal tuning of support vector machines and k-NN algorithm by using Bayesian optimization for newborn cry signal diagnosis based on audio signal processing features
Salim Lahmiri,
Chakib Tadj,
Christian Gargour and
Stelios Bekiros
Chaos, Solitons & Fractals, 2023, vol. 167, issue C
Abstract:
The number of machine learning models used to classify the cry signals of healthy and unhealthy newborns has increased significantly in recent years. Various works have already reported encouraging classification results; however, fine-tuning the hyper-parameters of machine learning algorithms remains an open problem in the context of newborn cry signal classification. This paper proposes using Bayesian optimization (BO) to optimize the hyper-parameters of a support vector machine (SVM) with a radial basis function (RBF) kernel and of a k-nearest neighbors (kNN) classifier, each trained with different audio features, separately or combined: mel-frequency cepstral coefficients (MFCC), auditory-inspired amplitude modulation (AAM), and prosody. In particular, the chi-square test is applied to each feature set to retain the ten most significant features, which are then used to train the optimal classifiers. The accuracy, sensitivity, and specificity of each experimental model are computed following the standard 10-fold cross-validation protocol. One contribution is improved performance over previous works that distinguish healthy from unhealthy newborns on the same database. The best model is the SVM trained with the ten most significant AAM features, which achieved 83.62 % ± 0.022 accuracy, 59.18 % ± 0.0469 sensitivity, and 93.87 % ± 0.0190 specificity, followed by kNN trained with the ten most significant features from MFCC, AAM, and prosody, which obtained 82.88 % ± 0.0144 accuracy, 55.34 % ± 0.0350 sensitivity, and 94.42 % ± 0.0075 specificity. These results outperform existing works validated on the same database. In addition, the optimally tuned SVM and kNN are fed a restricted number of selected patterns, so the processing time for training and testing is significantly limited. Overall, the RBF-SVM-BO classifier trained with the ten most significant AAM features is better able to distinguish between healthy and unhealthy newborns.
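The three reported metrics can be illustrated with a small sketch that computes accuracy, sensitivity, and specificity for each cross-validation fold and then summarizes them as mean and standard deviation. This is only an illustration under stated assumptions, not the authors' code: the abstract does not say which class is treated as positive or which standard-deviation estimator is used, so the sketch assumes the unhealthy class is positive (label 1) and uses the sample standard deviation. The function names `fold_metrics` and `summarize` and the toy labels are hypothetical.

```python
from statistics import mean, stdev


def fold_metrics(y_true, y_pred, positive=1):
    """Return (accuracy, sensitivity, specificity) for one fold.

    Assumption: `positive` marks the unhealthy class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Sensitivity: fraction of positive (unhealthy) samples correctly detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    # Specificity: fraction of negative (healthy) samples correctly detected.
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return accuracy, sensitivity, specificity


def summarize(per_fold):
    """Return [(mean, sample std), ...], one pair per metric, across folds."""
    columns = zip(*per_fold)
    return [(mean(col), stdev(col)) for col in columns]


# Toy labels for two folds (made up for illustration; the paper uses 10 folds).
folds = [
    ([1, 1, 0, 0, 0], [1, 0, 0, 0, 1]),
    ([1, 0, 0, 0, 0], [1, 0, 0, 0, 1]),
]
per_fold = [fold_metrics(y_true, y_pred) for y_true, y_pred in folds]
for name, (m, s) in zip(("accuracy", "sensitivity", "specificity"), summarize(per_fold)):
    print(f"{name}: {m:.4f} ± {s:.4f}")
```

A pattern of high specificity with much lower sensitivity, as in the figures reported above, means that under the positive-class assumption used here many more healthy samples than unhealthy ones are classified correctly.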
Keywords: Newborn cry; Mel-frequency cepstral coefficients; Auditory-inspired amplitude modulation; Prosody; Support vector machines; k-Nearest neighbors; Bayesian optimization
Date: 2023
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0960077922011511
Full text for ScienceDirect subscribers only
Persistent link: https://EconPapers.repec.org/RePEc:eee:chsofr:v:167:y:2023:i:c:s0960077922011511
DOI: 10.1016/j.chaos.2022.112972
Chaos, Solitons & Fractals is currently edited by Stefano Boccaletti and Stelios Bekiros
More articles in Chaos, Solitons & Fractals from Elsevier
Bibliographic data for series maintained by Thayer, Thomas R.