Spoken Digit Recognition using the k-Nearest-Neighbor method and Mel Frequency Cepstral Coefficients
Sorin Muraru () and
Catalina Lucia Cocianu ()
Informatica Economica, 2024, vol. 28, issue 2, 5-16
Abstract:
This study investigates the utilization of the k-nearest-neighbor algorithm within the framework of machine learning for speech recognition applications. The AudioMNIST dataset is used for performing the evaluations in which the model predicts the spoken digit, namely from 0 to 9. Two different training-to-test percentage splits of the dataset are used, 70%-30% and 80%-20%, while the k parameter ranges from 1 to 12. To better adapt the predic-tion model, the Mel-frequency cepstrum coefficients are extracted from each audio sample, and the 13 filters are averaged over 25 ms frame windows with 10 ms frame overlap. In both training-to-test configurations the value for the k parameter that obtained the highest accu-racy (> 95%) is k=5, while the easiest to predict digits was “7†. These findings underscore the efficacy of k-nearest-neighbor in speech recognition tasks and highlight the importance of parameter selection and feature extraction techniques in optimizing model performance. Further exploration of kNN's applicability in diverse speech recognition contexts holds promise for advancing the field's understanding and practical implementations.
Keywords: k-nearest neighbor; Machine learning; MFCC; Speech recognition; Natural language processing (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://revistaie.ase.ro/content/110/01%20-%20muraru,%20cocianu.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:aes:infoec:v:28:y:2024:i:2:p:5-16
Access Statistics for this article
Informatica Economica is currently edited by Ion Ivan
More articles in Informatica Economica from Academy of Economic Studies - Bucharest, Romania Contact information at EDIRC.
Bibliographic data for series maintained by Paul Pocatilu ().