MULTI-CLASS SPOKEN LANGUAGE DETECTION USING ARTIFICIAL INTELLIGENCE WITH FRACTAL AL-BIRUNI EARTH RADIUS OPTIMIZATION ALGORITHM
Najla I. Al-Shathry,
Majdy M. Eltahir,
Somia A. Asklany,
Sami A. Al Ghamdi,
Abdullah Almuhaimeed,
Fuhid Alanazi,
Abdelmoneim Ali Mohamed and
Mohammed Rizwanullah
Additional contact information
Najla I. Al-Shathry: Department of Language Preparation, Arabic Language Teaching Institute, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
Majdy M. Eltahir: ��Department of Information Systems, Applied College at Mahayil, King Khalid University, Asir, Abha, Saudi Arabia
Somia A. Asklany: ��Department of Computer Science and Information Technology, Faculty of Sciences and Arts in Turaif, Northern Border University, Arar 91431, Saudi Arabia
Sami A. Al Ghamdi: �Department of Computer Science, Faculty of Computing and Information, Al-Baha University Alaqiq, Saudi Arabia
Abdullah Almuhaimeed: �Digital Health Institute, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
Fuhid Alanazi: ��Department of Information Systems, Faculty of Computer and Information Systems, Islamic University of Madinah, Medina 42351, Saudi Arabia
Abdelmoneim Ali Mohamed: *Department of Information Systems, College of Computer and Information Sciences, Majmaah University, Al-Majmaah 11952, Saudi Arabia
Mohammed Rizwanullah: ��†Department of Computer and Self Development, Preparatory Year Deanship Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia
FRACTALS (fractals), 2024, vol. 32, issue 09n10, 1-13
Abstract:
Spoken Language Identification (SLID) is the problem of categorizing the language spoken by a speaker in the audio clips. SLID is valuable in multi-language speech recognition systems, personalized voice assistants, and automated speech translation systems in call centers to automatically route calls to the language operator. A primary challenge is the language detection from audio with different noise levels and sampling rates, accurately and with a short delay. A further problem is to differentiate between short-duration languages. Previous research works have applied SLID’s lexical, phonetic, phonotactic, and prosodic features. Spoken language detection using deep learning (DL) usually includes training RNN or CNN approaches on audio features such as spectrograms or MFCCs to categorize the language spoken in audio samples. Pioneering methodologies, such as CNN–RNN transformers or hybrids, can capture the spatial and temporal features for better performance. This paper presents a Multi-Class Spoken Language Detection using Artificial Intelligence with Fractal Al-Biruni Earth Radius Optimization (MCSLD-AIBER) technique. The MCSLD-AIBER technique mainly aims to identify the various classes of spoken languages. In the MCSLD-AIBER technique, the Constant-Q Transform (CQT) approach is applied to transform the speech signals. Additionally, the MCSLD-AIBER technique employs Inception with a Residual Network model for the feature extraction process. Moreover, the hyperparameters can be adjusted using the BER approach. A long short-term memory (LSTM) network can be utilized to identify multiple spoken languages. A set of experiments were involved to illustrate the efficient performance of the MCSLD-AIBER technique. The simulation outcomes indicated that the MCSLD-AIBER method performs optimally over other models.
Keywords: Spoken Language Detection; Artificial Intelligence; Constant-Q Transform; Hyperparameter Selection; Feature Extraction; Fractal Optimization; Complex Systems (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0218348X25400547
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:fracta:v:32:y:2024:i:09n10:n:s0218348x25400547
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0218348X25400547
Access Statistics for this article
FRACTALS (fractals) is currently edited by Tara Taylor
More articles in FRACTALS (fractals) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().