EconPapers    
Economics at your fingertips  
 

Amazigh Speech Recognition via Parallel CNN Transformer-Encoder Model

Mohamed Daouad (), Fadoua Ataa Allah () and El Wardani Dadi ()
Additional contact information
Mohamed Daouad: University of Abdelmalek Essaadi
Fadoua Ataa Allah: CEISIC, The Royal Institute of Amazigh Culture
El Wardani Dadi: University of Abdelmalek Essaadi

A chapter in Information Systems and Technological Advances for Sustainable Development, 2024, pp 255-263 from Springer

Abstract: Abstract Speech recognition technologies serve as a cornerstone in advancing the capabilities of artificial intelligence systems, providing a direct and efficient means of human communication. In the pursuit of fostering a more seamless and intuitive interaction between humans and automated systems, the incorporation of the Amazigh language into Automatic Speech Recognition technology holds profound implications. This integration not only contributes to the preservation of linguistic diversity but also facilitates enhanced accessibility while mitigating the complexities inherent in cross-language ASR transfer. In our research endeavor, we employ a novel approach combining convolutional neural networks with a Transformer encoder network to effectively capture spatial and temporal features for audio classification. Leveraging a dataset comprising 11,644 audio files representing 300 classes, encompassing 200 isolated words and 100 short sentences, recorded from native speakers of the Amazigh Tarifit language, our proposed model demonstrates promising results. Specifically, our model achieves a precision of 84.45% on the test set, underscoring its efficacy in accurately classifying Amazigh speech inputs.

Keywords: Amazigh language; Speech Recognition; parallel deep learning; Transformer; Convolutional Neural Networks (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-031-75329-9_28

Ordering information: This item can be ordered from
http://www.springer.com/9783031753299

DOI: 10.1007/978-3-031-75329-9_28

Access Statistics for this chapter

More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-01
Handle: RePEc:spr:lnichp:978-3-031-75329-9_28