EconPapers    
Economics at your fingertips  
 

A Semi-Automatic Framework for Practical Transcription of Foreign Person Names in Lithuanian

Gailius Raškinis, Darius Amilevičius, Danguolė Kalinauskaitė, Artūras Mickus, Daiva Vitkutė-Adžgauskienė, Antanas Čenys and Tomas Krilavičius ()
Additional contact information
Gailius Raškinis: Faculty of Informatics, Vytautas Magnus University, Universiteto str. 10–202, Kaunas District, 53361 Akademija, Lithuania
Darius Amilevičius: Faculty of Informatics, Vytautas Magnus University, Universiteto str. 10–202, Kaunas District, 53361 Akademija, Lithuania
Danguolė Kalinauskaitė: Faculty of Informatics, Vytautas Magnus University, Universiteto str. 10–202, Kaunas District, 53361 Akademija, Lithuania
Artūras Mickus: Faculty of Informatics, Vytautas Magnus University, Universiteto str. 10–202, Kaunas District, 53361 Akademija, Lithuania
Daiva Vitkutė-Adžgauskienė: Faculty of Informatics, Vytautas Magnus University, Universiteto str. 10–202, Kaunas District, 53361 Akademija, Lithuania
Antanas Čenys: Department of Information Systems, Vilnius Gediminas Technical University, 10223 Vilnius, Lithuania
Tomas Krilavičius: Faculty of Informatics, Vytautas Magnus University, Universiteto str. 10–202, Kaunas District, 53361 Akademija, Lithuania

Mathematics, 2025, vol. 13, issue 13, 1-23

Abstract: We present a semi-automatic framework for transcribing foreign personal names into Lithuanian, aimed at reducing pronunciation errors in text-to-speech systems. Focusing on noisy, web-crawled data, the pipeline combines rule-based filtering, morphological normalization, and manual stress annotation—the only non-automated step—to generate training data for character-level transcription models. We evaluate three approaches: a weighted finite-state transducer (WFST), an LSTM-based sequence-to-sequence model with attention, and a Transformer model optimized for character transduction. Results show that word-pair models outperform single-word models, with the Transformer achieving the best performance (19.04% WER) on a cleaned and augmented dataset. Data augmentation via word order reversal proved effective, while combining single-word and word-pair training offered limited gains. Despite filtering, residual noise persists, with 54% of outputs showing some error, though only 11% were perceptually significant.

Keywords: practical transcription; character-level transduction; sequence-to-sequence learning; web-crawled data; Lithuanian (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/13/13/2107/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/13/2107/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:13:p:2107-:d:1688818

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-06-28
Handle: RePEc:gam:jmathe:v:13:y:2025:i:13:p:2107-:d:1688818