A Solution for Developing Corpora for Polish Speech Enhancement in Complex Acoustic Environments
Mariusz Kleć (),
Krzysztof Szklanny () and
Alicja Wieczorkowska ()
Additional contact information
Mariusz Kleć: Polish-Japanese Academy of Information Technology
Krzysztof Szklanny: Polish-Japanese Academy of Information Technology
Alicja Wieczorkowska: Polish-Japanese Academy of Information Technology
A chapter in Advances in Information Systems Development, 2025, pp 125-143 from Springer
Abstract:
Abstract This paper presents a solution for generating corpora of simulated Polish speech recordings in complex acoustic environments. The proposed method introduces a layer of unpredictable sound events, in addition to the acoustic scene noise and reverberation, making the solution unique. Each sound layer is stored in separate files, allowing users to mute specific layers selectively via phase cancellation. We applied this technique for data augmentation and trained two speech enhancement models. Experimental results show that the models trained with our data augmentation strategy effectively generalize across various background noise complexities. Moreover, we highlight the crucial role of integrating speech enhancement methods within the speech separation pipeline in conditions characterized by diverse background noises. Our publicly available code allows researchers to create their corpora tailored to the Polish language and train speech enhancement or separation models.
Keywords: Speech denoising; Speech separation; Speech enhancement (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:lnichp:978-3-031-87880-0_7
Ordering information: This item can be ordered from
http://www.springer.com/9783031878800
DOI: 10.1007/978-3-031-87880-0_7
Access Statistics for this chapter
More chapters in Lecture Notes in Information Systems and Organization from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().