Voice Spoofing Countermeasure Based on Spectral Features to Detect Synthetic Attacks Through LSTM
Gulam Qadir (),
Saima Zareen,
Farman Hassan and
Auliya Ur Rahman
Additional contact information
Gulam Qadir: University of Engineering and Technology Taxila, Punjab Pakistan
Saima Zareen: University of Engineering and Technology Taxila, Punjab Pakistan
Farman Hassan: University of Engineering and Technology Taxila, Punjab Pakistan
Auliya Ur Rahman: University of Engineering and Technology Taxila, Punjab Pakistan
International Journal of Innovations in Science & Technology, 2022, vol. 3, issue 5, 153-165
Abstract:
With the growing number of voice-controlled devices, it is necessary to address the potential vulnerabilities of Automatic Speaker Verification (ASV) against voice spoofing attacks such as Physical Access (PA) and Logical Access (LA) attacks. To improve the reliability of ASV systems, researchers have developed various voice spoofing countermeasures. However, it is hard for the voice anti-spoofing systems to effectively detect the synthetic speech attacks that are generated through powerful spoofing algorithms and have quite different statistical distributions. More importantly, the speedy improvement of voice spoofing structures is producing the most effective attacks that make ASV structures greater vulnerable to stumble on those voice spoofing assaults. In this paper, we proposed a unique voice spoofing countermeasure which is successful to hit upon the LA attacks (i.e., artificial speech and transformed speech) and classify the spoofing structures by the usage of Long Short-Term Reminiscence (LSTM). The novel set of spectral features i.e., Mel-Frequency Cepstral Coefficients (MFCC), Gammatone Cepstral Coefficients (GTCC), and spectral centroid are capable to seize maximum alterations present in the cloned audio. The proposed system achieved remarkable accuracy of 98.93%, precision of 100%, recall of 92.32%, F1-score of 96.01%, and an Equal Error Rate (EER) of 1.30%. Our method achieved 8.5% and 7.02% smaller EER than the baseline methods such as Constant-Q Cepstral Coefficients (CQCC) using Gaussian Mixture Model (GMM) and Linear Frequency Cepstral Coefficients (LFCC) using GMM, respectively. We evaluated the performance of the proposed system on the standard dataset i.e., ASVspoof2019 LA. Experimental results and comparative analysis with other existing state-of-the-art methods illustrate that our method is reliable and effective to be used for the detection of voice spoofing attacks.
Keywords: ASVspoof 2019 LA dataset; Deep Learning; Spoofing countermeasure; Synthetic Speech; Voice anti-spoofing (search for similar items in EconPapers)
Date: 2022
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journal.50sea.com/index.php/IJIST/article/view/124/580 (application/pdf)
https://journal.50sea.com/index.php/IJIST/article/view/124 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:abq:ijist1:v:3:y:2022:i:5:p:153-165
DOI: 10.33411/IJIST/2021030512
Access Statistics for this article
International Journal of Innovations in Science & Technology is currently edited by Prof. Dr. Veraldo Lisenberg, Prof Dr. Ali Iqtedar Mirza
More articles in International Journal of Innovations in Science & Technology from 50sea
Bibliographic data for series maintained by Hafiz Haroon Ahmad, Iqra Nazeer ().