Robust Voice Activity Detection with Deep Maxout Neural Networks
Valentin Sergeyevich Mendelev,
Tatiana Nikolaevna Prisyach and
Alexey Alexandrovich Prudnikov
Modern Applied Science, 2015, vol. 9, issue 8, 153
Abstract:
Voice activity detection (VAD) under non-stationary noises is a very important task to solve when using a real-life system of automatic speech recognition, especially if a remote microphone is used. Many existing methods do not work well with noise that changes over time or with very low signal-to-noise ratio (SNR). This paper proposes a method based on deep maxout neural networks with dropout regularization. The method is effective even for very low SNR (up to -5dB). The robustness of the method is demonstrated by low FR/FA error rates on a test dataset that was recorded under conditions different from the training dataset.
Date: 2015
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://ccsenet.org/journal/index.php/mas/article/download/51796/27799 (application/pdf)
https://ccsenet.org/journal/index.php/mas/article/view/51796 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ibn:masjnl:v:9:y:2015:i:8:p:153
Access Statistics for this article
More articles in Modern Applied Science from Canadian Center of Science and Education Contact information at EDIRC.
Bibliographic data for series maintained by Canadian Center of Science and Education ().