EconPapers    
Economics at your fingertips  
 

Robust Voice Activity Detection with Deep Maxout Neural Networks

Valentin Sergeyevich Mendelev, Tatiana Nikolaevna Prisyach and Alexey Alexandrovich Prudnikov

Modern Applied Science, 2015, vol. 9, issue 8, 153

Abstract: Voice activity detection (VAD) under non-stationary noises is a very important task to solve when using a real-life system of automatic speech recognition, especially if a remote microphone is used. Many existing methods do not work well with noise that changes over time or with very low signal-to-noise ratio (SNR). This paper proposes a method based on deep maxout neural networks with dropout regularization. The method is effective even for very low SNR (up to -5dB). The robustness of the method is demonstrated by low FR/FA error rates on a test dataset that was recorded under conditions different from the training dataset.

Date: 2015
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://ccsenet.org/journal/index.php/mas/article/download/51796/27799 (application/pdf)
https://ccsenet.org/journal/index.php/mas/article/view/51796 (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ibn:masjnl:v:9:y:2015:i:8:p:153

Access Statistics for this article

More articles in Modern Applied Science from Canadian Center of Science and Education Contact information at EDIRC.
Bibliographic data for series maintained by Canadian Center of Science and Education ().

 
Page updated 2025-03-19
Handle: RePEc:ibn:masjnl:v:9:y:2015:i:8:p:153