Unit-Selection Speech Synthesis Method Using Words as Search Units
Hiroyuki Segi
Additional contact information
Hiroyuki Segi: Department of Computer and Information Science, Seikei University, Tokyo, Japan
International Journal of Multimedia Data Engineering and Management (IJMDEM), 2016, vol. 7, issue 2, 1-15
Abstract:
Unit-selection speech-synthesis systems have been proposed. In most of the unit-selection speech-synthesis systems, search units are rather short such as syllables, phonemes and diphones. However, when applied to large speech databases, shorter units produce more voice-waveform candidates and a larger speech database cannot be used without narrow pruning for practical use. Narrow pruning impairs the quality of the synthesized speech. Here the author examined the possibility of using words as search units. Subjective evaluations indicated that 70% of the speech synthesized by the proposed method sounded more natural than that synthesized by a conventional method. The five-point mean opinion score of the synthesized speech was 3.5, and 21% was judged to sound as natural as human speech. These results demonstrate the effectiveness of unit-selection speech synthesis using words as search units.
Date: 2016
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/IJMDEM.2016040104 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jmdem0:v:7:y:2016:i:2:p:1-15
Access Statistics for this article
International Journal of Multimedia Data Engineering and Management (IJMDEM) is currently edited by Chengcui Zhang
More articles in International Journal of Multimedia Data Engineering and Management (IJMDEM) from IGI Global
Bibliographic data for series maintained by Journal Editor ().