Articulatory-to-Acoustic Conversion of Mandarin Emotional Speech Based on PSO-LSSVM
Guofeng Ren, Jianmei Fu, Guicheng Shao, Yanqin Xun and Wei Wang
Complexity, 2021, vol. 2021, 1-10
Abstract:
The production of emotional speech is determined by the movement of the speaker’s tongue, lips, and jaw. To combine speakers’ articulatory and acoustic data, articulatory-to-acoustic conversion of emotional speech is studied. In this paper, the parameters of a least-squares support vector machine (LSSVM) model are optimized with particle swarm optimization (PSO), and the resulting PSO-LSSVM model is applied to articulatory-to-acoustic conversion. Conversion quality is evaluated with the root mean square error (RMSE) and the mean Mel-cepstral distortion (MMCD): the MMCD of the MFCCs is 1.508 dB, and the RMSE of the second formant (F2) is 25.10 Hz. These results can be further applied to feature fusion in emotional speech recognition to improve recognition accuracy.
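The two evaluation metrics named in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes the common MCD definition (10/ln 10) · sqrt(2 · Σ_d (c_d − c′_d)²) averaged over frames, whereas the paper may scale differently or exclude the 0th cepstral coefficient.

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean square error, e.g. between predicted and reference F2 tracks (Hz)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def mmcd(mfcc_true, mfcc_pred):
    """Mean Mel-cepstral distortion in dB over frames (arrays of shape [frames, coeffs]).

    Uses the common definition MCD = (10 / ln 10) * sqrt(2 * sum_d (c_d - c'_d)^2),
    averaged over all frames; the paper's exact convention is an assumption here.
    """
    diff = np.asarray(mfcc_true, dtype=float) - np.asarray(mfcc_pred, dtype=float)
    per_frame = (10.0 / np.log(10.0)) * np.sqrt(2.0 * np.sum(diff ** 2, axis=1))
    return float(np.mean(per_frame))
```

Under these conventions, identical MFCC matrices give an MMCD of 0 dB, and the reported 1.508 dB would correspond to a small per-coefficient cepstral difference averaged across frames.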
Date: 2021
Downloads:
http://downloads.hindawi.com/journals/complexity/2021/8876005.pdf (application/pdf)
http://downloads.hindawi.com/journals/complexity/2021/8876005.xml (application/xml)
Persistent link: https://EconPapers.repec.org/RePEc:hin:complx:8876005
DOI: 10.1155/2021/8876005
More articles in Complexity from Hindawi