A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website
ShaoLin Zhu,
Xiao Li,
YaTing Yang,
Lei Wang and
ChengGang Mi
Mathematical Problems in Engineering, 2019, vol. 2019, 1-7
Abstract:
Machine translation needs a large number of parallel sentence pairs to make sure of having a good translation performance. However, the lack of parallel corpus heavily limits machine translation for low-resources language pairs. We propose a novel method that combines the continuous word embeddings with deep learning to obtain parallel sentences. Since parallel sentences are very invaluable for low-resources language pair, we introduce cross-lingual semantic representation to induce bilingual signals. Our experiments show that we can achieve promising results under lacking external resources for low-resource languages. Finally, we construct a state-of-the-art machine translation system in low-resources language pair.
Date: 2019
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/MPE/2019/7495436.pdf (application/pdf)
http://downloads.hindawi.com/journals/MPE/2019/7495436.xml (text/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jnlmpe:7495436
DOI: 10.1155/2019/7495436
Access Statistics for this article
More articles in Mathematical Problems in Engineering from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().