Automatic Extraction of English-Chinese Translation Templates Based on Deep Learning
Zhaofeng Dong and
Naeem Jan
Mathematical Problems in Engineering, 2022, vol. 2022, 1-9
Abstract:
Translation templates are an important cause of knowledge in machine translation (MT) systems. Their quality and scale directly influence the performance of MT systems. How to obtain high-quality and efficient translation templates from corpora has become a hot topic in recent study. In this paper, a tree to String alignment template (TAT) based on syntactic structure is proposed. This template describes the alignment between the source language syntax tree and the target language string. The syntactic structure, a large number of construction tags, and variables are introduced into the template, which enables the syntactic model to deal with discontinuous phrases and has the ability of generalization. Templates can be used in syntactic statistics, case-based, and rule-based MT systems according to different decoders. ATTEBSC algorithm is a basic method to learn translation templates by comparing sentence pairs. It demands that sentence pairs be constructed in a precise comparison structure ahead of time, but there are no strict guidelines on how to do it. In this paper, we propose a method to calculate the specific comparison scheme using the longest common subsequence (LCS) and use the normalized LCS distance to screen sentences with high similarity and then use the ATTEBSC algorithm to automatically remove the template. Experiments show that this method is easy and effective, and many expensive templates can be learned.
Date: 2022
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/mpe/2022/9349657.pdf (application/pdf)
http://downloads.hindawi.com/journals/mpe/2022/9349657.xml (application/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jnlmpe:9349657
DOI: 10.1155/2022/9349657
Access Statistics for this article
More articles in Mathematical Problems in Engineering from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().