Machine Learning Based Taxonomy and Analysis of English Learners' Translation Errors
Ying Qin
Ying Qin: Beijing Foreign Studies University, Beijing, China
International Journal of Computer-Assisted Language Learning and Teaching (IJCALLT), 2019, vol. 9, issue 3, 68-83
Abstract:
This study extracts the comments from a large-scale corpus of translations by Chinese EFL learners in order to study the taxonomy of translation errors. Two unsupervised machine learning approaches are used to obtain computational evidence for a translation error taxonomy. After manual revision, ten types of English-to-Chinese (E2C) and eight types of Chinese-to-English (C2E) translation errors are confirmed. The hierarchical clustering results suggest that there are probably three top-level categories of errors. In addition, three supervised learning methods are applied to recognize error types automatically; the best performance reaches F1 = 0.85 on E2C and F1 = 0.90 on C2E translation. Further comparison with intuitive or theoretical studies of translation taxonomy reveals some phenomena that accompany the improvement of Chinese learners' language skills. Machine-learning-based analysis of translation problems provides objective insight into students' translations.
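As a point of reference for the F1 figures reported in the abstract, the following is a minimal sketch of how per-class and macro-averaged F1 can be computed from gold and predicted error-type labels. The abstract does not say how its F1 scores were averaged, so macro averaging here is an assumption; the function names and the three error-type labels are hypothetical and do not come from the paper.

```python
from collections import Counter


def per_class_f1(gold, predicted):
    """Return {label: F1} for every label seen in gold or predicted."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1          # correct prediction for label g
        else:
            fp[p] += 1          # p was predicted but is wrong
            fn[g] += 1          # g was the true label but was missed
    scores = {}
    for label in set(gold) | set(predicted):
        predicted_count = tp[label] + fp[label]
        gold_count = tp[label] + fn[label]
        precision = tp[label] / predicted_count if predicted_count else 0.0
        recall = tp[label] / gold_count if gold_count else 0.0
        denom = precision + recall
        # F1 is the harmonic mean of precision and recall.
        scores[label] = 2 * precision * recall / denom if denom else 0.0
    return scores


def macro_f1(gold, predicted):
    """Unweighted mean of the per-class F1 scores (assumed averaging scheme)."""
    scores = per_class_f1(gold, predicted)
    return sum(scores.values()) / len(scores)


# Hypothetical error-type labels, invented for illustration only.
gold = ["word_choice", "word_choice", "omission", "omission", "grammar", "grammar"]
predicted = ["word_choice", "omission", "omission", "omission", "grammar", "word_choice"]

if __name__ == "__main__":
    for label, score in sorted(per_class_f1(gold, predicted).items()):
        print(f"{label}: F1 = {score:.2f}")
    print(f"macro F1 = {macro_f1(gold, predicted):.2f}")
```

Macro averaging weights every error type equally, which matters when some types are rare; a micro-averaged score would instead be dominated by the most frequent types.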
Date: 2019
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 8/IJCALLT.2019070105 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:igg:jcallt:v:9:y:2019:i:3:p:68-83
International Journal of Computer-Assisted Language Learning and Teaching (IJCALLT) is currently edited by Bin Zou