EconPapers    
Economics at your fingertips  
 

Khmer printed character recognition using attention-based Seq2Seq network

Rina Buoy (), Nguonly Taing, Sovisal Chenda and Sokchea Kor
Additional contact information
Rina Buoy: Techo Startup Center, Phnom Penh, Cambodia
Nguonly Taing: Techo Startup Center, Phnom Penh, Cambodia
Sovisal Chenda: Techo Startup Center, Phnom Penh, Cambodia
Sokchea Kor: Royal University of Phnom Penh, Phnom Penh, Cambodia

HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY, 2022, vol. 12, issue 1, 3-16

Abstract: This paper presents an end-to-end deep convolutional recurrent neural network solution for Khmer optical character recognition (OCR) task. The proposed solution uses a sequence-to-sequence (Seq2Seq) architecture with attention mechanism. The encoder extracts visual features from an input text-line image via layers of convolutional blocks and a layer of gated recurrent units (GRU). The features are encoded in a single context vector and a sequence of hidden states which are fed to the decoder for decoding one character at a time until a special end-of-sentence (EOS) token is reached. The attention mechanism allows the decoder network to adaptively select relevant parts of the input image while predicting a target character. The Seq2Seq Khmer OCR network is trained on a large collection of computer-generated text-line images for multiple common Khmer fonts. Complex data augmentation is applied on both train and validation dataset. The proposed model’s performance outperforms the state-of-art Tesseract OCR engine for Khmer language on the validation set of 6400 augmented images by achieving a character error rate (CER) of 0.7% vs 35.9%.

Keywords: Khmer; Optical Character Recognition; Deep Learning; Neural Network (search for similar items in EconPapers)
Date: 2022
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://journalofscience.ou.edu.vn/index.php/tech-en/article/view/2217/1678 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bjw:techen:v:12:y:2022:i:1:p:3-16

DOI: 10.46223/HCMCOUJS.tech.en.12.1.2217.2022

Access Statistics for this article

HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY is currently edited by Nguyen Thuan

More articles in HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY from HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE, HO CHI MINH CITY OPEN UNIVERSITY
Bibliographic data for series maintained by Vu Tuan Truong ().

 
Page updated 2025-03-19
Handle: RePEc:bjw:techen:v:12:y:2022:i:1:p:3-16