FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR

Tao, Xinyue; Han, Yueyue; Jin, Yakai; Wu, Yunzhi

FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR

Xinyue Tao, Yueyue Han, Yakai Jin and Yunzhi Wu ()
Additional contact information
Xinyue Tao: School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China
Yueyue Han: School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China
Yakai Jin: School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China
Yunzhi Wu: School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China

Mathematics, 2025, vol. 13, issue 15, 1-19

Abstract: Chinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to systematic recognition errors that limit practical deployment accuracy. This study develops FLIP (Feedback Learning-based Intelligent Plugin), a lightweight post-processing plugin designed to improve Chinese OCR accuracy across different systems without external dependencies. The plugin operates through three core components as follows: UTF-8 encoding-based output parsing that converts OCR results into mathematical representations, error correction using information entropy and weighted similarity measures to identify and fix character-level errors, and adaptive feedback learning that optimizes parameters through user interactions. The approach functions entirely through mathematical calculations at the character encoding level, ensuring universal compatibility with existing OCR systems while effectively handling complex Chinese character similarities. The plugin’s modular design enables seamless integration without requiring modifications to existing OCR algorithms, while its feedback mechanism adapts to domain-specific terminology and user preferences. Experimental evaluation on 10,000 Chinese document images using four state-of-the-art OCR models demonstrates consistent improvements across all tested systems, with precision gains ranging from 1.17% to 10.37% and overall Chinese character recognition accuracy exceeding 98%. The best performing model achieved 99.42% precision, with ablation studies confirming that feedback learning contributes additional improvements from 0.45% to 4.66% across different OCR architectures.

Keywords: optical character recognition; post-processing; text recognition; machine learning (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/13/15/2372/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/15/2372/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:15:p:2372-:d:1708967

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().