OCR QUALITY IMPROVEMENT USING IMAGE PREPROCESSING
Vlad Badoiu (),
Andrei-Constantin Ciobanu () and
Sergiu Craitoiu ()
Additional contact information
Vlad Badoiu: ”Politehnica” University of Bucharest, Bucharest, Romania
Andrei-Constantin Ciobanu: ”Politehnica” University of Bucharest, Bucharest, Romania
Sergiu Craitoiu: ”Politehnica” University of Bucharest, Bucharest, Romania
Journal of Information Systems & Operations Management, 2016, vol. 10, issue 1, 240-252
Abstract:
Optical character recognition (OCR) remains a difficult problem for noisy documents or documents scanned at low resolution. Many current approaches rely on stored font models that are vulnerable to cases in which the document is noisy or is written in a font dissimilar to the stored fonts. In this paper we test two approaches for preprocessing, or correcting the input images. The focus is on noise reduction, lightness correction and binarization, all relative to found letters with a slow but more accurate method and a fast and less accurate method. We then compare the results and see if the extra time spent in developing more complex letter deduction technique offers significant improvements.
Date: 2016
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.rebe.rau.ro/RePEc/rau/jisomg/SU16/JISOM-SU16-A23.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:rau:jisomg:v:10:y:2016:i:1:p:240-252
Access Statistics for this article
More articles in Journal of Information Systems & Operations Management from Romanian-American University Contact information at EDIRC.
Bibliographic data for series maintained by Alex Tabusca ().