DOCUMENT LAYOUT ANALYSIS SYSTEM
Andrei Alexandru Aldea (),
Radu Gabriel Coriu (),
Ștefan-Vlad Prajică (),
Răzvan-Ștefan Brînzea () and
Costin-Anton Boiangiu ()
Additional contact information
Andrei Alexandru Aldea: Ubisoft Romania, Bucharest, Romania
Radu Gabriel Coriu: Sparkware Technologies Romania, Bucharest, Romania
Ștefan-Vlad Prajică: Tangoe Romania, Bucharest, Romania
Răzvan-Ștefan Brînzea: Politehnica University of Bucharest, Bucharest, Romania
Costin-Anton Boiangiu: Politehnica University of Bucharest, Bucharest, Romania
Journal of Information Systems & Operations Management, 2018, vol. 12, issue 2, 292-302
Abstract:
The need to process large amounts of printed physical data has led to the development of automated solutions for scanning and converting such documents into an editable text format. Following the layout analysis process, the different areas (blocks) of the document can be labeled by content - text, image, tables. Such an analysis of the document is referred to as geometric analysis. A different approach is that of a logical layout analysis, or semantic analysis, in which text blocks are labeled according to their role inside the document - titles, footnotes etc. Identifying sections correctly, numbering pages and arranging them in the correct order are standard requirements for OCR.
Date: 2018
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.rebe.rau.ro/RePEc/rau/jisomg/Wi18/JISOM-WI18-A06.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:rau:jisomg:v:12:y:2018:i:2:p:292-302
Access Statistics for this article
More articles in Journal of Information Systems & Operations Management from Romanian-American University Contact information at EDIRC.
Bibliographic data for series maintained by Alex Tabusca ().