Advanced Applications on Bilingual Document Analysis and Processing Systems
Shalini Puri and
Satya Prakash Singh
Additional contact information
Shalini Puri: BIT, Mesra, Ranchi, India
Satya Prakash Singh: BIT, Mesra, Ranchi, Jharkhand, India
International Journal of Applied Metaheuristic Computing (IJAMC), 2020, vol. 11, issue 4, 149-193
Abstract:
Today, rapid digitization requires efficient bilingual non-image and image document classification systems. Although many bilingual NLP and image-based systems provide solutions for real-world problems, they primarily focus on text extraction, identification, and recognition tasks with limited document types. This article discusses a journey of these systems and provides an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs. The gaps found lead toward the idea of a generic and integrated bilingual English-Hindi document classification system, which classifies heterogeneous documents using a dual class feeder and two character corpora. Its non-image and image modules include pre- and post-processing stages and pre-and post-segmentation stages to classify documents into predefined classes. This article discusses many real-life applications on societal and commercial issues. The analytical results show important findings of existing and proposed systems.
Date: 2020
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/IJAMC.2020100108 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jamc00:v:11:y:2020:i:4:p:149-193
Access Statistics for this article
International Journal of Applied Metaheuristic Computing (IJAMC) is currently edited by Peng-Yeng Yin
More articles in International Journal of Applied Metaheuristic Computing (IJAMC) from IGI Global
Bibliographic data for series maintained by Journal Editor ().