Towards Building an Arabic Plagiarism Detection System: Plagiarism Detection in Arabic
Imtiaz Hussain Khan, Muazzam Ahmed Siddiqui and Kamal M. Jambi
Additional contact information
Imtiaz Hussain Khan: King Abdulaziz University, Jeddah, Saudi Arabia
Muazzam Ahmed Siddiqui: King Abdulaziz University, Jeddah, Saudi Arabia
Kamal M. Jambi: King Abdulaziz University, Jeddah, Saudi Arabia
International Journal of Information Retrieval Research (IJIRR), 2019, vol. 9, issue 3, 12-22
Abstract:
This article describes a plagiarism detection system for the Arabic language that combines several similarity-measure techniques to uncover plagiarism in Arabic documents. The proposed system has two main components: document retrieval and detailed similarity analysis. The document-retrieval component generates queries from a given suspicious document and uses the Google search API to retrieve candidate source documents from the Web. The similarity-analysis component takes each source document in turn and attempts to identify the plagiarized parts of the suspicious document. The proposed system is thoroughly evaluated using an indigenous corpus. At the document-retrieval level, the system achieves an F-score above 75%; at the detailed similarity-computation level, the F-score is above 70%.
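The two-stage design in the abstract can be illustrated with a minimal sketch. The abstract does not say how queries are built or which similarity measures are combined, so everything below is an assumption made for illustration: the function names, the "longest words per sentence" query heuristic, and the word-trigram Jaccard overlap are hypothetical stand-ins, not the authors' method. The Web search step is left out entirely. Only the F-score formula (the harmonic mean of precision and recall) is standard.

```python
import re


def tokenize(text):
    # Unicode-aware word split. Python's \w matches Arabic letters, but not
    # combining diacritics, so this assumes undiacritized text.
    return re.findall(r"\w+", text)


def generate_queries(document, words_per_query=5):
    """Hypothetical query generator: one query per sentence, built from
    that sentence's longest words (a crude proxy for distinctive terms)."""
    sentences = [s for s in re.split(r"[.!?؟\n]+", document) if s.strip()]
    queries = []
    for sentence in sentences:
        words = sorted(tokenize(sentence), key=len, reverse=True)
        queries.append(" ".join(words[:words_per_query]))
    return queries


def ngrams(tokens, n=3):
    # Set of word n-grams; empty if the text has fewer than n tokens.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard_similarity(suspicious, source, n=3):
    """Assumed similarity measure: Jaccard overlap of word n-gram sets."""
    a = ngrams(tokenize(suspicious), n)
    b = ngrams(tokenize(source), n)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def f_score(precision, recall):
    """Balanced F-score: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

In a full pipeline, each string returned by `generate_queries` would be sent to a search service to collect candidate sources, and `jaccard_similarity` would then be applied to passages of the suspicious document against each candidate. A real system for Arabic would likely also need normalization steps (for example, diacritic removal and stemming) that this sketch omits.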
Date: 2019
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/IJIRR.2019070102 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:igg:jirr00:v:9:y:2019:i:3:p:12-22
International Journal of Information Retrieval Research (IJIRR) is currently edited by Zhongyu Lu