Arabic Text Mining Using Rule Based Classification
Fadi Thabtah (),
Omar Gharaibeh () and
Rashid Al-Zubaidy ()
Additional contact information
Fadi Thabtah: MIS Department, Philadelphia University, Amman, Jordan
Omar Gharaibeh: Computing and IS Department, Brunel University, UK
Rashid Al-Zubaidy: CIS Department, Philadelphia University, Amman, Jordan
Journal of Information & Knowledge Management (JIKM), 2012, vol. 11, issue 01, 1-10
Abstract:
A well-known classification problem in the domain of text mining is text classification, which concerns about mapping textual documents into one or more predefined category based on its content. Text classification arena recently attracted many researchers because of the massive amounts of online documents and text archives which hold essential information for a decision-making process. In this field, most of such researches focus on classifying English documents while there are limited studies conducted on other languages like Arabic. In this respect, the paper proposes to investigate the problem of Arabic text classification comprehensively. More specifically the study measures the performance of different rule based classification approaches adopted from machine learning and data mining towards the problem of text Arabic classification. In particular, four different rule based classification approaches: Decision trees (C4.5), Rule Induction (RIPPER), Hybrid (PART) and Simple Rule (One Rule) are evaluated against the published Corpus of Contemporary Arabic Arabic text collection. This experimentation is carried out by employing a modified version of WEKA business intelligence tool. Through analysing the produced results from the experimentation, we determine the most suitable classification algorithms for classifying Arabic texts.
Keywords: Arabic text mining; classification; data mining; rule-based classifiers (search for similar items in EconPapers)
Date: 2012
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649212500062
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:11:y:2012:i:01:n:s0219649212500062
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219649212500062
Access Statistics for this article
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().