Improve Naïve Bayesian Classifier by Using Genetic Algorithm for Arabic Document
Farah Zawaideh () and
Raed Sahawneh ()
Additional contact information
Farah Zawaideh: Irbid National University
Raed Sahawneh: Irbid National University
No 6409186, Proceedings of International Academic Conferences from International Institute of Social and Economic Sciences
Abstract:
Automatic text categorization (TC) has become one of the most interesting fields for researchers in data mining, information retrieval, web text mining, as well as natural language processing paradigms due to the vast number of new documents being retrieved for various information retrieval systems. This paper proposes a new TC technique, which classifies Arabic language text documents using the naïve Bayesian classifier attached to a genetic algorithm, model; this algorithm classifies documents by generating a random sample of chromosomes that represent documents in the corpus. The developed model aims to enhance the work of naïve Bayesian classifier through applying the genetic algorithm model. Experiment results show that the precision and recall are increased when testing higher number of documents; the precision was ranged from 0.8 to 0.97 for different testing environment; the number of genes that is placed in every chromosome is also tested and experiments show that the best value for the number of genes is 50 genes
Keywords: Data mining; Text classification; Genetic algorithm; Naïve Bayesian Classifier; N-gram processing (search for similar items in EconPapers)
JEL-codes: C80 (search for similar items in EconPapers)
Pages: 1 page
Date: 2018-06
New Economics Papers: this item is included in nep-big, nep-cmp, nep-ecm and nep-ict
References: Add references at CitEc
Citations:
Published in Proceedings of the Proceedings of the 35th International Academic Conference, Barcelona, Jun 2018, pages 162-162
Downloads: (external link)
https://iises.net/proceedings/35th-international-a ... =64&iid=054&rid=9186 First version, 2018
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:sek:iacpro:6409186
Access Statistics for this paper
More papers in Proceedings of International Academic Conferences from International Institute of Social and Economic Sciences
Bibliographic data for series maintained by Klara Cermakova ().