Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm
H. A. Ali,
Ali I.El Desouky and
Ahmed I. Saleh
Additional contact information
H. A. Ali: Mansoura University, Egypt
Ali I.El Desouky: Mansoura University, Egypt
Ahmed I. Saleh: Mansoura University, Egypt
International Journal of Information Technology and Web Engineering (IJITWE), 2007, vol. 2, issue 2, 1-44
Abstract:
Recently it will be more valued to build vertical classifiers to classify pages related to a specific domain and compensate those classifiers with novel learning techniques to achieve better performance. The contribution of this paper is three edged; firstly, a novel continuous learning technique is introduced. Secondly, the paper presents a new trend for Web page classification by presenting the domain-oriented classifiers. A new way of applying Bayes and K-Nearest Neighbor algorithms is introduced in order to build Domain Oriented (DONB) and (DOKNN) classifiers. The third contribution is combining both disciplines by introducing a novel classification strategy. Such strategy adds the continuous learning ability to Bayes theorem to build a (CLNB) classifier. It allows the classifier to adapt itself continuously for achieving better performance, and overcome the problem of overfitting. Experimental results have shown that CLNB demonstrates significant performance improvement over both DONB and DOKNN where its accuracy goes beyond 94.1% after testing 1000 pages.
Date: 2007
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/jitwe.2007040101 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jitwe0:v:2:y:2007:i:2:p:1-44
Access Statistics for this article
International Journal of Information Technology and Web Engineering (IJITWE) is currently edited by Ghazi I. Alkhatib
More articles in International Journal of Information Technology and Web Engineering (IJITWE) from IGI Global
Bibliographic data for series maintained by Journal Editor ().