Combining N-Grams and Stemming for Arabic Word-Based Inexact Matching and Term Conflation
Suleiman H. Mustafa ()
Additional contact information
Suleiman H. Mustafa: Dept. of Comp. Info. Systems, Yarmouk University, Irbid-Jordan, Jordan
Journal of Information & Knowledge Management (JIKM), 2005, vol. 04, issue 01, 29-36
Abstract:
In this paper, the results of three N-gram techniques have been reported. Two of these techniques were based on the idea of combining N-grams and stemming. The first used first-order stemming, while the other used light stemming. The performance of the combined approach was then compared with that of pure conventional N-gram-based string matching. The results provide good evidence that combining N-grams with stemming improves the overall performance, as measured by word-match recall and word-match precision, using different similarity threshold values.
Keywords: N-grams; Arabic string matching; text searching; stemming; information retrieval; word conflation (search for similar items in EconPapers)
Date: 2005
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649205000992
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:04:y:2005:i:01:n:s0219649205000992
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219649205000992
Access Statistics for this article
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().