Comparing words, stems, and roots as index terms in an Arabic Information Retrieval system
Ibrahim A. Al‐Kharashi and Martha W. Evens
Journal of the American Society for Information Science, 1994, vol. 45, issue 8, 548-560
Abstract:
The Micro‐AIRS System, a microcomputer system for Arabic Information Retrieval, was designed as an experimental system to investigate indexing and retrieval processes for Arabic bibliographic data. A series of experiments was performed using 29 queries against a base of 355 Arabic bibliographic records covering computer and information science, drawn from the bibliographic databank at King Abdulaziz City for Science and Technology. These experiments revealed that using roots and using stems as index terms gives better retrieval results than using words. The root performs as well as or better than the stem at low recall levels and definitely better at high recall levels. Three binary similarity coefficients were tried: the cosine, Dice, and Jaccard coefficients. All three led to exactly the same document rankings for every query. The experiments were run on an IBM/AT‐compatible microcomputer. Micro‐AIRS is written in Turbo C, Version 2.0. © 1994 John Wiley & Sons, Inc.
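The abstract names the three binary similarity coefficients but does not give their formulas. As a minimal sketch, assuming the standard set-based definitions over binary index terms (a term is either present or absent), the coefficients and the resulting ranking could look like the following. This is not the Micro‐AIRS code, which was written in Turbo C; the function names, the `rank` helper, and the placeholder terms `t1` to `t6` are illustrative assumptions.

```python
import math

# Documents and queries are modelled as sets of index terms (words, stems,
# or roots). Binary weighting: a term is either in the set or not.

def cosine(a, b):
    """Binary cosine: |A & B| / sqrt(|A| * |B|)."""
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))

def dice(a, b):
    """Dice: 2 * |A & B| / (|A| + |B|)."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0

def jaccard(a, b):
    """Jaccard: |A & B| / |A | B|."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0

def rank(query, docs, coeff):
    """Return document ids sorted by descending similarity (ties by id)."""
    return sorted(docs, key=lambda d: (-coeff(query, docs[d]), d))

# Hypothetical toy collection; t1..t6 stand in for index terms.
query = {"t1", "t2", "t3"}
docs = {
    "d1": {"t1", "t2"},
    "d2": {"t1", "t2", "t3", "t4", "t5", "t6"},
    "d3": {"t4"},
}

for name, coeff in (("cosine", cosine), ("dice", dice), ("jaccard", jaccard)):
    print(name, rank(query, docs, coeff))
```

With these definitions, Jaccard is a monotonic function of Dice (J = D / (2 - D)), so those two always order documents identically. The binary cosine normalizes by the geometric rather than the arithmetic mean of the set sizes, so in general it can order documents differently; agreement of all three, as reported above, is a property of the particular queries and collection.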
Date: 1994
Downloads: (external link)
https://doi.org/10.1002/(SICI)1097-4571(199409)45:83.0.CO;2-X
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:45:y:1994:i:8:p:548-560
More articles in Journal of the American Society for Information Science from Association for Information Science & Technology