Multi term based co-term frequency method for term weighting in information retrieval
M. Santhanakumar,
C. Christopher Columbus and
K. Jayapriya
International Journal of Business Information Systems, 2018, vol. 28, issue 1, 79-94
Abstract:
Nowadays, World Wide Web (WWW) has become the only source of all kind of information. Retrieving the relevant web pages based on user queries from WWW is an exigent task. Term frequency inverse document frequency (TF-IDF) is the most frequently used method for term weighting based on the occurrences and presence of a term inside the document. Retrieved document based on a single query term may not relate to the user search. This may lead the user to process the unwanted information. So, this paper proposes a new term weighting method named co-term frequency, in which the weight is assigned according to the multi terms which commonly occur in all documents. From the measures of precision, recall and F-score of the proposed method, it is clearly evident that the proposed framework retrieves the most relevant web pages when compared to other term weighting methods.
Keywords: inverse document frequency; IDF; co-term frequency; CTF; term weighting; World Wide Web; WWW; precision; term frequency; F-score; recall. (search for similar items in EconPapers)
Date: 2018
References: Add references at CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.inderscience.com/link.php?id=91164 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijbisy:v:28:y:2018:i:1:p:79-94
Access Statistics for this article
More articles in International Journal of Business Information Systems from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().