What makes an automatic keyword classification effective?
K. Sparck Jones and
E. O. Barber
Journal of the American Society for Information Science, 1971, vol. 22, issue 3, 166-175
Abstract:
Though the idea of constructing a keyword classification for retrieval purposes automatically is not a new one, comparatively few systematic experiments have been carried out in this area; and while many suggestions have been put forward, not enough is known about the behaviour of automatic keyword classifications, and hence about the properties such classifications should have and the ways they should be used. In previous experiments we showed that some forms of classification could give good results, and this paper describes a further series of tests designed to examine this sort of classification in more detail, with a view to establishing the optimum forms of classification and procedures for using them in different retrieval situations. These tests demonstrate that further improvements in performance over that for unclassified keywords can be obtained, and that definite conclusions can be drawn about the correct approach to classification for collections like the test one: the best results are given when grouping is confined to strongly connected, non‐frequent keywords, when the classification is used to provide additional rather than alternative indexing terms, particularly for requests, and when matching is controlled by keyword collection frequency.
Date: 1971
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/asi.4630220305
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:22:y:1971:i:3:p:166-175
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1097-4571
Access Statistics for this article
More articles in Journal of the American Society for Information Science from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().