The distribution of term usage in manipulative indexes
Nona Houston and Eugene Wall
American Documentation, 1964, vol. 15, issue 2, 105-114
Abstract:
A semi‐empirical correlation, based on data from nine indexes, permits the prediction of the percentage of terms in a manipulative index vocabulary which will be used to index any given number of documents. This is a function of the total number of index entries in the system. A log‐normal relationship, similar to Zipf's Law, exists between total index entries and distribution of term usage. Based upon the correlation, optimum vocabulary size and growth rate can be inferred, as well as the most efficient arrangement of index entries in a storage medium. The results agree well with published data and appear to be particularly useful for designers of mechanized retrieval or publication operations.
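The log‐normal relationship described in the abstract can be illustrated with a minimal sketch. It assumes that the number of documents indexed by each term follows a log‐normal distribution, so that the fraction of vocabulary terms used for at most n documents is the log‐normal cumulative distribution. The function name `lognormal_cdf` and the parameters `median_usage` and `sigma` are hypothetical placeholders, not the fitted values from the paper, which ties the distribution to the total number of index entries in the system.

```python
import math


def lognormal_cdf(n, median, sigma):
    """Fraction of terms used to index at most n documents,
    assuming per-term usage counts are log-normally distributed.

    median and sigma are illustrative assumptions, not values
    taken from the Houston and Wall correlation.
    """
    if n <= 0:
        return 0.0
    # Standardize ln(n) against the log-normal location and spread.
    z = (math.log(n) - math.log(median)) / sigma
    # Standard normal CDF expressed through the error function.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


if __name__ == "__main__":
    median_usage = 10.0  # hypothetical: half of all terms index <= 10 documents
    sigma = 1.5          # hypothetical spread on the natural-log scale
    for n in (1, 10, 100, 1000):
        pct = 100.0 * lognormal_cdf(n, median_usage, sigma)
        print(f"terms used for at most {n} documents: {pct:.1f}%")
```

By construction, half of the terms fall at or below the assumed median usage. In an actual application both parameters would be derived from the total number of index entries, as the abstract indicates.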
Date: 1964
Downloads: https://doi.org/10.1002/asi.5090150208
Persistent link: https://EconPapers.repec.org/RePEc:bla:amedoc:v:15:y:1964:i:2:p:105-114
Ordering information: This journal article can be ordered from https://doi.org/10.1002/(ISSN)1936-6108
American Documentation is currently edited by Javed Mostafa