EconPapers    
Economics at your fingertips  
 

The distribution of term usage in manipulative indexes

Nona Houston and Eugene Wall

American Documentation, 1964, vol. 15, issue 2, 105-114

Abstract: A semi‐empirical correlation, based on data from nine indexes, permits the prediction of the percentage of terms in a manipulative index vocabulary which will be used to index any given number of documents. This is a function of the total number of index entries in the system. A log‐normal relationship, similar to Zipf's Law, exists between total index entries and distribution of term usage. Based upon the correlation, optimum vocabulary size and growth rate can be inferred, as well as the most efficient arrangement of index entries in a storage medium. The results agree well with published data and appear to be particularly useful for designers of mechanized retrieval or publication operations.

Date: 1964
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1002/asi.5090150208

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:amedoc:v:15:y:1964:i:2:p:105-114

Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1936-6108

Access Statistics for this article

American Documentation is currently edited by Javed Mostafa

More articles in American Documentation from Wiley Blackwell
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-19
Handle: RePEc:bla:amedoc:v:15:y:1964:i:2:p:105-114