Some hierarchical models for automatic document retrieval
Gerard Salton
American Documentation, 1963, vol. 14, issue 3, 213-222
Abstract:
Within the last few years, several automatic indexing and abstracting systems have been designed which are based primarily on word frequency counts and on techniques for measuring word and document associations. These systems are not wholly successful because both the sentence structure and the semantic relations between words are normally disregarded. An attempt is made in the present study to overcome the limitations of the strictly quantitative methods by presenting two systems for automatic document retrieval which are based on hierarchical storage arrangements as well as on the usual frequency counts and association measures. The first one utilizes a hierarchical arrangement similar to a library classification schedule, including lists of synonyms or related words, and cross‐references. The second uses, in addition, a simplified form of syntactic analysis, thus making it possible to represent the syntactic dependency structure between individual words. The required retrieval operations are described briefly and are compared with those of the simpler quantitative model.
Date: 1963
DOI:
https://doi.org/10.1002/asi.5090140307
Persistent link: https://EconPapers.repec.org/RePEc:bla:amedoc:v:14:y:1963:i:3:p:213-222