Indexing by latent semantic analysis
Scott Deerwester,
Susan T. Dumais,
George W. Furnas,
Thomas K. Landauer and
Richard Harshman
Journal of the American Society for Information Science, 1990, vol. 41, issue 6, 391-407
Abstract:
A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher‐order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries. The particular technique used is singular‐value decomposition, in which a large term by document matrix is decomposed into a set of ca. 100 orthogonal factors from which the original matrix can be approximated by linear combination. Documents are represented by ca. 100 item vectors of factor weights. Queries are represented as pseudo‐document vectors formed from weighted combinations of terms, and documents with supra‐threshold cosine values are returned. Initial tests find this completely automatic method for retrieval to be promising. © 1990 John Wiley & Sons, Inc.
Date: 1990
References: Add references at CitEc
Citations: View citations in EconPapers (271)
Downloads: (external link)
https://doi.org/10.1002/(SICI)1097-4571(199009)41:63.0.CO;2-9
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamest:v:41:y:1990:i:6:p:391-407
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1097-4571
Access Statistics for this article
More articles in Journal of the American Society for Information Science from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().