Determining the context of text using augmented latent semantic indexing
Tom Rishel,
Louise A. Perkins,
Sumanth Yenduri and
Farnaz Zand
Journal of the American Society for Information Science and Technology, 2007, vol. 58, issue 14, 2197-2204
Abstract:
Latent semantic analysis has been used for several years to improve the performance of document library searches. We show that latent semantic analysis, augmented with a Part–of–Speech Tagger, may be an effective algorithm for classifying a textual document as well. Using Brille's Part–of–Speech Tagger, we truncate the singular value decomposition used in latent semantic analysis to reduce the size of the word–frequency matrix. This method is then tested on a toy problem, and has shown to increase search accuracy. We then relate these results to natural language processing and show that latent semantic analysis can be combined with context free grammars to infer semantic meaning from natural language. English is the natural language currently being used.
Date: 2007
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/asi.20687
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamist:v:58:y:2007:i:14:p:2197-2204
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1532-2890
Access Statistics for this article
More articles in Journal of the American Society for Information Science and Technology from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().