Three level weight for latent semantic analysis: an efficient approach to find enhanced semantic themes
Pooja Kherwa and
Poonam Bansal
International Journal of Knowledge and Learning, 2023, vol. 16, issue 1, 56-72
Abstract:
Latent semantic analysis is a prominent semantic themes detection and topic modelling technique. In this paper, we have designed a three-level weight for latent semantic analysis for creating an optimised semantic space for large collection of documents. Using this novel approach, an efficient latent semantic space is created, in which terms in documents comes closer to each other, which appear far away in actual document collection. In this approach, authors used two dataset: first is a synthetic dataset consists of small stories collected by the authors; second is benchmark BBC-news dataset used in text mining applications. These proposed three level weight models assign weight at term level, document level, and at a corpus level. These weight models are known as: 1) NPC; 2) NTC; 3) APC; 4) ATC. These weight models are tested on both the dataset, compared with state of the art term frequency and it has shown significant improved performances in term set correlation, document set correlation and has also shown highest correlation in semantic similarity of terms in semantic space generated through these three level weights. Our approach also shows automatic context clustering generated in dataset through three level weights.
Keywords: single value decomposition; SVD; latent semantic analysis; LSA; context clustering; semantic space. (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.inderscience.com/link.php?id=127328 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijklea:v:16:y:2023:i:1:p:56-72
Access Statistics for this article
More articles in International Journal of Knowledge and Learning from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().