Prospecting the Effect of Topic Modeling in Information Retrieval
Aakanksha Sharaff,
Jitesh Kumar Dewangan and
Dilip Singh Sisodia
Additional contact information
Aakanksha Sharaff: National Institute of Technology, Raipur, India
Jitesh Kumar Dewangan: Samsung Research Institute, Noida, India
Dilip Singh Sisodia: National Institute of Technology, Raipur, India
International Journal on Semantic Web and Information Systems (IJSWIS), 2021, vol. 17, issue 3, 18-34
Abstract:
Enormous records and data are gathered every day. Organization of this data is a challenging task. Topic modeling provides a way to categorize these documents, where high dimensionality of the corpus affects the result of topic model, making it important to apply feature selection or information retrieval process for dimensionality reduction. The requirement for efficient topic modeling includes the removal of unrelated words that might lead to specious coexistence of the unrelated words. This paper proposes an efficient framework for the generation of better topic coherence, where term frequency-inverse document frequency (TF-IDF) and parsimonious language model (PLM) are used for the information retrieval task. PLM extracts the important information and expels the general words from the corpus, whereas TF-IDF re-estimates the weightage of each word in the corpus. The work carried out in this paper improved the topic coherence measure to provide a better correlation among the actual topic and the topics generated from PLM.
Date: 2021
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/IJSWIS.2021070102 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jswis0:v:17:y:2021:i:3:p:18-34
Access Statistics for this article
International Journal on Semantic Web and Information Systems (IJSWIS) is currently edited by Brij Gupta
More articles in International Journal on Semantic Web and Information Systems (IJSWIS) from IGI Global
Bibliographic data for series maintained by Journal Editor ().