MeSH-Based Semantic Weighting Scheme to Enhance Document Indexing: Application on Biomedical Document Classification
Imen Gabsi (),
Hager Kammoun (),
Dalila Souidi () and
Ikram Amous ()
Additional contact information
Imen Gabsi: Sfax University, MIRACL Laboratory, Sfax, Tunisia
Hager Kammoun: Sfax University, MIRACL Laboratory, Sfax, Tunisia
Dalila Souidi: Sfax University, MIRACL Laboratory, Sfax, Tunisia
Ikram Amous: Sfax University, MIRACL Laboratory, Sfax, Tunisia
Journal of Information & Knowledge Management (JIKM), 2025, vol. 24, issue 03, 1-32
Abstract:
Document indexing phase plays a significant role in text mining applications such as text document classification. The common indexing paradigm is based on terms frequency in documents known as Bag Of Words (BOW)-based representation approach. However, such classical approach suffers from ambiguity and disparity of words. In addition, traditional term weighting schemes, such as TF-IDF, exploit only the statistical information of terms in documents. To overcome these problems, we have been interested in biomedical semantic document indexing using concepts extracted from the knowledge resource MeSH. Thus, we have focused first on a disambiguation method to identify the adequate senses of ambiguous MeSH concepts and we have considered four representation enrichment strategies to identify the best appropriate representatives of the adequate sense in the textual entities representation. Second, we propose to introduce a semantic weighting scheme that quantifies MeSH concept’s importance in documents through their occurrence frequency and semantic similarities with unambiguous MeSH concepts. Our contribution lies particularly in the in-depth experimental study of the performance of these methods and precisely the impact of the semantic weighting scheme on the performance. To do that, three benchmark datasets TREC 2004 genomics, BioCreative II and OHSUMED were used.
Keywords: Semantic document indexing; word sense disambiguation; weighting scheme (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649224500357
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:24:y:2025:i:03:n:s0219649224500357
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219649224500357
Access Statistics for this article
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().