A Hybrid Approach to Retrieve Knowledge from a Document
Deepak Sahoo and
Rakesh Chandra Balabantaray
Additional contact information
Deepak Sahoo: IIIT-Bhubaneswar, Bhubaneswar, India
Rakesh Chandra Balabantaray: IIIT Bhubaneswar, Bhubaneswar, India
International Journal of Knowledge Management (IJKM), 2020, vol. 16, issue 1, 83-100
Abstract:
The task of retrieving the theme of a document and presenting a shorter form compared to the original text to the user is a challenging assignment. In this article, a hybrid approach to extract knowledge from a text document is presented, in which three key sentence level relationships in association with the Markov clustering algorithm is used to cluster sentences in the document. After clustering, sentences are ranked in each cluster and the highest ranked sentences in each cluster are merged. In the end, to get the final theme of the document, the Gradient boosting technique XGboost is used to compress the newly generated sentence. The DUC-2002 data set is used to evaluate the proposed system and it has been observed that the performance of the proposed system is better than other existing systems.
Date: 2020
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 4018/IJKM.2020010104 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jkm000:v:16:y:2020:i:1:p:83-100
Access Statistics for this article
International Journal of Knowledge Management (IJKM) is currently edited by Hakikur Rahman
More articles in International Journal of Knowledge Management (IJKM) from IGI Global
Bibliographic data for series maintained by Journal Editor ().