What have you read? based Multi-Document Summarization
Sabina Irum (),
Jamal Abdul Nasir and
Zakia Jalil
Additional contact information
Sabina Irum: National University of Modern Languages Islamabad Pakistan
Jamal Abdul Nasir: Department of Computer Science Business Information Systems NUI Galway, Ireland
Zakia Jalil: Faculty of Basic and Applied Sciences International Islamic University, Islamabad, Pakistan
International Journal of Innovations in Science & Technology, 2022, vol. 4, issue 5, 94-102
Abstract:
Due to the tremendous amount of data available today, extracting essential information from such a large volume of data is quite tough. Particularly in the case of text documents, which need a significant amount of time from the user to read the material and extract useful information. The major problem is identifying the user's relevant documents, removing the most significant pieces of information, determining document relevancy, excluding extraneous information, reducing details, and generating a compact, consistent report. For all these issues, we proposed a novel technique that solves the problem of extracting important information from a huge amount of text data and using previously read documents to generate summaries of new documents. Our technique is more focused on extracting topics (also known as topic signatures) from the previously read documents and then selecting the sentences that are more relevant to these topics based on update summary generation. Besides this, the concept of overlapping value is used that digs out the meaningful words and word similarities. Another thing that makes our work better is the Dice Coefficient which measures the intersection of words between document sets and helps to eliminate redundancy. The summary generated is based on more diverse and highly representative sentences with an average length. Empirically, we have observed that our proposed novel technique performed better with baseline competitors on the real-world TAC2008 dataset.
Keywords: Data mining; Text mining; Text summarization; Topic Signature; Density peak; Update Summarization (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journal.50sea.com/index.php/IJIST/article/view/331/253 (application/pdf)
https://journal.50sea.com/index.php/IJIST/article/view/331 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:abq:ijist1:v:4:y:2022:i:5:p:94-102
DOI: 10.33411/IJIST/2022040508
Access Statistics for this article
International Journal of Innovations in Science & Technology is currently edited by Prof. Dr. Veraldo Lisenberg, Prof Dr. Ali Iqtedar Mirza
More articles in International Journal of Innovations in Science & Technology from 50sea
Bibliographic data for series maintained by Hafiz Haroon Ahmad, Iqra Nazeer ().