Extractive Text Summarization-Based Framework for Sindhi Language

Aqsa Memon, Zainab Memon, Akhtar Hussain Jalbani
Department of Computer Science, Quaid-e-Awam University of Engineering Science & Technology, Nawabshah, Sindh

International Journal of Innovations in Science & Technology, 2025, vol. 7, issue 6, 147-155

Abstract: This paper presents an extractive text summarization method designed specifically for Sindhi, a culturally rich but low-resource Indo-Aryan language spoken widely in Pakistan. The study focuses on selecting the most relevant sentences from Sindhi texts, employing Natural Language Processing (NLP) techniques to generate concise summaries. The proposed system incorporates essential preprocessing steps, including text cleaning, tokenization, and stemming/lemmatization. For feature extraction, it uses TF-IDF and sentence embeddings. After the sentences are scored, the most significant ones are selected to form the final summary. The system's performance is evaluated on five test paragraphs using several metrics: F1 score, precision, recall, cosine similarity, normalized Levenshtein distance, and accuracy. The system demonstrates reliable and accurate summarization, consistently achieving high precision (1.0), a strong F1 score (0.89-0.92), a low normalized error (0.04), and an overall accuracy of 0.86. Graphical analysis further confirms the model's coherence, semantic retention, and low error rates. By leveraging NLP for information summarization, this study contributes to preserving and promoting the Sindhi language, with potential applications including digital accessibility, education, and content curation. Future research aims to enhance contextual understanding by exploring transformer-based models such as BERT and by extending the approach to abstractive summarization.
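The pipeline outlined in the abstract (sentence splitting, tokenization, TF-IDF scoring, top-sentence selection, then precision/recall/F1) can be illustrated roughly as follows. This is a minimal sketch and not the authors' implementation: every function name is hypothetical, the smoothed IDF formula and the sentence-level definition of precision and recall are assumptions, and stemming/lemmatization and sentence embeddings are left out entirely.

```python
import math
import re
from collections import Counter

# Assumption: sentences end with Latin punctuation, the Arabic-script
# full stop (U+06D4), or the Arabic question mark (U+061F).
SENTENCE_END = re.compile(r"(?<=[.!?\u06D4\u061F])\s+")


def split_sentences(text):
    """Split raw text into sentences on terminal punctuation."""
    return [s.strip() for s in SENTENCE_END.split(text) if s.strip()]


def tokenize(sentence):
    """Lowercase and extract word tokens; \\w is Unicode-aware in Python 3."""
    return re.findall(r"\w+", sentence.lower())


def score_sentences(sentences):
    """Score each sentence by the sum of its TF-IDF term weights.

    Each sentence is treated as one document. The smoothed IDF
    log((1 + n) / (1 + df)) + 1 is an assumed variant.
    """
    docs = [tokenize(s) for s in sentences]
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        tf = Counter(doc)
        score = sum(
            (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1.0)
            for term, count in tf.items()
        )
        scores.append(score)
    return scores


def summarize(text, k=2):
    """Return the k highest-scoring sentences in their original order."""
    sentences = split_sentences(text)
    scores = score_sentences(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    keep = sorted(ranked[:k])
    return [sentences[i] for i in keep]


def precision_recall_f1(selected, reference):
    """Sentence-level precision, recall, and F1 against a reference summary."""
    selected, reference = set(selected), set(reference)
    overlap = len(selected & reference)
    precision = overlap / len(selected) if selected else 0.0
    recall = overlap / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)
```

Calling `summarize(text, k)` on a Sindhi paragraph returns its `k` top-scoring sentences, and `precision_recall_f1` compares them with a human-selected reference list. Cosine similarity and edit-distance metrics would need additional code not shown here.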

Keywords: Sindhi Language; Extractive Summarization; Natural Language Processing (NLP); Sentence Selection; TF-IDF; Sentence Embeddings
Date: 2025

Downloads: (external link)
https://journal.50sea.com/index.php/IJIST/article/view/1371/1882 (application/pdf)
https://journal.50sea.com/index.php/IJIST/article/view/1371 (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:abq:ijist1:v:7:y:2025:i:6:p:147-155

International Journal of Innovations in Science & Technology is currently edited by Prof. Dr. Syed Amer Mahmood

More articles in International Journal of Innovations in Science & Technology from 50sea
Bibliographic data for series maintained by Iqra Nazeer.

 
Page updated 2025-10-22
Handle: RePEc:abq:ijist1:v:7:y:2025:i:6:p:147-155