EconPapers    
Economics at your fingertips  
 

Reading between the lines with topic models and machine learning: Islam’s representation on Wikipedia

Sazid Zaman Khan (), Jamil As-ad (), Md Khaliluzzaman (), Toni Anwar () and Rashedul Islam ()
Additional contact information
Sazid Zaman Khan: International Islamic University Chittagong
Jamil As-ad: International Islamic University Chittagong
Md Khaliluzzaman: International Islamic University Chittagong
Toni Anwar: Universiti Teknologi Petronas
Rashedul Islam: International Islamic University Chittagong

Journal of Computational Social Science, 2025, vol. 8, issue 4, No 8, 19 pages

Abstract: Abstract Islam is a highly searched topic on the World Wide Web. Thousands of articles on Islam can be found on the web. While there are tons of websites, articles and blogs on the web, Wikipedia is one of the primary sources of information from which an interested reader can know about Islam. The representation of Islam on such an important information source is worthy of investigation. In this work, we first construct a representative dataset on Islam using Wikipedia articles. Afterwards, we apply several topic modelling and machine learning based approaches on the newly constructed dataset to find representation of Islam on Wikipedia. Also, we design two algorithms based on word2vec to find the inter topic similarity and intra topic similarity for the topic models. The intra topic similarity algorithm agrees well with human judgment of topic resolution and coherence of topics. As topic models find the dominant topics prevailing in a natural language document corpus, the intra topic similarity algorithm can be used as a new metric to find the coherence of single topics within the topic model.

Keywords: Representation of Islam; Islamic data mining; Topic modelling; Natural language processing (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s42001-025-00415-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:jcsosc:v:8:y:2025:i:4:d:10.1007_s42001-025-00415-6

Ordering information: This journal article can be ordered from
http://www.springer. ... iences/journal/42001

DOI: 10.1007/s42001-025-00415-6

Access Statistics for this article

Journal of Computational Social Science is currently edited by Takashi Kamihigashi

More articles in Journal of Computational Social Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-08-19
Handle: RePEc:spr:jcsosc:v:8:y:2025:i:4:d:10.1007_s42001-025-00415-6