EconPapers    
Economics at your fingertips  
 

Improving interpretations of topic modeling in microblogs

Sarah A. Alkhodair, Benjamin C. M. Fung, Osmud Rahman and Patrick C. K. Hung

Journal of the Association for Information Science & Technology, 2018, vol. 69, issue 4, 528-540

Abstract: Topic models were proposed to detect the underlying semantic structure of large collections of text documents to facilitate the process of browsing and accessing documents with similar ideas and topics. Applying topic models to short text documents to extract meaningful topics is challenging. The problem becomes even more complicated when dealing with short and noisy micro†posts in Twitter that are about one general topic. In such a case, the goal of applying topic models is to extract subtopics. This results in topics represented by similar sets of keywords, which in turn makes the process of topic interpretation more confusing. In this paper we propose a new method that incorporates Twitter†LDA, WordNet, and hashtags to enhance the keyword labels that represent each topic. We emphasize the importance of different keywords to different topics based on the semantic relationships and the co†occurrences of keywords in hashtags. We also propose a method to find the best number of topics to represent the text document collection. Experiments on two real†life Twitter datasets on fashion suggest that our method performs better than the original Twitter†LDA in terms of perplexity, topic coherence, and the quality of keywords for topic labeling.

Date: 2018
References: Add references at CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://doi.org/10.1002/asi.23980

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:jinfst:v:69:y:2018:i:4:p:528-540

Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=2330-1635

Access Statistics for this article

More articles in Journal of the Association for Information Science & Technology from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-19
Handle: RePEc:bla:jinfst:v:69:y:2018:i:4:p:528-540