Sarcasm detection in microblogs using Naïve Bayes and fuzzy clustering
Shubhadeep Mukherjee and Pradip Kumar Bala
Technology in Society, 2017, vol. 48, issue C, 19-27
Abstract:
Sarcasm detection in online text is a task of growing importance in a globalized world. Large corporations want to know how consumers perceive the products they launch, and they analyze microblogs such as Twitter to find out. These reviews, comments, and posts are under constant threat of being classified in the wrong category because of sarcasm. Automatic detection of sarcasm in microblogs such as Twitter is a difficult task: it requires a system that can use some knowledge to interpret the linguistic styles of authors. In this work, we try to provide this knowledge to the system by considering different sets of features which are relatively independent of the text, namely function words and part-of-speech n-grams. We test a range of feature sets using the Naïve Bayes and fuzzy clustering algorithms. Our results show that the sarcasm detection task benefits from the inclusion of features that capture the authorial style of microblog authors. We achieve an accuracy of approximately 65%, which is on the higher side of the sarcasm detection literature.
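To make the abstract's idea of function-word features with Naïve Bayes concrete, here is a minimal sketch in Python. It is not the authors' implementation: the function-word list, the four training posts, and the names `FUNCTION_WORDS`, `function_word_counts`, and `MultinomialNB` are all hypothetical and chosen only for illustration. The classifier is a standard multinomial Naïve Bayes with Laplace (add-one) smoothing, which may differ from the variant used in the paper.

```python
import math
from collections import Counter

# Hypothetical function-word list (assumption; the paper's list is not given here).
FUNCTION_WORDS = ["i", "the", "a", "so", "oh", "just", "really", "not", "what", "of"]

# Made-up toy posts, for illustration only.
TRAIN_TEXTS = [
    "Oh great, just what I really wanted",
    "Oh so the phone died again, really nice",
    "The battery of the phone is not good",
    "A review of the camera",
]
TRAIN_LABELS = ["sarcastic", "sarcastic", "literal", "literal"]


def function_word_counts(text):
    """Return the count of each function word, in FUNCTION_WORDS order."""
    tokens = [t.strip(".,!?\"'") for t in text.lower().split()]
    counts = Counter(t for t in tokens if t in FUNCTION_WORDS)
    return [counts[w] for w in FUNCTION_WORDS]


class MultinomialNB:
    """Multinomial Naive Bayes with add-one smoothing over count vectors."""

    def fit(self, X, y):
        self.classes = sorted(set(y))
        n_features = len(X[0])
        self.log_prior = {}
        self.log_likelihood = {}
        for c in self.classes:
            rows = [x for x, label in zip(X, y) if label == c]
            self.log_prior[c] = math.log(len(rows) / len(X))
            totals = [sum(col) for col in zip(*rows)]  # per-feature counts in class c
            denom = sum(totals) + n_features           # add-one smoothing
            self.log_likelihood[c] = [math.log((t + 1) / denom) for t in totals]
        return self

    def predict(self, x):
        # Pick the class with the highest log posterior (up to a constant).
        def score(c):
            return self.log_prior[c] + sum(
                n * lp for n, lp in zip(x, self.log_likelihood[c])
            )
        return max(self.classes, key=score)


def train_toy_model():
    X = [function_word_counts(t) for t in TRAIN_TEXTS]
    return MultinomialNB().fit(X, TRAIN_LABELS)
```

Calling `train_toy_model().predict(function_word_counts(post))` returns a label for a new post. Because the features ignore content words, the model can only pick up stylistic cues, which is the property the abstract attributes to function words; on a toy corpus this small the predictions carry no empirical weight.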
Date: 2017
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0160791X16300070
Full text for ScienceDirect subscribers only
Persistent link: https://EconPapers.repec.org/RePEc:eee:teinso:v:48:y:2017:i:c:p:19-27
DOI: 10.1016/j.techsoc.2016.10.003
Technology in Society is currently edited by Charla Griffy-Brown