Sentiment-based Overlapping Community Discovery
Fulya Ozcan
A chapter in Topics in Identification, Limited Dependent Variables, Partial Observability, Experimentation, and Flexible Modeling: Part A, 2019, vol. 40A, pp 41-63 from Emerald Publishing Ltd
Abstract:
This chapter investigates the behavior of users of Reddit’s news subreddit and the relationship between their sentiment and exchange rates. Using graphical models and natural language processing, hidden online communities among Reddit users are discovered. The data set used in this project is a mixture of text and categorical data from Reddit’s news subreddit. These data include the titles of the news pages and a few user characteristics, in addition to users’ comments. This data set is an excellent resource for studying user reaction to news, since the comments are directly linked to the webpage contents. The model considered in this chapter is a hierarchical mixture model, a generative model that detects overlapping networks using the sentiment from the user-generated content. The advantage of this model is that the communities (or groups) are assumed to follow a Chinese restaurant process, so the model can detect and cluster the communities automatically. The hidden variables and the hyperparameters for this model are obtained using Gibbs sampling.
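As an illustration of the Chinese restaurant process mentioned in the abstract, the following is a minimal sketch of sampling group assignments from such a prior. It is not the chapter's model or code: the function name `sample_crp`, the concentration parameter `alpha`, and the seed are assumptions introduced here, and the sentiment likelihood and Gibbs updates are omitted.

```python
import random


def sample_crp(n_customers, alpha, seed=0):
    """Draw group assignments from a Chinese restaurant process prior.

    Hypothetical helper for illustration only; `alpha` (> 0) is the
    concentration parameter controlling how readily new groups open.
    """
    rng = random.Random(seed)
    assignments = []  # assignments[i] = group index of item i
    group_sizes = []  # group_sizes[k] = number of items in group k
    for _ in range(n_customers):
        # An existing group k is chosen with weight group_sizes[k];
        # a new group is opened with weight alpha.
        weights = group_sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(group_sizes):
            group_sizes.append(1)  # open a new group
        else:
            group_sizes[k] += 1    # join an existing group
        assignments.append(k)
    return assignments, group_sizes


if __name__ == "__main__":
    groups, sizes = sample_crp(n_customers=200, alpha=1.5, seed=1)
    print("number of groups:", len(sizes))
    print("group sizes:", sizes)
```

Because the number of groups is an outcome of the sampling rather than an input, a prior of this kind lets the number of communities be inferred from the data instead of being fixed in advance.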
Keywords: Graphical models; hierarchical mixture models; hidden network discovery; mixture communities; natural language processing; overlapping communities; online networks
Date: 2019
Downloads: (external link)
http://www.emeraldinsight.com/10.1108/S0731-905320 ... RePEc&WT.mc_id=RePEc (text/html)
Access to full text is restricted to subscribers
Persistent link: https://EconPapers.repec.org/RePEc:eme:aecozz:s0731-90532019000040a004
Ordering information: This item can be ordered from
Emerald Group Publishing, Howard House, Wagon Lane, Bingley, BD16 1WA, UK
http://www.emeraldgr ... ies.htm?id=0731-9053
More chapters in Advances in Econometrics from Emerald Publishing Ltd
Bibliographic data for series maintained by Charlotte Maiorana.