Sentiment-based Overlapping Community Discovery

Fulya Ozcan

A chapter in Topics in Identification, Limited Dependent Variables, Partial Observability, Experimentation, and Flexible Modeling: Part A, 2019, vol. 40A, pp 41-63 from Emerald Group Publishing Limited

Abstract: This chapter investigates the behavior of users of Reddit’s news subreddit and the relationship between their sentiment and exchange rates. Using graphical models and natural language processing, it discovers hidden online communities among Reddit users. The data set is a mixture of text and categorical data from Reddit’s news subreddit: the titles of the news pages, a few user characteristics, and users’ comments. It is well suited to studying user reaction to news because the comments are directly linked to the webpage contents. The model considered is a hierarchical mixture model, a generative model that detects overlapping networks using the sentiment in user-generated content. Its advantage is that the communities (or groups) are assumed to follow a Chinese restaurant process, so the number of communities does not have to be fixed in advance and the model can detect and cluster them automatically. The hidden variables and the hyperparameters of the model are obtained using Gibbs sampling.
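
As an illustration of the Chinese restaurant process named in the abstract, the following is a minimal sketch of how such a prior assigns items to an open-ended number of groups. It is not the chapter's model or its Gibbs sampler: the function name sample_crp, the concentration parameter alpha, and the seed are assumptions introduced here for illustration only.

```python
import random


def sample_crp(n_customers, alpha, seed=0):
    """Draw group assignments from a Chinese restaurant process (illustrative).

    Each new customer joins existing table k with probability proportional to
    the number of customers already seated there, or opens a new table with
    probability proportional to alpha (alpha > 0, a hypothetical setting).
    """
    rng = random.Random(seed)
    assignments = []  # assignments[i] = table index of customer i
    counts = []       # counts[k] = number of customers at table k
    for _ in range(n_customers):
        # Weights: current table sizes, plus alpha for a brand-new table.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new table (a new community)
        else:
            counts[k] += 1     # join an existing table
        assignments.append(k)
    return assignments, counts


if __name__ == "__main__":
    assignments, counts = sample_crp(n_customers=200, alpha=1.5)
    print("number of groups:", len(counts))
    print("group sizes:", counts)
```

The number of groups is not passed in; it emerges from the draws, which is the property the abstract relies on. Larger values of alpha tend to produce more groups.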

Keywords: Graphical models; hierarchical mixture models; hidden network discovery; mixture communities; natural language processing; overlapping communities; online networks
Date: 2019

Downloads: (external link)
https://www.emerald.com/insight/content/doi/10.110 ... d&utm_campaign=repec (text/html)
https://www.emerald.com/insight/content/doi/10.110 ... 1-90532019000040A004
https://www.emerald.com/insight/content/doi/10.110 ... d&utm_campaign=repec (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:eme:aecozz:s0731-90532019000040a004

DOI: 10.1108/S0731-90532019000040A004

More chapters in Advances in Econometrics from Emerald Group Publishing Limited
 
Page updated 2025-04-15
Handle: RePEc:eme:aecozz:s0731-90532019000040a004