Attribute Sentiment Scoring With Online Text Reviews: Accounting for Language Structure and Attribute Self-Selection
Minkyung Kim and
K. Sudhir ()
Additional contact information
Ishita Chakraborty: School of Management, Yale University
Minkyung Kim: School of Management, Yale University
K. Sudhir: Cowles Foundation & School of Management, Yale University, http://faculty.som.yale.edu/ksudhir/
No 2176, Cowles Foundation Discussion Papers from Cowles Foundation for Research in Economics, Yale University
The authors address two novel and significant challenges in using online text reviews to obtain attribute level ratings. First, they introduce the problem of inferring attribute level sentiment from text data to the marketing literature and develop a deep learning model to address it. While extant bag of words based topic models are fairly good at attribute discovery based on frequency of word or phrase occurrences, associating sentiments to attributes requires exploiting the spatial and sequential structure of language. Second, they illustrate how to correct for attribute self-selection—reviewers choose the subset of attributes to write about—in metrics of attribute level restaurant performance. Using Yelp.com reviews for empirical illustration, they find that a hybrid deep learning (CNN-LSTM) model, where CNN and LSTM exploit the spatial and sequential structure of language respectively provide the best performance in accuracy, training speed and training data size requirements. The model does particularly well on the “hard” sentiment classification problems. Further, accounting for attribute self-selection significantly impacts sentiment scores, especially on attributes that are frequently missing.
Keywords: Text mining; Natural language processing (NLP); Convolutional neural networks (CNN); Long-short term memory (LSTM) Networks; Deep learning; Lexicons; Endogeneity; Self-selection; Online reviews; Online ratings; Customer satisfaction (search for similar items in EconPapers)
JEL-codes: M1 M3 C8 C5 (search for similar items in EconPapers)
Pages: 55 pages
New Economics Papers: this item is included in nep-big, nep-cmp and nep-mst
References: View references in EconPapers View complete reference list from CitEc
Citations: Track citations by RSS feed
Downloads: (external link)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:cwl:cwldpp:2176
Ordering information: This working paper can be ordered from
Cowles Foundation, Yale University, Box 208281, New Haven, CT 06520-8281 USA
The price is None.
Access Statistics for this paper
More papers in Cowles Foundation Discussion Papers from Cowles Foundation for Research in Economics, Yale University Yale University, Box 208281, New Haven, CT 06520-8281 USA. Contact information at EDIRC.
Bibliographic data for series maintained by Matthew Regan ().