Topic Modeling of Online Accommodation Reviews via Latent Dirichlet Allocation
Ian Sutherland,
Youngseok Sim,
Seul Ki Lee,
Jaemun Byun and
Kiattipoom Kiatkawsin
Additional contact information
Ian Sutherland: Department of Hospitality and Tourism Management, Tourism Industry Data Analytics Lab (TIDAL), Sejong University, Seoul 05006
Youngseok Sim: Department of Hospitality and Tourism Management, Tourism Industry Data Analytics Lab (TIDAL), Sejong University, Seoul 05006
Seul Ki Lee: Department of Hospitality and Tourism Management, Tourism Industry Data Analytics Lab (TIDAL), Sejong University, Seoul 05006
Jaemun Byun: Department of Hospitality and Tourism Management, Tourism Industry Data Analytics Lab (TIDAL), Sejong University, Seoul 05006
Kiattipoom Kiatkawsin: Department of Hospitality and Tourism Management, Tourism Industry Data Analytics Lab (TIDAL), Sejong University, Seoul 05006
Sustainability, 2020, vol. 12, issue 5, 1-15
Abstract:
There is a lot of attention given to the determinants of guest satisfaction and consumer behavior in the tourism literature. While much extant literature uses a deductive approach for identifying guest satisfaction dimensions, we apply an inductive approach by utilizing large unstructured text data of 104,161 online reviews of Korean accommodation customers to frame which topics of interest guests find important. Using latent Dirichlet allocation, a generative, Bayesian, hierarchical statistical model, we extract and validate topics of interest in the dataset. The results corroborate extant literature in that dimensions, such as location and service quality, are important. However, we extend existing dimensions of importance by more precisely distinguishing aspects of location and service quality. Furthermore, by comparing the characteristics of the accommodations in terms of metropolitan versus rural and the type of accommodation, we reveal differences in topics of importance between different characteristics of the accommodations. Specifically, we find a higher importance for points of competition and points of uniqueness among the accommodation characteristics. This has implications for how managers can improve customer satisfaction and how researchers can more precisely measure customer satisfaction in the hospitality industry.
Keywords: topic modeling; latent Dirichlet allocation; tourism 4.0; online travel agency; online review; text analytics; improve customer satisfaction; inductive approach; dimensions of interest; era of big data (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (9)
Downloads: (external link)
https://www.mdpi.com/2071-1050/12/5/1821/pdf (application/pdf)
https://www.mdpi.com/2071-1050/12/5/1821/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:12:y:2020:i:5:p:1821-:d:326363
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().