Exploration of Topic Classification in the Tourism Field with Text Mining Technology—A Case Study of the Academic Journal Papers
I-Cheng Chang,
Jeou-Shyan Horng,
Chih-Hsing Liu,
Sheng-Fang Chou and
Tai-Yi Yu
Additional contact information
I-Cheng Chang: Department of Environmental Engineering, National Ilan University, Yilan 260007, Taiwan
Jeou-Shyan Horng: Department of Food and Beverage, Shih Chien University, Taipei 104336, Taiwan
Chih-Hsing Liu: Department of Tourism Management, National Kaohsiung University of Science and Technology, Kaohsiung 811532, Taiwan
Sheng-Fang Chou: Department of Hospitality Management, Ming Chuan University, Taoyuan 333321, Taiwan
Tai-Yi Yu: Department of Risk Management and Insurance, Ming Chuan University, Taipei 111005, Taiwan
Sustainability, 2022, vol. 14, issue 7, 1-21
Abstract:
This study collects abstracts of SSCI tourism journal papers between 2010 and 2019 from the WoS (Web of Science) database and uses a novel method of topic classification to explore the vocabulary characteristics of the classified articles. The corpora of abstracts are given quantitative Term Frequency–Inverse Document Frequency (TF–IDF) weights. A hierarchical K-means cluster analysis is then performed to automatically classify the articles; co-word analysis techniques are used to show the characteristics of feature words for distinct clusters, titles, and the consistency of the classified articles. Based on the results for 5783 abstracts, cluster analysis classifies the number of K-means clusters into six categories: travel, culture, sustainability, model, behavior, and hotel. A cross-check method is applied to assess the consistency of the topic classifications, list titles and keywords of the documents with the three smallest distances in each category and apply a strategic diagram to present the features of the distinct categories.
Keywords: cluster analysis; text mining; word cloud; co-word analysis; strategic diagram (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://www.mdpi.com/2071-1050/14/7/4053/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/7/4053/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:7:p:4053-:d:782341
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().