EconPapers    
Economics at your fingertips  
 

Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN

Senqi Yang, Xuliang Duan (), Zeyan Xiao, Zhiyao Li, Yuhai Liu, Zhihao Jie, Dezhao Tang and Hui Du
Additional contact information
Senqi Yang: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Xuliang Duan: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Zeyan Xiao: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Zhiyao Li: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Yuhai Liu: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Zhihao Jie: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Dezhao Tang: College of Information Engineering, Sichuan Agricultural University, Ya’an 625000, China
Hui Du: Housing and Urban-Rural Development Bureau of Lincheng County, Xingtai 054000, China

IJERPH, 2022, vol. 19, issue 20, 1-20

Abstract: Nowadays, tourists increasingly prefer to check the reviews of attractions before traveling to decide whether to visit them or not. To respond to the change in the way tourists choose attractions, it is important to classify the reviews of attractions with high precision. In addition, more and more tourists like to use emojis to express their satisfaction or dissatisfaction with the attractions. In this paper, we built a dataset for Chinese attraction evaluation incorporating emojis (CAEIE) and proposed an explicitly n-gram masking method to enhance the integration of coarse-grained information into a pre-training (ERNIE-Gram) and Text Graph Convolutional Network (textGCN) (E2G) model to classify the dataset with a high accuracy. The E2G preprocesses the text and feeds it to ERNIE-Gram and TextGCN. ERNIE-Gram was trained using its unique mask mechanism to obtain the final probabilities. TextGCN used the dataset to construct heterogeneous graphs with comment text and words, which were trained to obtain a representation of the document output category probabilities. The two probabilities were calculated to obtain the final results. To demonstrate the validity of the E2G model, this paper was compared with advanced models. After experiments, it was shown that E2G had a good classification effect on the CAEIE dataset, and the accuracy of classification was up to 97.37%. Furthermore, the accuracy of E2G was 1.37% and 1.35% ahead of ERNIE-Gram and TextGCN, respectively. In addition, two sets of comparison experiments were conducted to verify the performance of TextGCN and TextGAT on the CAEIE dataset. The final results showed that ERNIE and ERNIE-Gram combined TextGCN and TextGAT, respectively, and TextGCN performed 1.6% and 2.15% ahead. This paper compared the effects of eight activation functions on the second layer of the TextGCN and the activation-function-rectified linear unit 6 (RELU6) with the best results based on experiments.

Keywords: sustainable tourism; tourism satisfaction; scenic spot evaluation text; natural language processing (NLP); text classification (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1660-4601/19/20/13520/pdf (application/pdf)
https://www.mdpi.com/1660-4601/19/20/13520/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:19:y:2022:i:20:p:13520-:d:946692

Access Statistics for this article

IJERPH is currently edited by Ms. Jenna Liu

More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jijerp:v:19:y:2022:i:20:p:13520-:d:946692