EconPapers    
Economics at your fingertips  
 

Systematic investigation of keywords selection and processing strategy on search engine forecasting: a case of tourist volume in Beijing

Ziqi Yuan and Guozhu Jia ()
Additional contact information
Ziqi Yuan: Sichuan Normal University
Guozhu Jia: Sichuan Normal University

Information Technology & Tourism, 2022, vol. 24, issue 4, No 5, 547-580

Abstract: Abstract The timeliness, precision, and low cost of search data have great potential for projecting tourist volume. Obtaining valuable information for decision-making, particularly for predicting, is hampered by the vast amount of search data. A systematic investigation of keyword selection and processing has been conducted. Using Beijing tourist volume as an example, 11 different feature extraction algorithms were selected and combined with long short-term memory (LSTM), random forest (RF) and fuzzy time series (FTS) for forecasting tourist volume. A total of 1612 keywords were retrieved from Baidu Index demand mapping using the direct word extraction method, range word extraction method and empirical selection method. The remaining 813 keywords were subjected to feature extraction. Based on the forecasting results of medium and short-term (1-day, 7-days and 10-days), the forecasting results of Kernel principal component analysis (KPCA) and locally linear embedding (LLE) are relatively stable when the dimensionality is reduced to 5 dimensions. The forecasting results of t-stochastic neighbor embedding (t-SNE), isometric mapping (IsoMap) and locally linear embedding (LLE), locality preserving projections (LPP), independent component correlation (ICA) are relatively stable when the dimensionality is reduced to 10 dimensions. Accurately forecasting many factors (transportation, attraction, food, lodging, travel, tips, tickets, and weather) provides a solid foundation for tourism demand optimization and scientific management and a resource for tourists' holistic vacation planning.

Keywords: Tourist volume forecasting; Search engine data; Feature extraction algorithm; LSTM; FTS; RF (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://link.springer.com/10.1007/s40558-022-00238-5 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:infott:v:24:y:2022:i:4:d:10.1007_s40558-022-00238-5

Ordering information: This journal article can be ordered from
http://www.springer. ... ystems/journal/40558

DOI: 10.1007/s40558-022-00238-5

Access Statistics for this article

Information Technology & Tourism is currently edited by Zheng Xiang

More articles in Information Technology & Tourism from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-06
Handle: RePEc:spr:infott:v:24:y:2022:i:4:d:10.1007_s40558-022-00238-5