The impact of tree-based machine learning models, length of training data, and quarantine search query on tourist arrival prediction’s accuracy under COVID-19 in Indonesia

Mochammad Agus Afrianto and Meditya Wasesa

Current Issues in Tourism, 2022, vol. 25, issue 23, 3854-3870

Abstract: This study presents extreme gradient boosting (XGBoost) and random forest (RF) models that predict tourism demand by incorporating international COVID-19 cases, international tourist arrivals, and the destination's quarantine policy as predictors. Unlike other ‘black box’ machine learning models, these two tree-based models offer better interpretability through explicit feature importance scores and tree structure representations. The paper evaluates their accuracy in predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using long-range (January 2008–June 2021) and short-range (January 2018–June 2021) training datasets. Their performance is compared with benchmark models, such as the artificial neural network, autoregressive integrated moving average (ARIMA), and seasonal ARIMA models. In general, the tree-based machine learning models outperformed all benchmark models. The international COVID-19 cases and tourist arrivals predictors have the dominant feature importance scores in the XGBoost models. Google Trends keywords on quarantine policies, by contrast, show significant importance in the RF models but not in the XGBoost models. Moreover, the RF models are better than the XGBoost models in terms of accuracy and in avoiding overfitting.
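The two training windows named in the abstract differ considerably in length, which may be easier to see with a small sketch. The Python below uses only the standard library to enumerate the months in each window and defines two common forecast-accuracy measures. The abstract does not say which accuracy metrics the paper uses, so RMSE and MAPE are assumptions here, and the helper names (`month_range`, `rmse`, `mape`) and the toy numbers are purely illustrative, not taken from the study.

```python
from datetime import date
from math import sqrt


def month_range(start, end):
    """Return the first day of every month from start to end, inclusive."""
    months = []
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        months.append(date(year, month, 1))
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return months


def rmse(actual, predicted):
    """Root mean squared error (assumed metric, not confirmed by the abstract)."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return sqrt(sum(squared) / len(squared))


def mape(actual, predicted):
    """Mean absolute percentage error in percent; actual values must be non-zero."""
    ratios = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(ratios) / len(ratios)


# Training windows as stated in the abstract.
long_range = month_range(date(2008, 1, 1), date(2021, 6, 1))
short_range = month_range(date(2018, 1, 1), date(2021, 6, 1))
print(len(long_range), len(short_range))  # 162 42

# Made-up arrivals and forecasts, only to show how the metrics are called.
toy_actual = [100.0, 200.0, 400.0]
toy_forecast = [110.0, 180.0, 400.0]
print(round(rmse(toy_actual, toy_forecast), 2))
print(round(mape(toy_actual, toy_forecast), 2))
```

With monthly data, the long-range window covers 162 observations and the short-range window 42, so any model trained on the short range sees roughly a quarter as many points.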

Date: 2022

Downloads: (external link)
http://hdl.handle.net/10.1080/13683500.2022.2085079 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:taf:rcitxx:v:25:y:2022:i:23:p:3854-3870

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/rcit20

DOI: 10.1080/13683500.2022.2085079

Current Issues in Tourism is currently edited by Jennifer Tunstall

Handle: RePEc:taf:rcitxx:v:25:y:2022:i:23:p:3854-3870