A Method for Analysing Large-Scale UGC Data for Tourism: Application to the Case of Catalonia

Estela Marine-Roig and Salvador Anton Clave
Additional contact information
Estela Marine-Roig: Rovira i Virgili University
Salvador Anton Clave: Rovira i Virgili University

A chapter in Information and Communication Technologies in Tourism 2015, 2015, pp 3-17 from Springer

Abstract: In recent years, many articles have been published on user-generated content (UGC) data in tourism and hospitality, in particular on quantitative and qualitative content analysis of travel blogs and online travel reviews (OTRs). In general, researchers have worked on more or less population-representative samples of travel diaries, comprising tens or hundreds of files, which makes manual processing feasible. However, given the dramatic growth of this content, especially hospitality OTRs, this article proposes a method for semi-automatic downloading, arranging, cleaning, debugging, and analysing large-scale travel blog and OTR data. The main goal is to classify the collected webpages by date and destination and to enable offline content analysis of the text as written by the author. The method is applied to about 85,000 diaries of tourists who visited Catalonia between 2004 and 2013, and significant results are obtained in terms of content analysis.

Keywords: Travel blog; Online travel review; Web harvesting; Web data mining; Massive content analysis; Catalonia
Date: 2015
Citations: 2 in EconPapers

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-319-14343-9_1

Ordering information: This item can be ordered from
http://www.springer.com/9783319143439

DOI: 10.1007/978-3-319-14343-9_1

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-04-02
Handle: RePEc:spr:sprchp:978-3-319-14343-9_1