Design and validation of annotation schemas for aspect-based sentiment analysis in the tourism sector
Antonio Moreno-Ortiz (),
Soluna Salles-Bernal () and
Aroa Orrequia-Barea ()
Additional contact information
Antonio Moreno-Ortiz: Universidad de Málaga
Soluna Salles-Bernal: Universidad de Málaga
Aroa Orrequia-Barea: Universidad de Jaén
Information Technology & Tourism, 2019, vol. 21, issue 4, No 4, 535-557
Abstract:
Abstract The use of linguistic resources beyond the scope of language studies, e.g., commercial purposes, has become commonplace since the availability of massive amounts of data and the development of software tools to process them. An interesting perspective on these data is provided by Sentiment Analysis, which attempts to identify the polarity of a text, but can also pursue further, more challenging aims, such as the automatic identification of the specific entities and aspects being discussed in the evaluative speech act, along with the polarity associated with them. This approach, known as aspect-based sentiment analysis, seeks to offer fine-grained information from raw text, but its success depends largely on the existence of pre-annotated domain-specific corpora, which in turn calls for the design and validation of an annotation schema. This paper examines the methodological aspects involved in the creation of such annotation schema and is motivated by the scarcity of information found in the literature. We describe the insights we obtained from the annotation schema generation and validation process within our project, whose objectives include the development of advanced sentiment analysis software of user reviews in the tourism sector. We focus on the identification of the relevant entities and attributes in the domain, which we extract from a corpus of user reviews, and go on to describe the schema creation and validation process. We begin by describing the corpus annotation process and its further iterative refinement by means of several inter-annotator agreement measurements, which we believe is key to a successful annotation schema.
Keywords: Annotation schema; Aspect-based sentiment analysis; Inter-rater agreement; Tourism industry; User-generated content (search for similar items in EconPapers)
Date: 2019
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
http://link.springer.com/10.1007/s40558-019-00155-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:infott:v:21:y:2019:i:4:d:10.1007_s40558-019-00155-0
Ordering information: This journal article can be ordered from
http://www.springer. ... ystems/journal/40558
DOI: 10.1007/s40558-019-00155-0
Access Statistics for this article
Information Technology & Tourism is currently edited by Zheng Xiang
More articles in Information Technology & Tourism from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().