Methods for Data Representation
Ramón Zatarain Cabada,
Héctor Manuel Cárdenas López and
Hugo Jair Escalante
Additional contact information
Ramón Zatarain Cabada: Instituto Tecnológico de Culiacán
Héctor Manuel Cárdenas López: Instituto Tecnológico de Culiacán
Hugo Jair Escalante: Instituto Nacional de Astrofísica
Chapter 5 in Multimodal Affective Computing, 2023, pp 55-65 from Springer
Abstract:
This chapter describes methods for representing data in text-based sentiment analysis. It begins with an introduction to the preprocessing algorithms commonly used for data preparation. It then describes the tokenization techniques used in sentiment analysis and natural language processing to represent text-based data as vectors. Next, it discusses parsing as a data representation technique, in which a parse tree represents a sentence, followed by the difference between stemming and lemmatization as partial representations of text that, combined with tokenization, yield another type of text-based data representation. Finally, the chapter describes word embeddings and some of the algorithms used in this representation technique, and closes with conclusions on preparing data through data representation for training ML and DL models in sentiment analysis. The goal of the chapter is to introduce the reader to a number of data representation techniques that can be implemented in text-based sentiment analysis.
Date: 2023
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-031-32542-7_5
Ordering information: This item can be ordered from
http://www.springer.com/9783031325427
DOI: 10.1007/978-3-031-32542-7_5