Fraudulent review detection model focusing on emotional expressions and explicit aspects: investigating the potential of feature engineering
Ajay Kumar,
Ram D. Gopal,
Ravi Shankar and
Kim Hua Tan
Additional contact information
Ajay Kumar: EM - EMLyon Business School
Ram D. Gopal: WBS - Warwick Business School - University of Warwick [Coventry]
Ravi Shankar: IIT Delhi - Indian Institute of Technology Delhi
Kim Hua Tan: Nottingham University Business School [Nottingham]
Post-Print from HAL
Abstract:
Reading customer reviews before purchasing items online has become common practice; however, some companies use machine learning (ML) algorithms to generate false reviews in order to create positive brand images of their own products and negative images of competitors' offerings. Existing techniques use review content to identify fraudulent reviewers; however, spammers have become more sophisticated, have learned from their mistakes, and have changed their tactics to evade detection. Thus, investigating how fraudulent accounts generate fake positive reviews for themselves or fake negative reviews for competitors, and establishing the need for ML classifiers to identify fraudulent reviews, is more important than ever. In this research, we present a novel feature engineering approach in which we (1) extract several "review-centric" and "reviewer-centric" features from a dataset; (2) combine the cumulative effects of the feature distributions into a unified model that represents the overall behavior of fraudulent reviewers; (3) investigate the role of effective data pre-processing in improving detection accuracy; and (4) develop a probabilistic approach to detect fraudulent reviewers by learning a novel M-SMOTE model over a derived balanced dataset and feature distributions, which outperforms other ML models. Our study contributes to the literature on digital platforms and fraudulent review detection, with significant managerial and theoretical implications arising from these novel findings.
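As a rough illustration of steps (1) and (4), the sketch below builds a small feature vector per review and then oversamples the minority (fraudulent) class by interpolation. Everything in it is an assumption made for illustration: the function names `review_features` and `smote`, the three chosen features, and the toy reviews are hypothetical, and the oversampling shown is the classic SMOTE idea rather than the authors' M-SMOTE, whose details the abstract does not give.

```python
import math
import random


def review_features(text, rating, reviewer_ratings):
    """Hypothetical feature vector for one review (not the paper's feature set).

    Review-centric: word count and share of exclamation marks in the text.
    Reviewer-centric: deviation of this rating from the reviewer's mean rating.
    """
    words = text.split()
    exclaim_share = text.count("!") / max(len(text), 1)
    mean_rating = sum(reviewer_ratings) / len(reviewer_ratings)
    return [float(len(words)), exclaim_share, abs(rating - mean_rating)]


def smote(minority, n_new, k=2, seed=0):
    """Classic SMOTE-style oversampling (assumed stand-in for M-SMOTE).

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Toy minority class: three made-up reviews labelled as fraudulent.
fraud = [
    review_features("Best product ever!!! Buy now!!!", 5, [5, 5, 5]),
    review_features("Amazing!!! Perfect!!!", 5, [5, 5, 5, 5]),
    review_features("Terrible, avoid this seller!!!", 1, [1, 1, 5]),
]
extra = smote(fraud, n_new=4)  # four synthetic fraudulent samples
```

In practice the synthetic samples would be appended to the training set before fitting a classifier, so that the fraudulent class is no longer heavily outnumbered by genuine reviews.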
Keywords: Online reviews; Digital platforms; Review manipulation; Machine learning; Opinion spamming; Feature engineering
Date: 2022-04-01
Note: View the original document on HAL open archive server: https://hal.science/hal-03630420v1
Citations: 12 (in EconPapers)
Published in Decision Support Systems, 2022, 155, ⟨10.1016/j.dss.2021.113728⟩
Downloads: (external link)
https://hal.science/hal-03630420v1/document (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-03630420
DOI: 10.1016/j.dss.2021.113728