Feature selection based on genetic algorithm and hybrid model for sentiment polarity classification

P. Kalaivani and K.L. Shunmuganathan

International Journal of Data Mining, Modelling and Management, 2016, vol. 8, issue 4, 315-329

Abstract: Sentiment classification aims to determine the polarity of product or user reviews. Supervised machine learning algorithms such as naive Bayes, K-nearest neighbour (KNN), decision trees, maximum entropy, hidden Markov models and support vector machines are used for opinion mining. KNN is a simple but less efficient classification algorithm. In this paper, we propose an improved KNN algorithm: an optimised feature selection method, based on a genetic algorithm that incorporates information gain, is combined with the bagging technique and KNN to improve the accuracy of sentiment classification. Specifically, we compared two approaches and traditional KNN for sentiment classification of movie reviews and product reviews. The same approach has been applied to other machine learning algorithms, such as support vector machines and naive Bayes, and the results are compared with a POS-based feature set method. Experimental results show that information gain and the genetic algorithm combined with bagging yield higher performance, with an accuracy of 87.50% on the movie reviews, and better accuracy, precision and recall for movie, DVD, electronics and kitchen reviews.
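
The pipeline described in the abstract (information gain, genetic-algorithm feature selection, bagging, KNN) can be illustrated with a minimal sketch. The full text is not available here, so this is not the authors' implementation: the toy data, the Hamming distance, the leave-one-out fitness, the truncation selection, k=3 and all function names are assumptions chosen only for illustration.

```python
import math
import random
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(X, y, j):
    """Entropy reduction obtained by splitting the labels on binary feature j."""
    gain = entropy(y)
    for value in (0, 1):
        subset = [label for row, label in zip(X, y) if row[j] == value]
        if subset:
            gain -= len(subset) / len(y) * entropy(subset)
    return gain

def knn_predict(X, y, query, mask, k=3):
    """Majority vote of the k nearest rows (Hamming distance on masked features)."""
    cols = [j for j, keep in enumerate(mask) if keep]
    order = sorted(range(len(X)),
                   key=lambda i: sum(X[i][j] != query[j] for j in cols))
    return Counter(y[i] for i in order[:k]).most_common(1)[0][0]

def loo_accuracy(X, y, mask, k=3):
    """Leave-one-out KNN accuracy, used here as the GA fitness (an assumption)."""
    if not any(mask):
        return 0.0
    hits = 0
    for i in range(len(X)):
        rest_X, rest_y = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        hits += knn_predict(rest_X, rest_y, X[i], mask, k) == y[i]
    return hits / len(X)

def ga_select(X, y, candidates, rng, pop_size=12, generations=15, p_mut=0.1):
    """Evolve a bit mask; only the IG-ranked candidate features may be switched on."""
    n = len(X[0])
    fitness = lambda mask: loo_accuracy(X, y, mask)
    pop = [[int(j in candidates and rng.random() < 0.5) for j in range(n)]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[:pop_size // 2]          # truncation selection, keeps the best
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)          # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ 1 if (j in candidates and rng.random() < p_mut) else bit
                     for j, bit in enumerate(child)]
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

def bagged_knn_predict(X, y, query, mask, rng, n_bags=11, k=3):
    """Vote over KNN classifiers, each fitted on a bootstrap resample of the data."""
    votes = []
    for _ in range(n_bags):
        idx = [rng.randrange(len(X)) for _ in X]
        votes.append(knn_predict([X[i] for i in idx], [y[i] for i in idx],
                                 query, mask, k))
    return Counter(votes).most_common(1)[0][0]

# Hypothetical toy corpus: term presence for
# ["good", "great", "bad", "boring", "movie", "the"].
X = [[1, 1, 0, 0, 1, 1], [1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 1, 0], [0, 1, 0, 0, 0, 0],
     [0, 0, 1, 1, 1, 1], [0, 0, 1, 0, 0, 1], [0, 0, 0, 1, 1, 0], [0, 0, 1, 1, 0, 0]]
y = ["pos"] * 4 + ["neg"] * 4

rng = random.Random(0)
ranked = sorted(range(len(X[0])), key=lambda j: information_gain(X, y, j), reverse=True)
candidates = set(ranked[:4])                   # IG pre-filter before the GA search
best_mask = ga_select(X, y, candidates, rng)
prediction = bagged_knn_predict(X, y, [1, 0, 0, 0, 1, 1], best_mask, rng)
print(best_mask, prediction)
```

In this sketch information gain acts as a cheap pre-filter that shrinks the search space, the genetic algorithm then searches subsets of the surviving features, and bagging reduces the variance of the final KNN vote. How the paper actually combines these steps may differ.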

Keywords: sentiment classification; supervised machine learning; feature selection; genetic algorithms; product reviews; user reviews; movie reviews; film reviews; information gain; bagging; opinion mining; K-nearest neighbour; kNN; support vector machines; SVM; naive Bayes; electronics reviews; DVD reviews; kitchen reviews.
Date: 2016

Downloads: (external link)
http://www.inderscience.com/link.php?id=81242 (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdmmm:v:8:y:2016:i:4:p:315-329

More articles in International Journal of Data Mining, Modelling and Management from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker.

 
Page updated 2025-03-19
Handle: RePEc:ids:ijdmmm:v:8:y:2016:i:4:p:315-329