Opinion mining: a hybrid framework based on lexicon and machine learning approaches

Sentiment analysis is a practical technique that allows businesses, researchers, governments, politicians, and organizations to know about people's sentiments, which play an important role in decision-making processes. Sentiment classification techniques are mainly divided into lexicon-based me...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International journal of computers & applications 2021-09, Vol.43 (8), p.786-794
Hauptverfasser:	Elsaid Moussa, Mohammed, Hussein Mohamed, Ensaf, Hassan Haggag, Mohamed
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Data mining Decision making Feature extraction hybrid technique Machine learning Model accuracy Natural language processing Sentiment analysis
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Sentiment analysis is a practical technique that allows businesses, researchers, governments, politicians, and organizations to know about people's sentiments, which play an important role in decision-making processes. Sentiment classification techniques are mainly divided into lexicon-based methods, machine learning methods, and hybrid methods. There are limitations in each approach; Traditional machine learning approaches are based on complex features extraction process, and lexicon-based approaches suffer from scalability and are limited by unreliable sentiment lexicons that are commonly created manually by experts. In this paper, we seek to improve the performance of machine learning techniques by integrating it with our enhanced lexicon based technique Sum-of-Votes Model. Sum-of-Votes model is a generic extendable lexicon based model that beats traditional lexicon based models in accuracy and provides good solutions to previous challenges and drawbacks such as scalability, domain dependency, and unreliability, but its accuracy was 81.62%. So, in this paper we proposed a novel framework based upon both Sum-of-Votes and Bag-of-Words models; we applied them, then their outputs were fed as features to Machine Learning Classifiers. We got higher accuracy than all the individual lexicons and the entire old framework.
ISSN:	1206-212X 1925-7074
DOI:	10.1080/1206212X.2019.1615250