Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias

Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the l...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	PloS one 2018-08, Vol.13 (8), p.e0202523-e0202523
Hauptverfasser:	Han, Hongyu, Zhang, Yongshi, Zhang, Jianpei, Yang, Jing, Zou, Xiaomei
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Attitude Bias Classification Computer and Information Sciences Data mining Engineering and Technology Humans Investment advisors Language Linguistics Online information services Performance enhancement Physical Sciences Polarity Psychological aspects Research and Analysis Methods Semantics Sentiment analysis Sentimentality Social media Social Media - trends Social networks Social Sciences Teaching methods User generated content Weight
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Sentiment analysis is widely studied to extract opinions from user generated content (UGC), and various methods have been proposed in recent literature. However, these methods are likely to introduce sentiment bias, and the classification results tend to be positive or negative, especially for the lexicon-based sentiment classification methods. The existence of sentiment bias leads to poor performance of sentiment analysis. To deal with this problem, we propose a novel sentiment bias processing strategy which can be applied to the lexicon-based sentiment analysis method. Weight and threshold parameters learned from a small training set are introduced into the lexicon-based sentiment scoring formula, and then the formula is used to classify the reviews. In this paper, a completed sentiment classification framework is proposed. SentiWordNet (SWN) is used as the experimental sentiment lexicon, and review data of four products collected from Amazon are used as the experimental datasets. Experimental results show that the bias processing strategy reduces polarity bias rate (PBR) and improves performance of the lexicon-based sentiment analysis method.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0202523