A hybrid approach for building a Vietnamese sentiment dictionary

Sentiment (or emotional) dictionaries play a key role and are “the materials” for sentiment analysis applications, especially in Vietnamese where the corpus is not yet available. In this research, we propose a sentiment dictionary for Vietnamese, known as VNSD, which adopts machine translation, Logi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of intelligent & fuzzy systems 2018-01, Vol.35 (1), p.967-978
Hauptverfasser: Tran, Thien Khai, Phan, Tuoi Thi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Sentiment (or emotional) dictionaries play a key role and are “the materials” for sentiment analysis applications, especially in Vietnamese where the corpus is not yet available. In this research, we propose a sentiment dictionary for Vietnamese, known as VNSD, which adopts machine translation, Logistic Regression, “double propagation,” and fuzzy rules. The sentiment dictionary counts approximately 5,000 adjectives, 2,000 verbs, 300 nouns, and more than 200 adverbs, which lead to a combination of more than 100,000 sentiment phrases covering the Vietnamese emotional lexicons. We found that a hybrid approach offers better performance compared to stand-alone data-mining or natural language processing techniques. The key contribution of VNSD that distinguishes it from related work is a deep investment in exploiting Vietnamese linguistic characteristics to propose suitable rules for computing the sentiment scores of Vietnamese text phrases.
ISSN:1064-1246
1875-8967
DOI:10.3233/JIFS-172053