Mapreduce framework based sentiment analysis of twitter data using hierarchical attention network with chronological leader algorithm

Sentiment analysis (SA) or opinion mining is a general dialogue preparation chore that intends to discover sentiments behind the opinions in texts on changeable subjects. Recently, researchers in an area of SA have been considered for assessing opinions on diverse themes like commercial products, ev...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Social network analysis and mining 2024-08, Vol.14 (1), p.172
Hauptverfasser: Jagdale, Jayashree, Sreemathy, R, Jagdale, Balaso, Ghag, Kranti
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Sentiment analysis (SA) or opinion mining is a general dialogue preparation chore that intends to discover sentiments behind the opinions in texts on changeable subjects. Recently, researchers in an area of SA have been considered for assessing opinions on diverse themes like commercial products, everyday social problems and so on. Twitter is a region, wherein tweets express opinions, and acquire an overall knowledge of unstructured data. This process is more time-consuming and the accuracy needs to be improved. Here, the Chronological Leader Algorithm Hierarchical Attention Network (CLA_HAN) is presented for SA of Twitter data. Firstly, the input Twitter data concerned is subjected to a data partitioning phase. The data partitioning of input Tweets are conducted by Deep Embedded Clustering (DEC). Thereafter, partitioned data is subjected to MapReduce framework, which comprises of mapper and reducer phase. In the mapper phase, Bidirectional Encoder Representations from Transformers (BERT) tokenization and feature extraction are accomplished. In the reducer phase, feature fusion is carried out by Deep Neural Network (DNN) whereas SA of Twitter data is executed utilizing a Hierarchical Attention Network (HAN). Moreover, HAN is tuned by CLA which is the integration of chronological concept with the Mutated Leader Algorithm (MLA). Furthermore, CLA_HAN acquired maximal values of f-measure, precision and recall about 90.6%, 90.7% and 90.3%.
ISSN:1869-5450
1869-5469
DOI:10.1007/s13278-024-01293-y