Novel cluster set optimization model with unique identifier tagging for twitter data analysis
Blogs, internet forums, social networks, and micro-blogging sites are some of the growing number of places where users can voice their opinions. Opinions on any given product, issue, service, or idea are contained in data, making them a valuable resource in their own right. Popular social networking...
Gespeichert in:
Veröffentlicht in: | Journal of intelligent & fuzzy systems 2023-01, Vol.44 (2), p.2031-2039 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Blogs, internet forums, social networks, and micro-blogging sites are some of the growing number of places where users can voice their opinions. Opinions on any given product, issue, service, or idea are contained in data, making them a valuable resource in their own right. Popular social networking services like Twitter, Facebook, and Google+ allows expressing views on a variety of topics, participating in discussions, or sending messages to a global user. Twitter sentiment analysis has received a lot of attention recently.Sentiment analysis is finding how a person feels about a topic from their written response about it and it can be separated into positive and negative through its use. Doing so enables to classify the tweets made by a user in to appropriate classification category based on which some decisions can be made. The literature proposed approaches to develop the classifiers on the Twitter datasets. Operations, including tokenization, stop-word removal, and stemming will be performed. NLP converts the text to a machine-readable representation. Artificial Intelligence (AI) combines NLP data to evaluate if a situation is positive or negative. The document’s subjectivity can be identified using ML and NLP techniques to categorize them in to positive, neutral, or negative. Performing sentiment analysis in Twitter data can be tedious due to limited size, unstructured nature, misspellings, slang, and abbreviations. For this task, a Tweet Analyzing Model for Cluster Set Optimization with Unique Identifier Tagging (TAM-CSO-UIT) was built using prospects to determine positive or negative sentiment in tweets obtained from Twitter. This approach assigns a +ve/-ve value to each entry in the Tweet database based on probability assignment using n-gram model. To perform this effectively the tweet dataset is considered as a sliding window of length L. The proposed model accurately analyses and classifies the tweets. |
---|---|
ISSN: | 1064-1246 1875-8967 |
DOI: | 10.3233/JIFS-220033 |