A Comprehensive Analysis of ‘Machine Learning and Deep Learning’ Methods for Sentiment Analysis in Twitter

The present research is meant to assist in making the best possible optimization in sentiment analysis for Twitter information data gathered using scrape and Tweepy. For that, various preprocessing methods are thoroughly compared with each other in terms of their effect when the dataset is as-is, ov...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN computer science 2024-09, Vol.5 (7), p.915, Article 915
Hauptverfasser: Shetty, Poornima, Kini, Shashidhar, Fernandes, Roshan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The present research is meant to assist in making the best possible optimization in sentiment analysis for Twitter information data gathered using scrape and Tweepy. For that, various preprocessing methods are thoroughly compared with each other in terms of their effect when the dataset is as-is, oversampled, and under-sampling. Moving on from pre-processing, we applied two feature extraction techniques: CountVectorizer and TF-IDF. We want to find out the preprocessing and feature extraction combination that works best in sentiment classification. They were after that applied to performance evaluation, undertaking a comparative performance evaluation role using machine learning and deep learning techniques. These were the Support Vector Machines and the Long Short-Term Memory. From the results obtained, this study established that the classification ensemble of oversampling, CountVectorizer, and SVM was the highest at 93.3% and attributed this to the fact that this combination. Further, the performance of the LSTM model, when applied to this pre-processed data, shows a distinct accuracy of 87.7%.Oversampling with the CountVectorizer led to the best improvement in performance from the SVM classifier. This, therefore, invokes the development of more robust frameworks for analysing sentiment in social media data, which has more value to areas beyond which it is applied to public opinion mining and social media analytics.
ISSN:2661-8907
2662-995X
2661-8907
DOI:10.1007/s42979-024-03216-2