A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification
Published in: | Expert systems with applications 2016-11, Vol.62, p.1-16 |
---|---|
Main authors: | , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Summary: | • A novel multi-objective differential evolution algorithm based classifier ensemble for text sentiment classification. • An empirical comparison of weighted and unweighted voting schemes. • Extensive empirical analysis on metaheuristic based voting schemes for sentiment analysis. • High classification accuracies for text sentiment classification (98.86% for Laptop dataset).
Typically performed by supervised machine learning algorithms, sentiment analysis is highly useful for extracting subjective information from text documents online. Most approaches that use ensemble learning paradigms toward sentiment analysis involve feature engineering in order to enhance the predictive performance. In response, we sought to develop a paradigm of a multiobjective, optimization-based weighted voting scheme to assign appropriate weight values to classifiers and each output class based on the predictive performance of classification algorithms, all to enhance the predictive performance of sentiment classification. The proposed ensemble method is based on static classifier selection involving majority voting error and forward search, as well as a multiobjective differential evolution algorithm. Based on the static classifier selection scheme, our proposed ensemble method incorporates Bayesian logistic regression, naïve Bayes, linear discriminant analysis, logistic regression, and support vector machines as base learners, whose performance in terms of precision and recall values determines weight adjustment. Our experimental analysis of classification tasks, including sentiment analysis, software defect prediction, credit risk modeling, spam filtering, and semantic mapping, suggests that the proposed classification scheme can predict better than conventional ensemble learning methods such as AdaBoost, bagging, random subspace, and majority voting. Of all datasets examined, the laptop dataset showed the best classification accuracy (98.86%). |
---|---|
ISSN: | 0957-4174, 1873-6793 |
DOI: | 10.1016/j.eswa.2016.06.005 |
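
The summary above describes a weighted voting scheme in which each base classifier receives a separate weight for each output class. As a rough illustration of that idea only, the following Python sketch combines the labels predicted by three classifiers using per-classifier, per-class weights. It is not the authors' implementation: the function name `weighted_vote`, the classifier names, and all weight values are hypothetical and hand-picked. In the article the weights are tuned by a multiobjective differential evolution algorithm, which is not shown here.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Return the label with the largest total weight.

    predictions: dict mapping classifier name -> predicted label
    weights: dict mapping (classifier name, label) -> non-negative weight
    Both structures are illustrative assumptions, not taken from the article.
    """
    scores = defaultdict(float)
    for clf, label in predictions.items():
        # Each classifier adds the weight it holds for the label it predicted.
        scores[label] += weights[(clf, label)]
    # Iterating over sorted labels makes tie-breaking deterministic.
    return max(sorted(scores), key=lambda label: scores[label])


# Hypothetical predictions from three base learners for one document.
predictions = {"naive_bayes": "pos", "logistic": "neg", "svm": "neg"}

# Hypothetical per-classifier, per-class weights (hand-picked for the example).
weights = {
    ("naive_bayes", "pos"): 0.9, ("naive_bayes", "neg"): 0.4,
    ("logistic", "pos"): 0.5, ("logistic", "neg"): 0.3,
    ("svm", "pos"): 0.6, ("svm", "neg"): 0.4,
}

# Unweighted majority voting is the special case where every weight is 1.
uniform = {key: 1.0 for key in weights}

weighted_label = weighted_vote(predictions, weights)
majority_label = weighted_vote(predictions, uniform)
print(weighted_label, majority_label)
```

With these made-up numbers, the single classifier that is heavily trusted on the positive class outweighs the two classifiers that are weakly trusted on the negative class, so the weighted vote and the unweighted majority vote disagree. That difference is what the comparison of weighted and unweighted voting schemes in the summary is concerned with.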